[go: up one dir, main page]

WO2024125342A1 - Adapter containing blocking elements used in combination, construct, method and use - Google Patents

Adapter containing blocking elements used in combination, construct, method and use Download PDF

Info

Publication number
WO2024125342A1
WO2024125342A1 PCT/CN2023/136344 CN2023136344W WO2024125342A1 WO 2024125342 A1 WO2024125342 A1 WO 2024125342A1 CN 2023136344 W CN2023136344 W CN 2023136344W WO 2024125342 A1 WO2024125342 A1 WO 2024125342A1
Authority
WO
WIPO (PCT)
Prior art keywords
polynucleotide
blocking
analyte
binding protein
elements
Prior art date
Application number
PCT/CN2023/136344
Other languages
French (fr)
Chinese (zh)
Inventor
刘先宇
刘铮
刘少伟
王玉博
罗江燕
Original Assignee
成都齐碳科技有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 成都齐碳科技有限公司 filed Critical 成都齐碳科技有限公司
Publication of WO2024125342A1 publication Critical patent/WO2024125342A1/en

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6869Methods for sequencing

Definitions

  • the present application relates to the technical field of nanopore sequencing, and in particular to adaptors, constructs, methods and applications comprising a combined blocking element for characterizing analytes.
  • Analytes such as polynucleotides (such as RNA or DNA) sequencing technologies are widely used.
  • the sequencing technologies for polynucleotides, especially DNA, are also constantly being updated.
  • NGS next generation sequencing
  • the fourth-generation nanopore sequencing refers to the situation where when a DNA molecule passes through a nanopore driven by electrophoresis, the different sizes and shapes of each base in the DNA molecule will cause characteristic current changes. Therefore, the base type and arrangement order of the DNA molecule can be determined by electron conduction detection, thus achieving single-molecule sequencing.
  • the polynucleotide binding protein when no potential is applied, the polynucleotide binding protein is usually required to be stagnant on the polynucleotide to prevent the polynucleotide binding protein from moving further along the polynucleotide.
  • the stagnant polynucleotide binding protein can be moved along the polynucleotide sequence to be tested, thereby achieving the purpose of sequencing.
  • single-stranded nucleic acids formed by abasic groups such as iSp18 or iSpC9 are usually used as spacers to inhibit the translocation function of polynucleotide binding proteins, thereby achieving the effect of stagnating polynucleotide binding proteins, and then the polynucleotide binding proteins are allowed to cross the spacers through the electric field force to start normal sequencing.
  • the spacers that can effectively stagnate polynucleotide binding proteins in the prior art are very limited, and some groups as spacers will cause the polynucleotide binding proteins to have a high shedding rate or low data throughput.
  • a first aspect of the present invention relates to an adaptor for characterizing an analyte, the adaptor comprising one or more first blocking elements and one or more second blocking elements different from the first blocking elements, and after the adaptor is contacted with a polynucleotide binding protein, the one or more first blocking elements and the one or more second blocking elements can act in combination to stop The polynucleotide binding protein is retained.
  • the linker comprises a complementary main chain and a blocking chain, and the one or more first blocking elements are covalently connected to the main chain; the one or more second blocking elements are covalently modified to the main chain or the blocking chain or interact with the main chain and/or the blocking chain by non-covalent bonds.
  • the one or more second blocking elements are complementary to the blocking chain or the main chain, wherein, when the one or more second blocking elements are modified to the main chain by covalent bonds, the one or more second blocking elements are complementary to the blocking chain; when the one or more second blocking elements are modified to the blocking chain by covalent bonds, the one or more second blocking elements are complementary to the main chain; when the one or more second blocking elements interact with the main chain and/or the blocking chain by non-covalent bonds, the second blocking elements may be complementary to the blocking chain or the main chain.
  • the one or more first retarding elements have a structure different from that of nucleotides; and the one or more second retarding elements have a structure for improving double-strand stability.
  • the first blocking element comprises one or more selected from the group consisting of organic oligocations, iSpC3, iSp18, iSp9, nitroindole, inosine, acridine, 2-aminopurine, 2-6-diaminopurine, 5-bromo-deoxyuracil, inverted thymidine (inverted dT), inverted dideoxythymidine (ddT), dideoxycytidine (ddC), 5-methylcytidylic acid, 5-hydroxymethylcytidine, 2'-O-methyl RNA base, isodeoxycytidine (iso-dC), isodeoxyguanosine (iso-dG), a photocleavable (PC) group or hexanediol; the second blocking element comprises one or more selected from the group consisting of nucleotides modified with locked nucleic acid (LNA), peptide nucleic acid (PNA), me
  • R 4 is a C1-C5 lower alkylene group
  • R 5 and R 6 are the same or different C1-C5 lower alkylene groups
  • X 1 is a putrescine, spermidine or spermine residue
  • the organic oligocation is selected from spermine (Sp).
  • the first blocking element is not complementary to the second blocking element.
  • the adapter further comprises a third strand, which is partially complementary to the main strand; more preferably, the main strand is partially complementary to the blocking strand and the complementary ends of the main strand and the blocking strand are used to directly or indirectly connect to the analyte.
  • the analyte is selected from a polynucleotide, a polypeptide, a lipid or a polysaccharide, preferably a polynucleotide, which is a fully double-stranded polynucleotide, a partially double-stranded polynucleotide or a single-stranded polynucleotide.
  • the polynucleotide binding protein is derived from a polynucleotide processing enzyme; the polynucleotide processing enzyme is selected from a polymerase, a helicase or a nuclease exonuclease. More preferably, the helicase is selected from He1308 helicase, RecD helicase, XPD helicase, Dda Helicase or ED1 helicase.
  • the second aspect of the present invention relates to a construct for characterizing an analyte, the construct comprising the analyte and the adaptor of the first aspect of the present invention, wherein the adaptor is directly or indirectly linked to either or both ends of the analyte.
  • the third aspect of the present invention relates to a complex for characterizing an analyte, which comprises a polynucleotide binding protein and a linker of the first aspect of the present invention or a construct of the second aspect of the present invention; wherein the polynucleotide binding protein is arrested on the linker under the combined action of the first blocking element and the second blocking element.
  • the polynucleotide binding protein is derived from a polynucleotide handling enzyme; the polynucleotide handling enzyme is selected from a polymerase, a helicase or an exonuclease. More preferably, the helicase is selected from a He1308 helicase, a RecD helicase, a XPD helicase, a Dda helicase or an ED1 helicase.
  • a fourth aspect of the present invention relates to a method for controlling the loading of a polynucleotide binding protein on an analyte, the method comprising:
  • the polynucleotide binding protein is loaded onto a linker, wherein the linker comprises one or more first blocking elements and one or more second blocking elements different from the first blocking elements, and the polynucleotide binding protein is arrested on the linker under the combined action of the one or more first blocking elements and the one or more second blocking elements; and the linker loaded with the polynucleotide binding protein is connected to the analyte.
  • the adapter is as defined in the first aspect of the present invention.
  • the polynucleotide binding protein is derived from a polynucleotide handling enzyme; the polynucleotide handling enzyme is selected from a polymerase, a helicase or an exonuclease. More preferably, the helicase is selected from a He1308 helicase, a RecD helicase, a XPD helicase, a Dda helicase or an ED1 helicase.
  • a fifth aspect of the invention relates to a method of controlling the movement of an analyte through a transmembrane pore, the method comprising:
  • step (b) contacting the analyte loaded with the polynucleotide binding protein provided in step (a) with the transmembrane pore;
  • the second blocking element comprises covalently modified nucleotides or non-covalently modified nucleotides; when the polynucleotide binding protein moves through the one or more second blocking elements, the polynucleotide binding protein moves through the nucleotides of the one or more second blocking elements without passing through the covalent or non-covalent modifications of the one or more second blocking elements.
  • the method includes providing a tether for bringing the analyte loaded with the polynucleotide binding protein close to the transmembrane pore; the tether includes a capture region and an anchor region, the capture region is used to capture the adapter, and the anchor region is used to anchor and bind to the transmembrane pore or the membrane where the transmembrane pore is located.
  • a sixth aspect of the present invention relates to a method for characterizing an analyte, the method comprising:
  • the transmembrane pore is a protein pore or a solid-state pore, and/or the membrane is an amphiphilic layer or a solid-state layer.
  • the analyte is selected from a polynucleotide, a polypeptide, a lipid or a polysaccharide, preferably a polynucleotide, which is a fully double-stranded polynucleotide, a partially double-stranded polynucleotide or a single-stranded polynucleotide.
  • a seventh aspect of the present invention relates to a kit for characterizing an analyte, the kit comprising:
  • the polynucleotide binding protein is derived from a polynucleotide handling enzyme; the polynucleotide handling enzyme is selected from a polymerase, a helicase or an exonuclease. More preferably, the helicase is selected from a He1308 helicase, a RecD helicase, a XPD helicase, a Dda helicase or an ED1 helicase.
  • the transmembrane pore is a protein pore or a solid-state pore.
  • R 4 is a C1-C5 lower alkylene group
  • R 5 and R 6 are the same or different C1-C5 lower alkylene groups
  • X 1 is a putrescine, spermidine or spermine residue
  • the organic oligocation is selected from spermine.
  • the ninth aspect of the present invention relates to the use of the linker of the first aspect of the present invention, the construct of the second aspect of the present invention, the complex of the third aspect of the present invention, the methods of the third to sixth aspects of the present invention, or the kit of the seventh aspect of the present invention in the preparation of a product for characterizing an analyte or in characterizing an analyte.
  • the present invention uses more than one blocking element to jointly act to arrest the enzyme, further expanding the application range of the spacer region, so that some blocking elements cannot play a good blocking effect or even have no blocking effect when used alone, but can achieve a good blocking effect when used in combination with another blocking element.
  • the prior art uses a variety of spacers, such as single-stranded nucleic acids formed by a base-free group such as iSp18 or iSpC9 as spacers.
  • spacers such as single-stranded nucleic acids formed by a base-free group such as iSp18 or iSpC9 as spacers.
  • the electric field force causes the enzyme to cross the spacer, the blocking element will always exist and will not disappear. Therefore, this spacer is only suitable for entry sequencing and cannot be used for outry sequencing.
  • the combined blocking structure formed by the first blocking element and the second blocking element will be destroyed or eliminated as the sequence passes through the nanopore, so it can be Further expand its application, so that it can be used not only for internal push sequencing, but also for external pull sequencing.
  • Internal push sequencing refers to a method in which the direction of the electric field force is the same as the direction of the enzyme movement, so that the enzyme passes through and then the analyte is sequenced
  • exital pull sequencing refers to a method in which the direction of the electric field force is opposite to the direction of the enzyme movement, so that the enzyme passes through and then the analyte is sequenced.
  • Figure 1 shows a schematic diagram of a Y adapter containing two blocking elements and an enzyme passing through the blocking element 1 according to an embodiment.
  • the labels are as follows: (Y1) the main strand containing the blocking element 1; (Y2) the third strand, also known as the fluorescent strand; (B) the blocking strand containing the blocking element 2.
  • Part of the sequence Y1 forms a complementary double-stranded region with Y2, part of the sequence Y1 is complementary to the B part, and the nucleotides of the blocking element 2 on B are complementary to Y1.
  • Fig. 2 shows a schematic diagram of the exemplary structures of various Y adapters containing two blocking elements.
  • the combination relationship of the two blocking elements is blocking element 1 + blocking element 2, blocking element 1 + blocking element 2 + blocking element 1, blocking element 1 + blocking element 2 + blocking element 1 + blocking element 2, and blocking element 2 + blocking element 1.
  • SEQ ID NO: 1 shows the main chain Y1 of the Y linker:
  • SEQ ID NO: 2 shows the main chain Y1-3C3 containing 3C3 of the Y linker:
  • SEQ ID NO: 3 shows the main chain Y1-5C3 containing 5C3 of the Y linker:
  • SEQ ID NO: 4 shows the main chain Y1-5C3-cPIP containing 5C3 of the Y linker:
  • SEQ ID NO:5 shows the main chain Y1-Sp containing Sp of the Y linker:
  • SEQ ID NO:6 shows the main chain Y1-2Sp containing 2Sp of the Y linker:
  • SEQ ID NO:7 shows the main chain Y1-4C18 containing 4C18 of the Y linker:
  • SEQ ID NO: 8 shows the fluorescent chain Y2 of the Y linker, and the 3' end of the sequence is connected to the fluorescent group CY5:
  • SEQ ID NO:9 shows the blocking strand B of the Y linker, in which some nucleotides are modified with OMe to prevent them from binding to the helicase:
  • SEQ ID NO: 10 shows the blocking strand B-LNA of the Y linker comprising LNA-modified nucleotides:
  • SEQ ID NO: 11 shows the blocking strand B-PNA of the Y linker comprising a PNA-modified nucleotide:
  • SEQ ID NO: 12 shows the blocking strand sequence B-cPIP of the Y linker:
  • SEQ ID NO: 13 shows a tether with cholesterol attached to the 5' end of the chain:
  • SEQ ID NO: 14 shows the wild-type sequence of helicase E1:
  • SEQ ID NO: 15 shows the wild-type sequence of helicase E2:
  • the present invention provides an adapter for characterizing nucleic acids, wherein the adapter is capable of connecting an analyte.
  • An adapter is also referred to as a joint, an adapter, etc.
  • the adapter of the present invention comprises one or more first retarding elements and one or more second retarding elements, and the first retarding element and the second retarding element are completely different in structure and composition.
  • the adapter comprises a complementary, preferably partially complementary main chain and a blocking chain, and both the main chain and the blocking chain are single-stranded structures formed by connecting nucleotides, nucleotide analogs or modified nucleotides, or single-stranded structures in which some nucleotides are modified.
  • first retarding elements such as 1, 2, 3, 4, 5, 6, 7, 8 and, 9, 10 or more first retarding elements are covalently connected to the main chain.
  • second blocking elements such as 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more second blocking elements are modified to the main chain or the blocking chain with a covalent bond (such as LNA, PNA, etc.) or interact with the main chain, the blocking chain or the double chain formed by the main chain and the blocking chain with a non-covalent bond (such as cPIP, etc.).
  • the one or more second retarding elements are complementary to the blocking chain; when one or more second retarding elements are modified to the blocking chain by covalent bonds, the one or more second retarding elements are complementary to the main chain; when one or more second retarding elements interact with the main chain and/or the blocking chain by non-covalent bonds, the second retarding elements may be complementary to the blocking chain or the main chain.
  • the second blocking element comprises a nucleotide modified by a modifier, and when the nucleotide portion of one or more second blocking elements is located on the main chain and the modifier portion interacts with the main chain and/or the blocking chain by a non-covalent bond, the second blocking element is complementary to the blocking chain; when the nucleotide portion of one or more second blocking elements is located on the blocking chain and the modifier portion interacts with the main chain and/or the blocking chain by a non-covalent bond, the second blocking element is complementary to the main chain.
  • the second blocking element comprises nucleotides modified with a modifier.
  • the phrase "the second blocking element is located on the main strand or the blocking strand” herein means that the nucleotides comprised by the second blocking element are connected to the main strand or the blocking strand, rather than limiting the position of the modifier.
  • the one or more first retarding elements may not bind to the one or more second retarding elements in a complementary base pairing manner.
  • the one or more second retarding elements may bind to the barrier strand or the main strand in a complementary base pairing manner.
  • the one or more first blocking elements and the one or more second blocking elements may be disposed adjacent to each other or spaced apart from each other.
  • one or more first retarding elements are disposed adjacent to one or more second retarding elements.
  • at least one first retarding element is disposed adjacent to at least one second retarding element or to a segment complementary to at least one second retarding element, and no group exists between the two.
  • one or more second blocking elements are disposed adjacent to each other.
  • One or more first blocking elements may be disposed adjacent to each other, or adjacent to one or more second blocking elements, or adjacent to a segment on the main chain that is complementary to a second blocking element.
  • the meaning of “the first blocking element is arranged in close proximity” in this article is that there is no
  • the meaning of “the first blocking element and the second blocking element are arranged in close proximity” as described herein refers to that there is no gap formed by any group between the first blocking element on the main chain and the second blocking element located on the main chain, or between the first blocking element on the main chain and the segment on the main chain that is complementary to the second blocking element located on the blocking chain, but they are directly connected together.
  • the meaning of “the second blocking element is arranged in close proximity” as described herein refers to that there is no gap formed by any group between one or more second blocking elements located on the main chain or the blocking chain, but they are directly connected together.
  • one or more first retarding elements are spaced apart from one or more second retarding elements.
  • one or more groups exist between at least one first retarding element and at least one second retarding element or a segment complementary to at least one second retarding element.
  • one or more second blocking elements are spaced apart.
  • One or more first blocking elements may be spaced apart, or spaced apart from one or more second blocking elements, or spaced apart from a segment on the main chain that is complementary to the second blocking element.
  • the meaning of “the first retarding element is arranged at intervals” as described herein refers to the presence of one or more groups, such as 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more groups, between one or more first retarding elements on the main chain.
  • the meaning of “the first retarding element and the second retarding element are arranged at intervals” as described herein refers to the presence of one or more groups, such as 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more groups, between the first retarding element on the main chain and the second retarding element located on the main chain, or between the first retarding element on the main chain and the segment on the main chain that is complementary to the second retarding element located on the blocking chain.
  • the meaning of "the second retarding element is arranged at intervals" as described herein refers to the presence of one or more groups, such as 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more groups, between one or more second retarding elements located on the main chain or the blocking chain.
  • the adapter includes 1 first retarding element and 1 second retarding element that are arranged adjacent to or spaced in the direction of movement of the polynucleotide binding protein. In some embodiments, the adapter includes 1 first retarding element, 1 second retarding element and 1 first retarding element that are arranged adjacent to or spaced in the direction of movement of the polynucleotide binding protein. In some embodiments, the adapter includes 1 first retarding element, 1 second retarding element, 1 first retarding element and 1 second retarding element that are arranged adjacent to or spaced in the direction of movement of the polynucleotide binding protein.
  • the adapter includes 1 second retarding element and 1 first retarding element that are arranged adjacent to or spaced in the direction of movement of the polynucleotide binding protein.
  • the direction of movement of the polynucleotide binding protein is usually 5'-3', and some polynucleotide binding proteins move in the 3'-5' direction.
  • one or more first blocking elements and one or more second blocking elements can work together to arrest the polynucleotide binding protein.
  • the polynucleotide binding protein can be arrested before the first blocking element or the second blocking element, preferably before the first first blocking element or the first second blocking element.
  • the first blocking element may have any molecule or any combination of molecules that arrest one or more polynucleotide binding proteins, preferably having a structure different from nucleotides or consisting of a structure different from nucleotides. More preferably, the first blocking element has one or more organic oligocations, one or more iSpC3, one or more iSp18, one or more iSp9, one or more nitroindoles, one or more inosine, one or more acridine, one or more 2-aminopurine, one or more 2-6-diaminopurine, one or more 5-bromo-deoxyuracil, one or more inverted thymidine (inverted dT), one or more inverted dideoxythymidine (ddT), one or more dideoxycytidine (ddC), one or more 5-methylcytidylic acid, one or more 5-hydroxymethylcytidine, one or more 2'-O-methyl RNA bases
  • R 4 is a C1-C5 lower alkylene group
  • R 5 and R 6 are the same or different C1-C5 lower alkylene groups
  • X 1 is a putrescine, spermidine or spermine residue
  • R 7 is C1 ⁇ C5 lower alkylene
  • R 8 is C1 ⁇ C5 lower alkylene
  • serine natural amino alcohol
  • (aa) n2 is a peptide containing a natural amino acid having a cationic side chain
  • n2 2 ⁇ 20.
  • the organic oligocation is selected from spermine.
  • the first retarding element comprises 1, 2, 3, 4, 5, 6, 7, 8 or more spermine, iSpC3, iSp18 or iSp9. Most preferably, the first retarding element is 3, 4 or 5 iSpC3, or 1, 2, 3, 4 or 5 spermine.
  • the second blocking element has one or more groups that improve the stability of the double-stranded chain, such as any group or combination of any groups that increases the difficulty of unwinding the double-stranded chain or increases the Tm value of the double-stranded chain, preferably has a modified nucleotide or is composed of a modified nucleotide.
  • the second blocking element comprises one or more nucleotides modified by locked nucleic acid (LNA), one or more nucleotides modified by peptide nucleic acid (PNA), one or more nucleotides modified by methoxy (OMe), one or more nucleotides modified by bicyclic nucleoside (BNA), one or more nucleotides modified by glycerol nucleic acid (GNA), one or more nucleotides modified by threose nucleic acid (TNA), one or more nucleotides modified by cyclic pyrrolimidazole polyamide (cPIP) or one or more nucleotides modified by double-stranded binding protein.
  • LNA locked nucleic acid
  • PNA peptide nucleic acid
  • OMe methoxy
  • BNA bicyclic nucleoside
  • GNA glycerol nucleic acid
  • TAA threose nucleic acid
  • cPIP cyclic pyrrol
  • the second blocking element comprises 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 13, 15, 20, 23, 25 or more LNA-modified nucleotides, PNA-modified nucleotides, OMe-modified nucleotides.
  • the second blocking element is 3, 4, 5 or 6 LNA-modified nucleotides, or 15, 20, 23 or 25 PNA-modified nucleotides, 5, 10, 15 or 20 OMe-modified nucleotides.
  • first blocking elements on the main chain, two ends of one or more first blocking elements, second blocking elements, or a combination of the first blocking element and the second blocking element that are arranged adjacently or spaced apart are connected to one end of the first segment and one end of the second segment, respectively.
  • the other end of the first segment first enters the transmembrane pore and guides the entire adapter into the transmembrane pore, and the other end of the second segment is directly connected to the analyte or indirectly connected to the analyte through nucleotides, thereby characterizing the analyte.
  • the first segment is a single-stranded segment formed by nucleotides and abasic groups
  • the second segment is a single-stranded segment formed by nucleotides.
  • two ends of one or more second blocking elements disposed adjacently or spaced apart are connected to one end of the third segment and one end of the fourth segment, respectively.
  • the other end of the third segment is a free end, which is used to bind to the tether during sequencing to bring the adapter close to the transmembrane pore.
  • the other end of the fourth segment is directly connected to the analyte or indirectly connected to the analyte through nucleotides, thereby characterizing the analyte.
  • the other end of the fourth segment is a free end and is not connected to any group.
  • the third segment is a single-stranded segment formed by connecting nucleotides and/or modified nucleotides
  • the fourth segment is a single-stranded segment formed by nucleotides.
  • the second segment of the main strand can be combined with the fourth strand of the blocking strand in a base complementary pairing manner to form a double strand or a partial double strand.
  • a partial double strand refers to a structure that contains both a double strand and a single strand.
  • the adapter also includes a third strand, which is a single strand formed by nucleotides.
  • the third strand is also called a fluorescent strand.
  • the third strand can be combined with the first segment of the main strand in a base complementary pairing manner to form a double strand or a partial double strand.
  • the main strand, the blocking strand and the third strand form a Y-shaped adapter.
  • the structure of the Y-shaped adapter is a handle connecting two arms, wherein the second segment of the main strand is combined with the fourth segment of the blocking strand to form the handle of the Y-shaped adapter, one end of the handle is connected to the two arms, and the other end is connected to the analyte.
  • One arm is composed of the free end of the blocking strand, and the other arm is formed by the first segment of the main strand combined with the third strand.
  • the adapter of the present invention is preferably combined with a tether, which is a single-stranded polynucleotide composed of multiple nucleotides and also contains a baseless group that cannot form a complementary double strand.
  • a tether which is a single-stranded polynucleotide composed of multiple nucleotides and also contains a baseless group that cannot form a complementary double strand.
  • One end of the tether is complementary to the adapter and the other end is anchored to the membrane or the hole, thereby surrounding the analyte connected to the adapter around the hole.
  • the adapter of the present invention can be used independently, or it can be used in combination with other adapters (such as hairpin adapters, hairpin-like adapters or conventional Y adapters, etc.) to form a construct for sequencing.
  • the adapter of the present invention connects the two ends of the analyte, such as a polynucleotide, to form a construct for characterizing the polynucleotide.
  • the adapter of the present invention is connected to the 5' end of the polynucleotide, and the 3' end of the polynucleotide is connected to a conventional Y adapter other than the adapter of the present invention to form a construct.
  • the adapter of the present invention can be connected to the 5' end of the double-stranded polynucleotide, and the 3' end of the double-stranded polynucleotide can be connected to a hairpin adapter or a hairpin-like adapter to form a construct.
  • a hairpin-like adapter is a loop having a similar structure to a conventional hairpin, but the loop is not the same as a conventional hairpin structure but is formed by a single-stranded linear molecule folding back on itself, and can connect the two chains of a polynucleotide.
  • hairpin-like adapters see CN113462764A.
  • abasic groups used in the adapter or tether are groups such as iSp18, iSpC3 or iSp9 that cannot form base pairs, which may also be referred to as "abasic sites" or "abasic nucleotides.”
  • Abasic groups are nucleotides or nucleosides that lack a nucleobase at the 1' position of the sugar portion.
  • the analyte is selected from one or more of polynucleotides, polypeptides, polysaccharides and lipids.
  • the analyte is preferably a polynucleotide such as a nucleic acid, including deoxyribonucleic acid (DNA) and/or ribonucleic acid (RNA).
  • the polynucleotide can be single-stranded or double-stranded.
  • the polynucleotide can be circular.
  • the polynucleotide can be an aptamer, a probe hybridized with a microRNA, or the microRNA itself.
  • the polynucleotide can be of any length.
  • the polynucleotide can be at least 10, at least 50, at least 100, at least 150, at least 200, at least 250, at least 300, at least 400 or at least 500 nucleotide pairs in length.
  • the polynucleotide can be 1000 or more nucleotide pairs, 5000 or more nucleotide pairs in length or 100000 or more nucleotide pairs in length.
  • the analyte may be present in any suitable sample.
  • the present invention is generally implemented on a sample known to contain or suspected of containing the analyte.
  • the present invention may be implemented on a sample containing one or more analytes of unknown species.
  • the present invention may be implemented on a sample to confirm the species of one or more analytes known or expected to be present in the sample. It is contemplated by those skilled in the art that "providing an analyte" of the present invention refers to providing a sample containing the analyte, and "connecting a sequencing adapter to an analyte" of the present invention refers to connecting a sequencing adapter to a sequencing adapter. Binds to the analyte present in the sample.
  • a polynucleotide can be any polynucleotide.
  • a polynucleotide such as a nucleic acid is a macromolecule containing two or more nucleotides.
  • the polynucleotide or nucleic acid can include any combination of nucleotides. Nucleotides can be naturally occurring or artificially synthesized.
  • Nucleotides generally contain a nucleobase, a sugar and at least one phosphate group. The nucleobase and sugar form a nucleoside. Nucleotides can be natural nucleotides or non-natural nucleotides.
  • Nucleoside bases are typically heterocyclic. Nucleobases include, but are not limited to, purines and pyrimidines, more specifically, adenine (A), guanine (G), thymine (T), uracil (U), and cytosine (C).
  • A adenine
  • G guanine
  • T thymine
  • U uracil
  • C cytosine
  • the sugar is usually a pentose.
  • Nucleotide sugars include, but are not limited to, ribose and deoxyribose.
  • the sugar is preferably deoxyribose.
  • the nucleotides in the polynucleotide are usually ribonucleotides or deoxyribonucleotides.
  • the polynucleotide may contain the following nucleosides: adenosine, uridine, guanosine and cytidine.
  • the nucleotides are preferably deoxyribonucleotides.
  • the polynucleotide preferably contains the following nucleosides: deoxyadenosine (dA), deoxyuridine (dU) and/or thymidine (dT), deoxyguanosine (dG) and deoxycytidine (dC).
  • the nucleotides typically contain monophosphate, diphosphate or triphosphate.
  • the phosphatase can be attached to the 5' or 3' side of the nucleotide.
  • nucleotides in a polynucleotide can be linked to each other in any manner.
  • nucleic acids the nucleotides are usually linked by their sugar and phosphate groups.
  • nucleobases the nucleobases.
  • the polynucleotide may be single-stranded or double-stranded. At least a portion of the polynucleotide is preferably double-stranded.
  • a polynucleotide can be a nucleic acid, such as deoxyribonucleic acid (DNA) or ribonucleic acid (RNA).
  • DNA deoxyribonucleic acid
  • RNA ribonucleic acid
  • the polynucleotide is preferably DNA, RNA or a DNA or RNA hybrid.
  • the polynucleotide may comprise a single-stranded region and a region with other structures, such as a hairpin loop, a triplex and/or a quadruplex.
  • a DNA/RNA hybrid may comprise DNA and RNA on the same strand.
  • the DNA/RNA hybrid comprises a DNA strand hybridized to an RNA strand.
  • the polynucleotide can be of any length.
  • the polynucleotide can be at least 10, at least 50, at least 100, at least 150, at least 200, at least 250, at least 300, at least 400 or at least 500 nucleotide pairs in length.
  • the polynucleotide can be 1000 or more nucleotide pairs in length, 5000 or more nucleotide pairs in length or 100000 or more nucleotide pairs in length.
  • the polynucleotides may be present in any suitable sample.
  • the present invention is generally performed on samples known to contain or suspected of containing polynucleotides.
  • the present invention may be performed on samples to determine the types of one or more polynucleotides known or expected to be present in the sample.
  • the polynucleotide binding protein can be any protein that can bind to the polynucleotide and control its movement through the hole. It is very simple to determine whether a protein binds to a polynucleotide in the art. Proteins usually interact with polynucleotides and modify at least one of their properties. Proteins can modify polynucleotides by cleaving polynucleotides to form a single nucleotide or a shorter nucleotide chain such as a dinucleotide or trinucleotide. The part can modify the polynucleotide by positioning or moving the polynucleotide to a specific position (i.e., controlling its movement).
  • the polynucleotide binding protein is derived from a polynucleotide processing enzyme; the polynucleotide processing enzyme is selected from a polymerase, a helicase or a nuclease exonuclease. More preferably, the helicase is selected from a He1308 helicase, a RecD helicase, a XPD helicase, a Dda helicase or an ED1 helicase.
  • the polynucleotide binding protein is a polynucleotide helicase.
  • a polynucleotide helicase is an enzyme that can melt a double-stranded polynucleotide into a single strand.
  • a polynucleotide helicase can melt a double-stranded DNA into a single strand.
  • a polynucleotide helicase is an enzyme with helicase activity. Examples of polynucleotide helicases include, for example, helicases as described herein.
  • the polynucleotide binding ability can be measured using any method known in the art.
  • a protein can be contacted with a polynucleotide, and the ability of the protein to bind to the polynucleotide and move along the polynucleotide can be measured.
  • the protein can include modifications that contribute to polynucleotide binding and/or contribute to its activity at high salt concentrations and/or room temperature.
  • the protein can be modified so that it binds to the polynucleotide (i.e., retains the polynucleotide binding ability) but does not act as a helicase (i.e., does not move along the polynucleotide when all the necessary components (e.g., ATP and Mg 2+ ) are available for movement).
  • modifications are known in the art. For example, modifications of the Mg 2+ binding domain in a helicase typically produce variants that do not function as a helicase.
  • the enzyme may be covalently attached to the pore. Any method may be used to covalently attach the enzyme to the pore.
  • polynucleotides are translocated through the hole along or against the applied potential.
  • Exonucleases that gradually or stepwise work on double-stranded polynucleotides can be used on the cis side of the hole to supply the remaining single strand under the applied potential, or on the trans side to supply the remaining single strand under the reverse potential.
  • helicases that unwind the double-stranded DNA can also be used in a similar manner.
  • Polymerases can also be used. Sequencing applications that require chain translocation against the applied potential are also possible, but DNA must first be "captured" by the enzyme at the opposite potential or without potential.
  • Single-stranded DNA exonucleases or single-stranded DNA-dependent polymerases can serve as molecular motors to pull the recently translocated single strand back from the hole in a controlled, stepwise manner against the applied potential from trans to cis.
  • any helicase can be used in the present invention.
  • the helicase can act on the hole in two modes.
  • the method is preferably carried out using a helicase so that the helicase moves the polynucleotide through the hole under the action of the field caused by the applied voltage.
  • the 5' end of the polynucleotide is first captured in the hole, and the helicase moves the polynucleotide into the hole so that it passes through the hole under the action of the field until it finally translocates through and reaches the reverse side of the membrane.
  • the method is preferably carried out in this way, and the helicase moves the polynucleotide through the hole against the field caused by the applied voltage.
  • the 3' end of the polynucleotide is first captured in the hole, and the helicase moves the polynucleotide through the hole so that it is pulled out of the hole against the applied field until it is finally pushed back to the cis side of the membrane.
  • the method can also be performed in the reverse direction.
  • the 3' end of the polynucleotide can first be captured in the pore, and the helicase can move the polynucleotide into the pore, allowing it to pass through the pore under the influence of the field until it finally translocates through and reaches the opposite side of the membrane.
  • the helicase When the helicase does not possess the essential component that is convenient to move or is modified to stop or prevent it from moving, the helicase can be combined with the polynucleotide and serve as a brake to slow down the movement of the polynucleotide when the polynucleotide is pulled into the hole by the external field.
  • the inactive mode it is not important whether the 3 ' or 5 ' of the polynucleotide is captured, and it is the external field that pulls the polynucleotide into the hole towards the opposite side under the effect of the enzyme that serves as the brake.
  • the control of the movement of the helicase to the polynucleotide can be described in a variety of ways, including toothing, sliding and braking.
  • the helicase variant lacking helicase activity can also be used in this way.
  • the polynucleotide may be contacted with the polynucleotide binding protein (e.g., a polynucleotide helicase) and the pore in any order.
  • the polynucleotide binding protein e.g., a polynucleotide helicase
  • the polynucleotide is first contacted with the polynucleotide binding protein (e.g., a polynucleotide helicase) and the pore.
  • the nucleotide binding protein eg, polynucleotide helicase
  • the polynucleotide/polynucleotide binding protein eg, polynucleotide helicase
  • the polynucleotide/polynucleotide binding protein eg, polynucleotide helicase
  • Any step in the method using a polynucleotide binding protein is typically performed in the presence of free nucleotides or free nucleotide analogs and an enzyme cofactor that promotes the action of the polynucleotide binding protein (e.g., a polynucleotide helicase).
  • the free nucleotides can be one or more of any individual nucleotides.
  • Free nucleotides include, but are not limited to, adenosine monophosphate (AMP), adenosine diphosphate (ADP), adenosine triphosphate (ATP), guanosine monophosphate (GMP), guanosine diphosphate (GDP), guanosine triphosphate (GTP), thymidine monophosphate (TMP), thymidine diphosphate (TDP), thymidine triphosphate (TTP), uridine monophosphate (UMP), uridine diphosphate (UDP), uridine triphosphate (UTP), cytidine monophosphate (CMP), cytidine diphosphate (CDP), cytidine triphosphate (CTP), cyclic adenosine monophosphate (cAMP), cyclic guanosine monophosphate (cGMP), deoxyguanosine monophosphate (DAMP), deoxyguanosine monophosphate (DGP), deoxyguanosine monophosphat
  • the free nucleotides are preferably selected from AMP, TMP, GMP, CMP, UMP, dAMP, dTMP, dGMP or dCMP.
  • the free nucleotides are preferably adenosine triphosphate (ATP).
  • the enzyme cofactor is a factor that allows the construct to work.
  • the enzyme cofactor is preferably a divalent metal cation.
  • the divalent metal cation is preferably Mg 2+ , Mn 2+ , Ca 2+ or Co 2+ .
  • the enzyme cofactor is most preferably Mg 2+ .
  • the polynucleotide binding protein has stopped moving along the analyte, such as a polynucleotide, it is arrested.
  • the combination of an arresting element is used to arrest the polynucleotide binding protein.
  • the polynucleotide binding protein can be arrested before the arresting element.
  • the stalled helicase and polynucleotide are contacted with a transmembrane pore and an electric potential is applied. Under the action of the field generated by the applied electric potential, the polynucleotide moves through the pore.
  • the polynucleotide-binding protein is usually too large to move through the pore. When a portion of the polynucleotide enters the pore and moves along the field generated by the applied electric potential, the polynucleotide-binding protein moves through the pore through the associated blocking element as the polynucleotide moves through the pore.
  • the polynucleotide binding protein On the polynucleotide to be controlled. Before the stalled polynucleotide binding protein and polynucleotide are contacted with the transmembrane pore and an electric potential is applied, the polynucleotide binding protein stays at the position where they are stalled. Even in the presence of necessary components (e.g., ATP and Mg2+) to promote the movement of the polynucleotide binding protein, the polynucleotide binding protein will not move through the combined blocking element on the polynucleotide until there is a transmembrane pore and an applied electric potential.
  • necessary components e.g., ATP and Mg2+
  • Any membrane may be used in accordance with the present invention. Suitable membranes are well known in the art.
  • the membrane is preferably an amphiphilic layer.
  • An amphiphilic layer is a layer formed by amphiphilic molecules (e.g., phospholipids) having both hydrophilic and lipophilic properties.
  • the amphiphilic molecules may be synthetic or naturally occurring.
  • Non-naturally occurring amphiphiles and amphiphiles that form monolayers are known in the art and include, for example, block copolymers.
  • Block copolymers are polymeric materials in which two or more monomer subunits are polymerized together to produce a single polymer chain. Block copolymers typically have properties provided by each monomer subunit.
  • block copolymers may have unique properties that polymers formed by individual subunits do not have.
  • Block copolymers may be designed such that one monomer subunit is hydrophobic (i.e., lipophilic) while the other subunit or subunits are hydrophilic when in an aqueous medium.
  • the block copolymer may have amphiphilic properties and may form a membrane.
  • the block copolymers can be diblock (composed of two monomer subunits), but can also be composed of more than two monomer subunits to form more complex arrangements that behave as amphiphiles.
  • the copolymers can be triblock, tetrablock or pentablock copolymers.
  • Archaeal bipolar tetraether lipids are naturally occurring lipids that are constructed so that the lipids form monolayer membranes. These lipids are commonly found in extremophiles, thermophiles, halophiles, and acidophiles that survive in harsh biological environments. It is believed that their stability comes from the fusogenic nature of the final bilayer. It is easy to construct block copolymer materials that mimic these biological entities by creating triblock polymers with the general motif hydrophilic-hydrophobic-hydrophilic. The material can form a monomeric membrane that behaves similarly to a lipid bilayer and has a range of phase states from vesicles to laminae. The membranes formed by these triblock copolymers have some advantages over biological lipid membranes. Because the triblock copolymers are synthetic, the precise construction can be carefully controlled to provide the correct chain length and properties required to form a membrane and interact with pores and other proteins.
  • Block copolymers can also be constructed from subunits that are not classified as lipid submaterials; for example, hydrophobic polymers can be made from monomers based on siloxane or other non-hydrocarbon compounds.
  • the hydrophilic subportion of the block copolymer can also have low protein binding properties, which enables the production of membranes that are highly resistant when exposed to raw biological samples.
  • the head group unit can also be derived from atypical lipid head groups.
  • triblock copolymer membranes Compared to biological lipid membranes, triblock copolymer membranes also have enhanced mechanical and environmental stability, such as much higher operating temperature or pH range.
  • the synthetic nature of the block copolymers provides a platform to tailor polymer-based membranes for a wide range of applications.
  • amphiphilic molecules may be chemically modified or functionalized to facilitate coupling of the analyte.
  • the amphiphilic layer can be a single layer or a double layer.
  • the amphiphilic layer is usually planar.
  • the amphiphilic layer can be non-planar, for example curved.
  • the amphiphilic layer is typically a lipid bilayer.
  • the lipid bilayer is a model for a cell membrane and serves as an excellent platform for a range of experimental studies.
  • lipid bilayers can be used for in vitro studies of membrane proteins using single channel recording.
  • lipid bilayers can be used as biosensors to detect the presence of a range of substances.
  • the lipid bilayer can be any lipid bilayer. Suitable lipid bilayers include, but are not limited to, planar lipid bilayers, supported bilayers or liposomes.
  • the lipid bilayer is preferably a planar lipid bilayer.
  • the film is a solid layer.
  • the solid layer is not of biological origin. In other words, the solid layer is not derived from or separated from a biological environment, such as an organism or a cell, or a biologically usable structure in a synthetically manufactured form.
  • the solid layer can be formed with organic or inorganic materials, including but not limited to microelectronic materials, insulating materials such as Si 3 N 4 , Al 2 O 3 , and SiO 2 , organic and inorganic polymers such as polyamides, plastics such as or elastomers such as two-component addition-cure silicone rubber and glass.
  • the solid layer can be formed with graphene.
  • Transmembrane pores are structures that allow hydrated ions to flow from one side of the membrane to the other side driven by an applied electrical potential.
  • the transmembrane pore is preferably a transmembrane protein pore.
  • a transmembrane protein pore is a polypeptide or a collection of polypeptides that allows hydrated ions (e.g., analytes) to flow from one side of the membrane to the other side of the membrane.
  • the transmembrane protein pore can form a hole that allows hydrated ions driven by an applied potential to flow from one side of the membrane to the other side.
  • the transmembrane protein pore preferably allows analytes (e.g., nucleotides) to flow from one side of the membrane (e.g., a lipid bilayer) to the other side.
  • the transmembrane protein pore allows polynucleotides (e.g., DNA or RNA) to move through the pore.
  • the transmembrane protein pore may be a monomer or an oligomer.
  • the pore is preferably composed of several repeating subunits (e.g. 6, 7 or 8 subunits).
  • the pore is more preferably a heptamer or octamer pore.
  • Transmembrane protein pores typically comprise a barrel or channel through which the ions can flow.
  • the subunits of the pore typically surround a central axis and provide strands for a transmembrane beta barrel or channel or a transmembrane alpha-helical bundle or channel.
  • the barrel or channel of a transmembrane protein pore typically comprises amino acids that promote interaction with an analyte (e.g., a nucleotide, a polynucleotide, or a nucleic acid). These amino acids are preferably located near the constriction of the barrel or channel.
  • Transmembrane protein pores typically comprise one or more positively charged amino acids (e.g., arginine, lysine, or histidine) or aromatic amino acids (e.g., tyrosine or tryptophan). These amino acids typically promote interaction between the pore and a nucleotide, polynucleotide, or nucleic acid.
  • Transmembrane protein pores useful in the present invention may be derived from ⁇ -barrel pores or ⁇ -helical bundle pores, ⁇ -barrel pores comprising a barrel or channel formed by ⁇ -strands.
  • Suitable ⁇ -barrel pores include, but are not limited to, ⁇ -toxins, such as ⁇ -hemolysin, anthrax toxin and leukocidin, and bacterial outer membrane proteins/porins, such as Mycobacterium smegmatis porins (Msp) (e.g., MspA), outer membrane porin F (OmpF), outer membrane porin G (OmpG), outer membrane phospholipase A and Neisseria autotransporter lipoprotein (NalP).
  • Msp Mycobacterium smegmatis porins
  • OmpF outer membrane porin F
  • OmpG outer membrane porin G
  • Neisseria autotransporter lipoprotein Neisseria autotransporter lipoprotein
  • ⁇ -helical bundle pores comprise a barrel or channel formed by ⁇ -helices.
  • Suitable ⁇ -helical bundle pores include, but are not limited to inner membrane proteins and ⁇ -outer membrane proteins, such as WZA and ClyA toxins.
  • Transmembrane pores may be derived from Msp or ⁇ -hemolysin ( ⁇ -HL).
  • the transmembrane protein pore is preferably derived from Msp, preferably derived from MspA. Such pores are oligomers and typically comprise 7, 8, 9 or 10 monomers derived from Msp.
  • the pore may be a homo-oligomeric pore derived from Msp comprising the same monomers. Alternatively, the pore may be a hetero-oligomeric pore derived from Msp containing at least one monomer different from the other monomers.
  • the pore may also comprise one or more constructs comprising two or more covalently linked monomers derived from Msp.
  • the transmembrane protein pore is also preferably derived from alpha-hemolysin (alpha-HL).
  • alpha-HL alpha-hemolysin
  • the wild-type alpha-HL pore is formed by 7 identical monomers or subunits (ie it is a heptamer).
  • the transmembrane protein pore is chemically modified.
  • the pore can be chemically modified at any site in any manner.
  • the transmembrane protein pore is chemically modified by binding of a molecule to one or more cysteines (cysteine ligation), binding of a molecule to one or more lysines, binding of a molecule to one or more non-natural amino acids, enzymatic modification of an epitope, or modification of a terminal end. Suitable methods for performing such modifications are well known in the art.
  • the transmembrane protein pore can be chemically modified by binding any molecule.
  • the pore can be chemically modified by binding a dye or a fluorophore.
  • any number of monomers in the pore may be chemically modified.
  • one or more, for example 2, 3, 4, 5, 6, 7, 8, 9 or 10, of the monomers are chemically modified as described above.
  • the analyte such as the polynucleotide
  • Moving the polynucleotide through the transmembrane pore refers to moving the polynucleotide from one side of the pore to the other side.
  • the movement of the polynucleotide through the pore may be driven or controlled by an electric potential or an enzymatic action or both an electric potential and an enzymatic action.
  • the movement may be unidirectional, or backward and forward movement may be allowed.
  • a polynucleotide binding protein is used to control the movement of the polynucleotide through the pore.
  • the characterization method may include measuring one, two, three, four or five or more characteristics of an analyte, such as a polynucleotide.
  • the method includes controlling movement of an analyte through a transmembrane pore and obtaining one or more measurements as the analyte moves relative to the pore, wherein the measurements represent one or more characteristics of the analyte.
  • the characteristic is preferably selected from (i) the length of the polynucleotide, (ii) the identity of the polynucleotide, (iii) the sequence of the polynucleotide, (iv) the secondary structure of the polynucleotide, and (v) whether the polynucleotide is modified.
  • the analyte is a polynucleotide. Any number of polynucleotides can be characterized.
  • the method of the present invention can involve characterizing 2, 3, 4, 5, 6, 7, 8, 9, 10, 20, 30, 50, 100 or more polynucleotides.
  • the polynucleotide can be naturally occurring or artificial.
  • the method can be used to test the sequence of the manufactured oligonucleotide. The method is generally performed in vitro.
  • the method of the present invention comprises moving the single-stranded polynucleotide through the transmembrane pore such that a portion of the nucleotides of the single-stranded polynucleotide interact with the pore.
  • the method can be performed using any suitable membrane as described above, preferably a lipid bilayer system, wherein the pore is inserted into the lipid bilayer.
  • the method is typically performed using a membrane that includes (i) an artificial bilayer containing a pore, (ii) an isolated naturally occurring lipid bilayer containing a pore, or (iii) a cell having a pore inserted therein.
  • the method is preferably performed using an artificial lipid bilayer.
  • the bilayer may include other transmembrane proteins and/or intramembrane proteins and other molecules. Suitable apparatus and conditions are described in detail below with reference to the sequencing embodiment of the present invention.
  • the method of the present invention is typically performed in vitro.
  • the present invention provides a method for characterizing a polynucleotide, the method comprising:
  • step (b) contacting the polynucleotide loaded with the polynucleotide binding protein provided in step (a) with the transmembrane pore;
  • the polynucleotide binding protein stops in front of the first retarding element of the adapter under the combined action of the retarding elements, and then moves through the main chain of the adapter under the action of the electric potential.
  • the order of movement on the main chain is: the first retarding element, the segment complementary to the second retarding element, the second segment, and then through the polynucleotide chain connected to the main chain.
  • the polynucleotide binding protein does not pass through the blocking chain of the adapter.
  • the polynucleotide binding protein also passes through the blocking chain of the adapter, and when the second retarding element is located on the blocking chain, the order of movement on the blocking chain is: the fourth segment, the nucleotide of the second retarding element.
  • the polynucleotide binding protein does not pass through the modifier of the second retarding element.
  • transmembrane protein pores can be used to distinguish nucleotides with similar structures based on their different effects on the current passing through the pore.
  • Individual nucleotides can be identified at the single molecule level based on their current amplitude when they interact with the pore. If the current flows through the pore in a manner specific for a certain nucleotide (i.e., if a characteristic current associated with the nucleotide is detected flowing through the pore), then the nucleotide is present in the pore. Continuous identification of nucleotides in a polynucleotide enables the sequence of the polynucleotide to be estimated or determined.
  • the method involves sensing a portion of the nucleotides in the polynucleotide across a membrane pore as they pass through the barrel or channel one by one in order to sequence the polynucleotide. As described above, this is strand sequencing.
  • the method includes providing a tether for bringing the construct into proximity with the transmembrane pore; the tether includes a capture region and an anchor region, wherein the capture region is used to capture the adaptor of the construct, and the anchor region is used to anchor and bind to the transmembrane pore or the membrane in which the transmembrane pore is located.
  • the method can be used to sequence all or only a portion of the polynucleotide.
  • the polynucleotide can be of any length.
  • the polynucleotide can be at least 10, at least 50, at least 100, at least 150, at least 200, at least 250, at least 300, at least 400 or at least 500 nucleotide pairs in length.
  • the polynucleotide can be 1000 or more nucleotide pairs, 5000 or more nucleotide pairs or 100000 or more nucleotide pairs in length.
  • the polynucleotide can be naturally occurring or artificial.
  • the method can be used to verify the sequence of the oligonucleotide manufactured. The method is usually performed in vitro.
  • the single stranded polynucleotide may interact with the pore on either side of the membrane.
  • the single stranded polynucleotide may interact with the pore at any site in any manner.
  • the nucleotide affects the current flowing through the hole in a manner specific to the nucleotide. For example, a specific nucleotide will reduce the current flowing through the hole, and this reduction lasts for a specific average duration and reaches a specific degree. In other words, the current flowing through the hole is characteristic for a specific nucleotide.
  • Control experiments can be performed to determine the influence of a specific nucleotide on the current flowing through the hole. Then, the results obtained by performing the method of the present invention on the test sample can be compared with the results obtained from such control experiments to determine or estimate the sequence of the polynucleotide.
  • the sequencing method can be performed using any suitable membrane/pore system in which the pore is inserted into the membrane.
  • the method is typically performed using a membrane comprising naturally occurring or synthetic lipids.
  • the membrane is typically formed in vitro.
  • the method is preferably not performed using an isolated naturally occurring pore-containing membrane, or a cell expressing a pore.
  • the method is preferably performed using an artificial membrane.
  • the membrane may contain other transmembrane proteins and/or intramembrane proteins and other molecules.
  • the polypeptide is first connected to a nucleic acid to obtain a nucleic acid-polypeptide conjugate, and then the adaptor of the present invention is connected to the nucleic acid-polypeptide conjugate, thereby characterizing the polypeptide.
  • the other steps of the method for characterizing the polypeptide are similar to the steps for characterizing the polynucleotide. Kit
  • the present invention also provides a kit for preparing a method for characterizing an analyte such as a polynucleotide, wherein the kit comprises (a) an adaptor of the present invention, and (b) a polynucleotide binding protein, and/or (c) a transmembrane pore, and optionally a tether.
  • the kit preferably also comprises one or more markers that produce a characteristic current when interacting with the transmembrane pore. Such markers are described in detail above.
  • the kit preferably also comprises a means for coupling the polynucleotide to the membrane.
  • the means for coupling the polynucleotide to the membrane are described above.
  • the means for coupling preferably comprises a reactive group. Suitable groups include but are not limited to Limited to sulfhydryl, cholesterol, lipid and biotin groups.
  • the kit may also contain components of membranes, such as phospholipids required for lipid bilayer formation.
  • the kit of the present invention may additionally comprise one or more other reagents or instruments that enable any of the above embodiments to be implemented.
  • reagents or instruments include one or more of the following: a suitable buffer (aqueous solution), a tool for sampling from a subject (e.g., a tube or apparatus comprising a needle), a tool for amplifying and/or expressing polynucleic acids, such as a membrane or voltage clamp or patch clamp device as defined above.
  • the reagents may be present in the kit in a dry state so that the reagents can be resuspended with a fluid sample.
  • the kit may also include instructions for making the kit useful in the method of the present invention, or detailed instructions on which patients the method may be used for.
  • the kit may include nucleotides.
  • the main chain, fluorescent chain and blocking chain were synthesized separately, and the main chain, fluorescent chain and blocking chain were annealed at a ratio of 1:1.1:1.1 to form a Y-type connector as shown in Figure 1.
  • the annealing treatment was specifically to slowly cool from 95°C to 25°C, with the cooling amplitude not exceeding 0.1°C/s.
  • the annealing treatment system included 160mM HEPES 7.0, 200mM NaCl, and the concentration of the main chain in the annealing system was 4-8uM.
  • 500nM Y-type adapter 6 times the amount of substance of helicase E1 (SEQ ID NO.: 14 with M1G/E94C/C109A/C136A/A360C mutations) and 1.5mM TMAD (azodicarbonamide) were mixed and incubated at room temperature for 30 minutes to prepare sequencing adapter complexes QMX1-QMX10.
  • the sequencing adapter complex was added to a DNAPac PA200 column and purified with elution buffer to elute the enzyme that was not bound to the sequencing adapter complex from the column.
  • the sequencing adapter complex was then eluted with a mixture of buffer A and buffer B with 10 times the column volume.
  • buffer A 20mM Na-CHES, 250mM NaCl, 4% (W/V) glycerol, pH 8.6
  • buffer B 20mM Na-CHES, 1M NaCl, 4% (W/V) glycerol, pH 8.6.
  • sequences of the main chain, fluorescent chain and blocking chain used in the linker of this example are as follows:
  • Y1 (as shown in SEQ ID NO.: 1)
  • Y1-3C3 (as shown in SEQ ID NO.: 2)
  • Y1-5C3 (as shown in SEQ ID NO.: 3)
  • Y1-5C3-cPIP (as shown in SEQ ID NO.: 4)
  • Fluorescent chain sequence Y2 (as shown in SEQ ID NO.: 8)
  • Blocking chain sequence B (as shown in SEQ ID NO.: 9), B-LNA (as shown in SEQ ID NO.: 10), B-PNA (as shown in SEQ ID NO.: 11), B-cPIP (as shown in SEQ ID NO.: 12)
  • Sample No. 9 is a sample with 5C3 on the main chain but no cPIP
  • Sample No. 10 is a sample with 5C3 on the main chain as a blocking element and cPIP interacting with the double chain as another blocking element. It can be seen from samples No. 9 and No. 10 that in the presence of cPIP, the enzyme quality inspection shedding rate dropped from 60.5% to 51.8%.
  • the main chain, fluorescent chain and blocking chain were synthesized separately, and the main chain, fluorescent chain and blocking chain were annealed at a ratio of 1:1.1:1.1 to form a Y-type connector.
  • the annealing treatment was specifically to slowly cool from 95°C to 25°C, with the cooling amplitude not exceeding 0.1°C/s.
  • the annealing treatment system included 160mM HEPES 7.0, 200mM NaCl, and the concentration of the main chain in the annealing system was 4-8uM.
  • RNAPac PA200 500 nM Y-type adapter, 5 times the amount of substance of helicase E2 (SEQ ID NO.: 15 with D99C/A366C/C308T/C419D/E286K/F246Y/S293N/V422H mutations) and 1.5 mM TMAD (azodicarbonamide) were mixed and incubated at room temperature for 30 minutes to prepare sequencing adapter complexes QMX11-QMX14.
  • the sequencing adapter complex complex was added to a DNAPac PA200 column and purified with elution buffer to elute the enzyme that was not bound to the sequencing adapter complex from the column.
  • the sequencing adapter complex was then eluted with a mixture of buffer A and buffer B with 10 times the column volume. The main elution peak was then pooled, its concentration was measured, and a TBE PAGE gel was run at 160V for 40 minutes.
  • buffer A 20mM Na-CHES, 250mM NaCl, 4% (W/V) glycerol, pH 8.6
  • buffer B 20mM Na-CHES, 1M NaCl, 4% (W/V) glycerol, pH 8.6.
  • sequences of the main chain, fluorescent chain and blocking chain used in the linker of this example are as follows:
  • Fluorescent chain sequence Y2 (as shown in SEQ ID NO.: 8)
  • Blocking chain sequence B (as shown in SEQ ID NO.: 9), B-LNA (as shown in SEQ ID NO.: 10)
  • Example 3 Chip mixed library test of linkers containing single blocking elements and linkers containing combined blocking elements
  • Example 4 Chip mixed library test of adapters containing joint blocking elements and commercial adapters
  • the commercial 4C18-containing linker complex QMX-15 (Y1-4C18/Y2/B/E2) was prepared using the method of Example 1.
  • the linker in the linker complex was formed by a main chain Y1-4C18 (as shown in SEQ ID NO.: 7), a fluorescent chain (as shown in SEQ ID NO.: 8) and a blocking chain (as shown in SEQ ID NO.: 9).
  • a 10 kb library was prepared by end repair, and the QMX-13 prepared in Example 2 and the commercial linker QMX-15 (Y1-4C18/Y2/B/E2) containing the 4C18 blocking element were connected to the library. Equal amounts of QMX-13 and QMX-15 libraries were evenly loaded onto three different QNome-9604s of Qi Carbon Technology Co., Ltd. for sequencing. The sequence of the tether used in the sequencing process is shown in SEQ ID NO: 13. The comparison of the mixed test data throughput is shown in Table 4 below.

Landscapes

  • Life Sciences & Earth Sciences (AREA)
  • Chemical & Material Sciences (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Organic Chemistry (AREA)
  • Zoology (AREA)
  • Wood Science & Technology (AREA)
  • Health & Medical Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Microbiology (AREA)
  • Immunology (AREA)
  • Biotechnology (AREA)
  • Molecular Biology (AREA)
  • Biophysics (AREA)
  • Analytical Chemistry (AREA)
  • Physics & Mathematics (AREA)
  • Biochemistry (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • Genetics & Genomics (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)

Abstract

Provided in the present invention are an adapter for characterizing an analyte, a construct comprising the analyte and the adapter, and a method for characterizing the analyte using same. Further provided is the use of an organic oligocation such as spermine as a blocking element for blocking the binding of polynucleotide to protein. Specifically, the adapter contains two different blocking elements. By the combined use of the two blocking elements, blocking the binding of polynucleotide to protein can be achieved and the application range of a spacer region is broadened.

Description

包含联合阻滞元件的衔接体、构建体、方法和应用Adaptors, constructs, methods and uses comprising combined blocking elements
相关申请的交叉引用CROSS-REFERENCE TO RELATED APPLICATIONS
本申请要求享有于2022年12月13日提交的名称为“包含联合阻滞元件的衔接体、构建体、方法和应用”的中国专利申请202211595915.2的优先权,该申请的全部内容通过引用并入本文中。This application claims priority to Chinese patent application No. 202211595915.2, filed on December 13, 2022, entitled “Adapters, constructs, methods and applications comprising combined blocking elements,” the entire contents of which are incorporated herein by reference.
技术领城Technology
本申请涉及纳米孔测序技术领域,具体涉及用于表征分析物的包含联合阻滞元件的衔接体、构建体、方法和应用。The present application relates to the technical field of nanopore sequencing, and in particular to adaptors, constructs, methods and applications comprising a combined blocking element for characterizing analytes.
背景技术Background technique
分析物如多核苷酸(如RNA或DNA)测序技术都有广泛应用。对多核苷酸,尤其是DNA的测序技术也一直在更新。Analytes such as polynucleotides (such as RNA or DNA) sequencing technologies are widely used. The sequencing technologies for polynucleotides, especially DNA, are also constantly being updated.
DNA测序技术从1977年以来已发展到第四代测序技术,除了第一代测序技术,第二代、第三代、第四代测序技术统称为下一代测序(NGS)技术。基因测序主要经历了从第一代的Sanger测序法,到第二代的边合成边测序、第三代的单分子测序和第四代纳米孔测序发展过程。相较于第一代测序技术,NGS技术因为通量提高、成本降低和测序周期缩短优势,得到更广泛的应用。DNA sequencing technology has developed to the fourth generation of sequencing technology since 1977. In addition to the first generation of sequencing technology, the second, third, and fourth generation sequencing technologies are collectively referred to as next generation sequencing (NGS) technology. Gene sequencing has mainly experienced the development process from the first generation of Sanger sequencing, to the second generation of sequencing by synthesis, the third generation of single molecule sequencing, and the fourth generation of nanopore sequencing. Compared with the first generation of sequencing technology, NGS technology has been more widely used due to its advantages of increased throughput, reduced costs, and shortened sequencing cycles.
第四代纳米孔测序指的是DNA分子在电泳驱动下通过纳米微孔时,会因为DNA分子每个碱基的大小形状不同而引起特征性的电流变化,由此利用电子传导检测可确定DNA分子的碱基类型和排列顺序,实现了单分子测序。The fourth-generation nanopore sequencing refers to the situation where when a DNA molecule passes through a nanopore driven by electrophoresis, the different sizes and shapes of each base in the DNA molecule will cause characteristic current changes. Therefore, the base type and arrangement order of the DNA molecule can be determined by electron conduction detection, thus achieving single-molecule sequencing.
在纳米孔测序技术中,未施加电势时,通常需要使多核苷酸结合蛋白停滞在多核苷酸上,防止多核苷酸结合蛋白沿多核苷酸进一步移动。但当多核苷酸与多核苷酸结合蛋白以及跨膜孔接触并施加电势后,可以移动停滞的多核苷酸结合蛋白,沿着待测多核苷酸序列进行移动,从而达到测序的目的。In nanopore sequencing technology, when no potential is applied, the polynucleotide binding protein is usually required to be stagnant on the polynucleotide to prevent the polynucleotide binding protein from moving further along the polynucleotide. However, when the polynucleotide contacts the polynucleotide binding protein and the transmembrane pore and a potential is applied, the stagnant polynucleotide binding protein can be moved along the polynucleotide sequence to be tested, thereby achieving the purpose of sequencing.
现有技术中通常采用由iSp18或iSpC9等无碱基基团形成的单链核酸作为间隔区对多核苷酸结合蛋白的移位功能进行抑制从而达到停滞多核苷酸结合蛋白的效果,之后通过电场力让多核苷酸结合蛋白越过间隔区开始正常测序。但是,现有技术中可以起到良好的停滞多核苷酸结合蛋白的间隔区非常有限,某些基团作为间隔区会使多核苷酸结合蛋白的脱落率高或数据通量低。In the prior art, single-stranded nucleic acids formed by abasic groups such as iSp18 or iSpC9 are usually used as spacers to inhibit the translocation function of polynucleotide binding proteins, thereby achieving the effect of stagnating polynucleotide binding proteins, and then the polynucleotide binding proteins are allowed to cross the spacers through the electric field force to start normal sequencing. However, the spacers that can effectively stagnate polynucleotide binding proteins in the prior art are very limited, and some groups as spacers will cause the polynucleotide binding proteins to have a high shedding rate or low data throughput.
发明内容Summary of the invention
本发明人惊奇地发现当某些单链核酸修饰作为间隔区不能起到良好的阻滞效果,但是如果与另一提高双链稳定性(比如增加双链的解链难度或提高双链Tm值)的阻滞元件联合使用时,则可以达到良好的阻滞效果,甚至比现有技术的传统间隔区同等或更好的阻滞效果,并且可以用于纳米孔测序中。The inventors surprisingly found that when certain single-stranded nucleic acid modifications are used as spacers, they cannot achieve a good blocking effect. However, if they are used in combination with another blocking element that improves the stability of the double strand (such as increasing the difficulty of double-stranded unzipping or increasing the double-stranded Tm value), then a good blocking effect can be achieved, even the same or better blocking effect than the traditional spacers in the prior art, and can be used in nanopore sequencing.
因此,本发明的第一方面涉及一种用于表征分析物的衔接体,所述衔接体包含一个或多个第一阻滞元件和一个或多个与所述第一阻滞元件不同的第二阻滞元件,并且在所述衔接体与多核苷酸结合蛋白接触之后,所述一个或多个第一阻滞元件和所述一个或多个第二阻滞元件能够联合作用来停 滞所述多核苷酸结合蛋白。Therefore, a first aspect of the present invention relates to an adaptor for characterizing an analyte, the adaptor comprising one or more first blocking elements and one or more second blocking elements different from the first blocking elements, and after the adaptor is contacted with a polynucleotide binding protein, the one or more first blocking elements and the one or more second blocking elements can act in combination to stop The polynucleotide binding protein is retained.
优选地,所述衔接体包含互补的主体链和阻挡链,所述一个或多个第一阻滞元件以共价键连接到所述主体链上;所述一个或多个第二阻滞元件以共价键修饰到所述主体链或所述阻挡链上或以非共价键与所述主体链和/或所述阻挡链互作。Preferably, the linker comprises a complementary main chain and a blocking chain, and the one or more first blocking elements are covalently connected to the main chain; the one or more second blocking elements are covalently modified to the main chain or the blocking chain or interact with the main chain and/or the blocking chain by non-covalent bonds.
优选地,所述一个或多个第二阻滞元件与所述阻挡链或所述主体链互补,其中,当所述一个或多个第二阻滞元件以共价键修饰到所述主体链上时,所述一个或多个第二阻滞元件与所述阻挡链互补;当所述一个或多个第二阻滞元件以共价键修饰到所述阻挡链上时,所述一个或多个第二阻挡元件与所述主体链互补;当一个或多个第二阻滞元件以非共价键与所述主体链和/或所述阻挡链互作时,所述第二阻滞元件可与所述阻挡链或所述主体链互补。Preferably, the one or more second blocking elements are complementary to the blocking chain or the main chain, wherein, when the one or more second blocking elements are modified to the main chain by covalent bonds, the one or more second blocking elements are complementary to the blocking chain; when the one or more second blocking elements are modified to the blocking chain by covalent bonds, the one or more second blocking elements are complementary to the main chain; when the one or more second blocking elements interact with the main chain and/or the blocking chain by non-covalent bonds, the second blocking elements may be complementary to the blocking chain or the main chain.
优选地,所述一个或多个第一阻滞元件具有与核苷酸不同的结构;所述一个或多个第二阻滞元件具有用于提高双链稳定性的结构。Preferably, the one or more first retarding elements have a structure different from that of nucleotides; and the one or more second retarding elements have a structure for improving double-strand stability.
优选地,所述第一阻滞元件包含一个或多个选自由有机寡阳离子、iSpC3、iSp18、iSp9、硝基吲哚、肌苷、吖啶、2-氨基嘌呤、2-6-二氨基嘌呤、5-溴-脱氧尿嘧啶、反向胸苷(反向dT)、反向二脱氧胸苷(ddT)、二脱氧胞苷(ddC)、5-甲基胞苷酸、5-羟甲基胞苷、2'-O-甲基RNA碱基、异脱氧胞苷(异-dC)、异脱氧鸟苷(异-dG)、光裂解(PC)的基团或己二醇组成的群组;所述第二阻滞元件包含一个或多个选自由经锁核酸(LNA)、肽核酸(PNA)、甲氧基(OMe)、双环核苷(BNA)、甘油核酸(GNA)、苏糖核酸(TNA)、环吡咯咪唑聚酰胺(cPIP)或双链结合蛋白修饰的核苷酸组成的群组。Preferably, the first blocking element comprises one or more selected from the group consisting of organic oligocations, iSpC3, iSp18, iSp9, nitroindole, inosine, acridine, 2-aminopurine, 2-6-diaminopurine, 5-bromo-deoxyuracil, inverted thymidine (inverted dT), inverted dideoxythymidine (ddT), dideoxycytidine (ddC), 5-methylcytidylic acid, 5-hydroxymethylcytidine, 2'-O-methyl RNA base, isodeoxycytidine (iso-dC), isodeoxyguanosine (iso-dG), a photocleavable (PC) group or hexanediol; the second blocking element comprises one or more selected from the group consisting of nucleotides modified with locked nucleic acid (LNA), peptide nucleic acid (PNA), methoxy (OMe), bicyclic nucleoside (BNA), glycerol nucleic acid (GNA), threose nucleic acid (TNA), cyclopyrrolimidazole polyamide (cPIP) or double-stranded binding protein.
优选地,所述有机寡阳离子具有通式Bj,Bj是j‐聚体的有机寡阳离子部分,j=1‐50,其中B选自包括以下基团的组:Preferably, the organic oligocation has the general formula Bj, Bj is the organic oligocation part of a j-mer, j=1-50, wherein B is selected from the group comprising the following groups:
‐HPO3‐R1‐(X‐R2 n)n1‐X‐R3‐O‐,其中R1、R2 n和R3是相同或不同的C1‐C5低级亚烷基,X是NH或NC(NH2)2,n1=2‐20,-HPO 3 -R 1 -(X-R 2 n ) n1 -X-R 3 -O-, wherein R 1 , R 2 n and R 3 are the same or different C1-C5 lower alkylene groups, X is NH or NC(NH 2 ) 2 , n1=2-20,
‐HPO3‐R4‐CH(R5X1)‐R6‐O‐,其中R4是C1‐C5低级亚烷基,R5和R6是相同或不同的C1‐C5低级亚烷基,X1为腐胺、亚精胺或精胺残基,-HPO 3 -R 4 -CH(R 5 X 1 )-R 6 -O-, wherein R 4 is a C1-C5 lower alkylene group, R 5 and R 6 are the same or different C1-C5 lower alkylene groups, and X 1 is a putrescine, spermidine or spermine residue,
‐HPO3‐R7‐(aa)n2‐R8‐O‐,其中R7是C1‐C5低级亚烷基,R8是C1‐C5低级亚烷基、丝氨酸、天然氨基醇,(aa)n2是含有具有阳离子侧链的天然氨基酸的肽,n2=2‐20;‐HPO 3 ‐R 7 ‐(aa) n2 ‐R 8 ‐O‐, wherein R 7 is C1‐C5 lower alkylene, R 8 is C1‐C5 lower alkylene, serine, natural amino alcohol, (aa) n2 is a peptide containing a natural amino acid having a cationic side chain, and n2=2‐20;
更优选地,所述有机寡阳离子选自精胺(Sp)。More preferably, the organic oligocation is selected from spermine (Sp).
优选地,所述第一阻滞元件与所述第二阻滞元件不互补。Preferably, the first blocking element is not complementary to the second blocking element.
优选地,所述衔接体还包含第三链,所述第三链与所述主体链部分互补;更优选地,所述主体链与所述阻挡链的部分互补并且所述主体链与所述阻挡链的互补端用于与所述分析物直接或间接连接。Preferably, the adapter further comprises a third strand, which is partially complementary to the main strand; more preferably, the main strand is partially complementary to the blocking strand and the complementary ends of the main strand and the blocking strand are used to directly or indirectly connect to the analyte.
优选地,所述分析物选自多核苷酸、多肽、脂质或多糖,优选多核苷酸,所述多核苷酸是完全双链多核苷酸、部分双链多核苷酸或单链多核苷酸。Preferably, the analyte is selected from a polynucleotide, a polypeptide, a lipid or a polysaccharide, preferably a polynucleotide, which is a fully double-stranded polynucleotide, a partially double-stranded polynucleotide or a single-stranded polynucleotide.
优选地,所述多核苷酸结合蛋白衍生自多核苷酸处理酶;所述多核苷酸处理酶选自聚合酶、解旋酶或核酸外切酶。更优选地,所述解旋酶选自He1308解旋酶、RecD解旋酶、XPD解旋酶、Dda 解旋酶或ED1解旋酶。Preferably, the polynucleotide binding protein is derived from a polynucleotide processing enzyme; the polynucleotide processing enzyme is selected from a polymerase, a helicase or a nuclease exonuclease. More preferably, the helicase is selected from He1308 helicase, RecD helicase, XPD helicase, Dda Helicase or ED1 helicase.
本发明的第二方面涉及一种用于表征分析物的构建体,所述构建体包含分析物和本发明第一方面的衔接体,其中所述衔接体与所述分析物的任一端或两端直接或间接连接。The second aspect of the present invention relates to a construct for characterizing an analyte, the construct comprising the analyte and the adaptor of the first aspect of the present invention, wherein the adaptor is directly or indirectly linked to either or both ends of the analyte.
本发明的第三方面涉及一种用于表征分析物的复合物,所述复合物包含多核苷酸结合蛋白和本发明第一方面的衔接体或本发明第二面的构建体;其中所述多核苷酸结合蛋白在所述第一阻滞元件和所述第二阻滞元件的联合作用下停滞在所述衔接体上。The third aspect of the present invention relates to a complex for characterizing an analyte, which comprises a polynucleotide binding protein and a linker of the first aspect of the present invention or a construct of the second aspect of the present invention; wherein the polynucleotide binding protein is arrested on the linker under the combined action of the first blocking element and the second blocking element.
优选地,所述多核苷酸结合蛋白衍生自多核苷酸处理酶;所述多核苷酸处理酶选自聚合酶、解旋酶或核酸外切酶。更优选地,所述解旋酶选自He1308解旋酶、RecD解旋酶、XPD解旋酶、Dda解旋酶或ED1解旋酶。Preferably, the polynucleotide binding protein is derived from a polynucleotide handling enzyme; the polynucleotide handling enzyme is selected from a polymerase, a helicase or an exonuclease. More preferably, the helicase is selected from a He1308 helicase, a RecD helicase, a XPD helicase, a Dda helicase or an ED1 helicase.
本发明的第四方面涉及一种控制多核苷酸结合蛋白在分析物上装载的方法,所述方法包括:A fourth aspect of the present invention relates to a method for controlling the loading of a polynucleotide binding protein on an analyte, the method comprising:
提供具有分析物的构建体,其中所述分析物在其一端或两端与衔接体直接或间接连接,所述衔接体包含一个或多个第一阻滞元件和一个或多个与所述第一阻滞元件不同的第二阻滞元件;以及将所述构建体与所述多核苷酸结合蛋白接触,使得所述多核苷酸结合蛋白在所述一个或多个第一阻滞元件和所述一个或多个第二阻滞元件的联合作用下停滞在所述衔接体上;或Providing a construct having an analyte, wherein the analyte is directly or indirectly connected to an adaptor at one or both ends thereof, the adaptor comprising one or more first retarding elements and one or more second retarding elements different from the first retarding elements; and contacting the construct with the polynucleotide binding protein, such that the polynucleotide binding protein is arrested on the adaptor under the combined action of the one or more first retarding elements and the one or more second retarding elements; or
使多核苷酸结合蛋白装载到衔接体上,所述衔接体包含一个或多个第一阻滞元件和一个或多个与所述第一阻滞元件不同的第二阻滞元件,所述多核苷酸结合蛋白在所述一个或多个第一阻滞元件和所述一个或多个第二阻滞元件的联合作用下停滞在所述衔接体上;以及使所述装载有多核苷酸结合蛋白的衔接体连接到所述分析物。The polynucleotide binding protein is loaded onto a linker, wherein the linker comprises one or more first blocking elements and one or more second blocking elements different from the first blocking elements, and the polynucleotide binding protein is arrested on the linker under the combined action of the one or more first blocking elements and the one or more second blocking elements; and the linker loaded with the polynucleotide binding protein is connected to the analyte.
优选地,所述衔接体如本发明第一方面所定义。Preferably, the adapter is as defined in the first aspect of the present invention.
优选地,所述多核苷酸结合蛋白衍生自多核苷酸处理酶;所述多核苷酸处理酶选自聚合酶、解旋酶或核酸外切酶。更优选地,所述解旋酶选自He1308解旋酶、RecD解旋酶、XPD解旋酶、Dda解旋酶或ED1解旋酶。Preferably, the polynucleotide binding protein is derived from a polynucleotide handling enzyme; the polynucleotide handling enzyme is selected from a polymerase, a helicase or an exonuclease. More preferably, the helicase is selected from a He1308 helicase, a RecD helicase, a XPD helicase, a Dda helicase or an ED1 helicase.
本发明的第五方面涉及一种控制分析物穿过跨膜孔的移动的方法,所述方法包括:A fifth aspect of the invention relates to a method of controlling the movement of an analyte through a transmembrane pore, the method comprising:
(a)实施本发明第四方面的控制多核苷酸结合蛋白在分析物上装载的方法;(a) implementing the method for controlling the loading of a polynucleotide binding protein on an analyte according to the fourth aspect of the present invention;
(b)将步骤(a)中提供的装载有所述多核苷酸结合蛋白的分析物与所述跨膜孔接触;以及(b) contacting the analyte loaded with the polynucleotide binding protein provided in step (a) with the transmembrane pore; and
(c)跨所述跨膜孔施加电势,使得所述多核苷酸结合蛋白移动穿过所述一个或多个第一阻滞元件、任选地(即穿过或不穿过)所述一个或多个第二阻滞元件的部分区域,并控制所述分析物穿过所述跨膜孔的移动。(c) applying an electric potential across the transmembrane pore so that the polynucleotide binding protein moves through the one or more first retardation elements, optionally (i.e., through or not through) a portion of the one or more second retardation elements, and controls the movement of the analyte through the transmembrane pore.
优选地,所述第二阻滞元件包含经共价修饰的核苷酸或经非共价修饰的核苷酸;当所述多核苷酸结合蛋白移动穿过所述一个或多个第二阻滞元件时,所述多核苷酸结合蛋白移动穿过所述一个或多个第二阻滞元件的核苷酸,而不穿过所述一个或多个第二阻滞元件的共价或非共价修饰物。Preferably, the second blocking element comprises covalently modified nucleotides or non-covalently modified nucleotides; when the polynucleotide binding protein moves through the one or more second blocking elements, the polynucleotide binding protein moves through the nucleotides of the one or more second blocking elements without passing through the covalent or non-covalent modifications of the one or more second blocking elements.
优选地,所述方法包括提供系链用于使所述装载有所述多核苷酸结合蛋白的分析物靠近所述跨膜孔;所述系链包括捕获区和锚定区,所述捕获区用于捕获所述衔接体,所述锚定区用于与所述跨膜孔或所述跨膜孔所在的膜锚定结合。 Preferably, the method includes providing a tether for bringing the analyte loaded with the polynucleotide binding protein close to the transmembrane pore; the tether includes a capture region and an anchor region, the capture region is used to capture the adapter, and the anchor region is used to anchor and bind to the transmembrane pore or the membrane where the transmembrane pore is located.
本发明的第六方面涉及一种表征分析物的方法,所述方法包括:A sixth aspect of the present invention relates to a method for characterizing an analyte, the method comprising:
(a)实施本发明第五方面的控制分析物穿过跨膜孔的移动的方法;以及(a) implementing the method of controlling the movement of an analyte through a transmembrane pore according to the fifth aspect of the present invention; and
(b)随着所述分析物相对于所述跨膜孔移动,获取一个或多个测量值,其中所述测量值代表所述分析物的一个或多个特征,并由此表征所述分析物。(b) obtaining one or more measurements as the analyte moves relative to the transmembrane pore, wherein the measurements represent one or more characteristics of the analyte and thereby characterize the analyte.
优选地,所述跨膜孔是蛋白孔或固态孔,和/或,所述膜是两亲层或固态层。Preferably, the transmembrane pore is a protein pore or a solid-state pore, and/or the membrane is an amphiphilic layer or a solid-state layer.
优选地,所述分析物选自多核苷酸、多肽、脂质或多糖,优选多核苷酸,所述多核苷酸是完全双链多核苷酸、部分双链多核苷酸或单链多核苷酸。Preferably, the analyte is selected from a polynucleotide, a polypeptide, a lipid or a polysaccharide, preferably a polynucleotide, which is a fully double-stranded polynucleotide, a partially double-stranded polynucleotide or a single-stranded polynucleotide.
本发明的第七方面涉及一种表征分析物的试剂盒,所述试剂盒包含:A seventh aspect of the present invention relates to a kit for characterizing an analyte, the kit comprising:
(a)本发明第一方面的衔接体,和(b)多核苷酸结合蛋白,和/或(c)跨膜孔。(a) the adaptor according to the first aspect of the invention, and (b) a polynucleotide binding protein, and/or (c) a transmembrane pore.
优选地,所述多核苷酸结合蛋白衍生自多核苷酸处理酶;所述多核苷酸处理酶选自聚合酶、解旋酶或核酸外切酶。更优选地,所述解旋酶选自He1308解旋酶、RecD解旋酶、XPD解旋酶、Dda解旋酶或ED1解旋酶。Preferably, the polynucleotide binding protein is derived from a polynucleotide handling enzyme; the polynucleotide handling enzyme is selected from a polymerase, a helicase or an exonuclease. More preferably, the helicase is selected from a He1308 helicase, a RecD helicase, a XPD helicase, a Dda helicase or an ED1 helicase.
优选地,所述跨膜孔是蛋白孔或固态孔。Preferably, the transmembrane pore is a protein pore or a solid-state pore.
本发明的第八方面涉及有机寡阳离子作为用于停滞多核苷酸结合蛋白的阻滞元件的用途,所述有机寡阳离子具有通式Bj,Bj是j‐聚体的有机寡阳离子部分,j=1‐50,其中B选自包括以下基团的组:The eighth aspect of the present invention relates to the use of an organic oligocation as a blocking element for arresting a polynucleotide binding protein, wherein the organic oligocation has the general formula Bj, Bj is the organic oligocation part of a j-mer, j=1-50, wherein B is selected from the group comprising the following groups:
‐HPO3‐R1‐(X‐R2 n)n1‐X‐R3‐O‐,其中R1、R2 n和R3是相同或不同的C1‐C5低级亚烷基,X是NH或NC(NH2)2,n1=2‐20,-HPO 3 -R 1 -(X-R 2 n ) n1 -X-R 3 -O-, wherein R 1 , R 2 n and R 3 are the same or different C1-C5 lower alkylene groups, X is NH or NC(NH 2 ) 2 , n1=2-20,
‐HPO3‐R4‐CH(R5X1)‐R6‐O‐,其中R4是C1‐C5低级亚烷基,R5和R6是相同或不同的C1‐C5低级亚烷基,X1为腐胺、亚精胺或精胺残基,-HPO 3 -R 4 -CH(R 5 X 1 )-R 6 -O-, wherein R 4 is a C1-C5 lower alkylene group, R 5 and R 6 are the same or different C1-C5 lower alkylene groups, and X 1 is a putrescine, spermidine or spermine residue,
‐HPO3‐R7‐(aa)n2‐R8‐O‐,其中R7是C1‐C5低级亚烷基,R8是C1‐C5低级亚烷基、丝氨酸、天然氨基醇,(aa)n2是含有具有阳离子侧链的天然氨基酸的肽,n2=2‐20;‐HPO 3 ‐R 7 ‐(aa) n2 ‐R 8 ‐O‐, wherein R 7 is C1‐C5 lower alkylene, R 8 is C1‐C5 lower alkylene, serine, natural amino alcohol, (aa) n2 is a peptide containing a natural amino acid having a cationic side chain, and n2=2‐20;
优选地,所述有机寡阳离子选自精胺。Preferably, the organic oligocation is selected from spermine.
本发明的第九方面涉及本发明第一方面的衔接体、本发明第二方面的构建体、本发明第三方面的复合物、本发明第三至第六方面的方法、或本发明第七方面的试剂盒在制备用于表征分析物的产品或在表征分析物中的应用。The ninth aspect of the present invention relates to the use of the linker of the first aspect of the present invention, the construct of the second aspect of the present invention, the complex of the third aspect of the present invention, the methods of the third to sixth aspects of the present invention, or the kit of the seventh aspect of the present invention in the preparation of a product for characterizing an analyte or in characterizing an analyte.
本发明的技术方案取得了以下技术效果:The technical solution of the present invention achieves the following technical effects:
本发明采用了通过采用一种以上的阻滞元件联合作用对酶进行停滞,进一步拓展了间隔区的应用范围,从而使某些阻滞元件单独使用时无法起到良好的阻滞效果甚至没有阻滞效果,但是与另一阻滞元件联合作用时,可以实现良好的阻滞效果。The present invention uses more than one blocking element to jointly act to arrest the enzyme, further expanding the application range of the spacer region, so that some blocking elements cannot play a good blocking effect or even have no blocking effect when used alone, but can achieve a good blocking effect when used in combination with another blocking element.
另外,现有技术使用的多种间隔区,如iSp18或iSpC9等无碱基基团形成的单链核酸作为间隔区,虽然电场力的作用会使酶跨过间隔区,但是该阻滞元件会一直存在不会消失,因而这种间隔区只适合用于内推测序法(entry sequencing),不能用于外拉测序法(outry sequencing)。而本发明中,第一阻滞元件和第二阻滞元件形成的联合阻滞结构会因序列通过纳米孔而被破坏或消除,因而可以 进一步拓展其应用,使得其不仅可以用于内推测序法,还可以用于外拉测序法。“内推测序法”是指电场力的方向与酶运动方向相同而使酶通过进而对分析物进行测序的方法,“外拉测序法”是指电场力的方向与酶运动方向相反而使酶通过进而对分析物进行测序的方法。In addition, the prior art uses a variety of spacers, such as single-stranded nucleic acids formed by a base-free group such as iSp18 or iSpC9 as spacers. Although the electric field force causes the enzyme to cross the spacer, the blocking element will always exist and will not disappear. Therefore, this spacer is only suitable for entry sequencing and cannot be used for outry sequencing. In the present invention, the combined blocking structure formed by the first blocking element and the second blocking element will be destroyed or eliminated as the sequence passes through the nanopore, so it can be Further expand its application, so that it can be used not only for internal push sequencing, but also for external pull sequencing. "Internal push sequencing" refers to a method in which the direction of the electric field force is the same as the direction of the enzyme movement, so that the enzyme passes through and then the analyte is sequenced, and "external pull sequencing" refers to a method in which the direction of the electric field force is opposite to the direction of the enzyme movement, so that the enzyme passes through and then the analyte is sequenced.
附图说明BRIEF DESCRIPTION OF THE DRAWINGS
图1示出了一种实施方式的含两个阻滞元件的Y衔接体以及酶穿过阻滞元件1的结构示意图。其中标记表示如下:(Y1)包含阻滞元件1的主体链;(Y2)第三链,也称荧光链;(B)包含阻滞元件2的阻挡链。其中序列Y1的部分区段与Y2形成互补双链区,Y1的部分区段与B部分互补,B上的阻滞元件2的核苷酸与Y1互补。Figure 1 shows a schematic diagram of a Y adapter containing two blocking elements and an enzyme passing through the blocking element 1 according to an embodiment. The labels are as follows: (Y1) the main strand containing the blocking element 1; (Y2) the third strand, also known as the fluorescent strand; (B) the blocking strand containing the blocking element 2. Part of the sequence Y1 forms a complementary double-stranded region with Y2, part of the sequence Y1 is complementary to the B part, and the nucleotides of the blocking element 2 on B are complementary to Y1.
图2示出了多种含两个阻滞元件的Y衔接体的例示性结构示意图。两个阻滞元件的组合关系分别为阻滞元件1+阻滞元件2、阻滞元件1+阻滞元件2+阻滞元件1、阻滞元件1+阻滞元件2+阻滞元件1+阻滞元件2,以及阻滞元件2+阻滞元件1。Fig. 2 shows a schematic diagram of the exemplary structures of various Y adapters containing two blocking elements. The combination relationship of the two blocking elements is blocking element 1 + blocking element 2, blocking element 1 + blocking element 2 + blocking element 1, blocking element 1 + blocking element 2 + blocking element 1 + blocking element 2, and blocking element 2 + blocking element 1.
序列表说明:Description of sequence listing:
SEQ ID NO:1示出了Y接头的主体链Y1:SEQ ID NO: 1 shows the main chain Y1 of the Y linker:
5'-(iSpC3)30-GCGGA GTCAA ACGGT AGAAG TCG TTTTT TTTTT ACTGC TCATT CGGTC CTGCT GACT-3’5'-(iSpC3)30-GCGGA GTCAA ACGGT AGAAG TCG TTTTT TTTTT ACTGC TCATT CGGTC CTGCT GACT-3'
SEQ ID NO:2示出了Y接头的包含3C3的主体链Y1-3C3:SEQ ID NO: 2 shows the main chain Y1-3C3 containing 3C3 of the Y linker:
5'-(iSpC3)30-GCGGA GTCAA ACGGT AGAAG TCG TTTTT TTTTT-(iSpC3)3-ACTGC TCATT CGGTC CTGCT GACT-3’5'-(iSpC3)30-GCGGA GTCAA ACGGT AGAAG TCG TTTTT TTTTT-(iSpC3)3-ACTGC TCATT CGGTC CTGCT GACT-3'
SEQ ID NO:3示出了Y接头的包含5C3的主体链Y1-5C3:SEQ ID NO: 3 shows the main chain Y1-5C3 containing 5C3 of the Y linker:
5'-(iSpC3)30-GCGGA GTCAA ACGGT AGAAG TCG TTTTT TTTTT-(iSpC3)5-ACTGC TCATT CGGTC CTGCT GACT-35'-(iSpC3)30-GCGGA GTCAA ACGGT AGAAG TCG TTTTT TTTTT-(iSpC3)5-ACTGC TCATT CGGTC CTGCT GACT-3
SEQ ID NO:4示出了Y接头的包含5C3的主体链Y1-5C3-cPIP:SEQ ID NO: 4 shows the main chain Y1-5C3-cPIP containing 5C3 of the Y linker:
5'-GCGGA GTCAA ACGGT AGAAG TCG TTTTT TTTTT-(iSpC3)5-CGAGC AGCAC GCGAG CAGCA CG GACT-3'5'-GCGGA GTCAA ACGGT AGAAG TCG TTTTT TTTTT-(iSpC3)5-CGAGC AGCAC GCGAG CAGCA CG GACT-3'
SEQ ID NO:5示出了Y接头的包含Sp的主体链Y1-Sp:SEQ ID NO:5 shows the main chain Y1-Sp containing Sp of the Y linker:
5'-(iSpC3)30-GCGGA GTCAA ACGGT AGAAG TCG TTTTT TTTTT-Sp-ACTGC TCATT CGGTC CTGCT GACT-3’5'-(iSpC3)30-GCGGA GTCAA ACGGT AGAAG TCG TTTTT TTTTT-Sp-ACTGC TCATT CGGTC CTGCT GACT-3'
SEQ ID NO:6示出了Y接头的包含2Sp的主体链Y1-2Sp:SEQ ID NO:6 shows the main chain Y1-2Sp containing 2Sp of the Y linker:
5'-(iSpC3)30-GCGGA GTCAA ACGGT AGAAG TCG TTTTT TTTTT-(Sp)2-ACTGC TCATT CGGTC CTGCT GACT-3’5'-(iSpC3)30-GCGGA GTCAA ACGGT AGAAG TCG TTTTT TTTTT-(Sp)2-ACTGC TCATT CGGTC CTGCT GACT-3'
SEQ ID NO:7示出了Y接头的包含4C18的主体链Y1-4C18:SEQ ID NO:7 shows the main chain Y1-4C18 containing 4C18 of the Y linker:
5'-(iSpC3)30-GCGGA GTCAA ACGGT AGAAG TCG TTTTT TTTTT-(iSpC18)4-ACTGC TCATT CGGTC CTGCT GACT-3’5'-(iSpC3)30-GCGGA GTCAA ACGGT AGAAG TCG TTTTT TTTTT-(iSpC18)4-ACTGC TCATT CGGTC CTGCT GACT-3'
SEQ ID NO:8示出了Y接头的荧光链Y2,该序列的3’端连接荧光基团CY5: SEQ ID NO: 8 shows the fluorescent chain Y2 of the Y linker, and the 3' end of the sequence is connected to the fluorescent group CY5:
5'-CGACT TCTAC CGTTT GACTC CGC-CY5-3'5'-CGACT TCTAC CGTTT GACTC CGC-CY5-3'
SEQ ID NO:9示出了Y接头的阻挡链B,其中部分核苷酸经OMe修饰以防止其与解旋酶结合:SEQ ID NO:9 shows the blocking strand B of the Y linker, in which some nucleotides are modified with OMe to prevent them from binding to the helicase:
5'-P-GTCAG CAGGA CCGAA TGA(GC AGTAG TCCAG CACCG ACC)OMe-3'5'-P-GTCAG CAGGA CCGAA TGA(GC AGTAG TCCAG CACCG ACC) OMe -3'
SEQ ID NO:10示出了Y接头的包含经LNA修饰的核苷酸的阻挡链B-LNA:SEQ ID NO: 10 shows the blocking strand B-LNA of the Y linker comprising LNA-modified nucleotides:
5'-P-GTCAG CAGGA CCGAA TGA(GCAGT)LNA(AGTCC AGCAC CGACC)OMe-3'5'-P-GTCAG CAGGA CCGAA TGA(GCAGT) LNA (AGTCC AGCAC CGACC) OMe -3'
SEQ ID NO:11示出了Y接头的包含经PNA修饰的核苷酸的阻挡链B-PNA:SEQ ID NO: 11 shows the blocking strand B-PNA of the Y linker comprising a PNA-modified nucleotide:
5'-(GTCAG CAGGA CCGAA TGAGC AGT)PNA-3'5'-(GTCAG CAGGA CCGAA TGAGC AGT) PNA- 3'
SEQ ID NO:12示出了Y接头的阻挡链序列B-cPIP:SEQ ID NO: 12 shows the blocking strand sequence B-cPIP of the Y linker:
5'-GTCCG TGCTG CTCGC GTGCT GCTCG-3'5'-GTCCG TGCTG CTCGC GTGCT GCTCG-3'
SEQ ID NO:13示出了系链,该链的5’端连接胆固醇:SEQ ID NO: 13 shows a tether with cholesterol attached to the 5' end of the chain:
5’-Chol-(iSpC3)20-TTGGTGGGTGGGTGGG-3’5’-Chol-(iSpC3)20-TTGGTGGGTGGGTGGG-3’
SEQ ID NO:14示出了解旋酶E1的野生型序列:
SEQ ID NO: 14 shows the wild-type sequence of helicase E1:
SEQ ID NO:15示出了解旋酶E2的野生型序列:
SEQ ID NO: 15 shows the wild-type sequence of helicase E2:
具体实施方式Detailed ways
应理解,所公开的产物和方法的不同应用可根据本领域内的具体需要进行调整。还应理解,本文所用的术语只是为了描述本发明的具体实施方案的目的,而非意图进行限制。It should be understood that different applications of the disclosed products and methods can be adjusted according to specific needs in the art. It should also be understood that the terminology used herein is only for the purpose of describing specific embodiments of the present invention and is not intended to be limiting.
本文中——无论在上文还是下文——引用的所有出版物、专利和专利申请以引用的方式全文纳入本文。 All publications, patents and patent applications cited herein, whether supra or infra, are hereby incorporated by reference in their entirety.
定义definition
为了更清楚地解释本发明的实施方式,本文中使用了一些科学术语和专有名词。除非在本文中进行了明确定义,所有这些术语和名词应当被理解为具有本领域技术人员所通常理解的含义。为了更清楚起见,对于本文中使用的某些术语进行了以下定义。In order to explain the embodiments of the present invention more clearly, some scientific terms and special terms are used herein. Unless clearly defined herein, all these terms and special terms should be understood to have the meanings commonly understood by those skilled in the art. For the sake of clarity, some terms used in this article are defined below.
衔接体Connector
本发明提供了用于表征核酸的衔接体,所述衔接体能够连接分析物。衔接体也称为接头、衔接子等。本发明的衔接体包含一个或多个第一阻滞元件和一个或多个第二阻滞元件,第一阻滞元件和第二阻滞元件在结构和组成上完全不同。衔接体包含互补,优选部分互补的主体链和阻挡链,主体链和阻挡链均为由核苷酸、核苷酸类似物或修饰的核苷酸等连接形成的单链结构或部分核苷酸被修饰的单链结构。任意数量的第一阻滞元件,如1个、2个、3个、4个、5个、6个、7个、8和、9个、10个或更多个第一阻滞元件以共价键连接到主体链上。任意数量的第二阻滞元件,如1个、2个、3个、4个、5个、6个、7个、8和、9个、10个或更多个第二阻滞元件以共价键修饰到主体链或阻挡链上(如LNA、PNA等)或以非共价键与主体链、阻挡链或主体链和阻挡链形成的双链互作(如cPIP等)。The present invention provides an adapter for characterizing nucleic acids, wherein the adapter is capable of connecting an analyte. An adapter is also referred to as a joint, an adapter, etc. The adapter of the present invention comprises one or more first retarding elements and one or more second retarding elements, and the first retarding element and the second retarding element are completely different in structure and composition. The adapter comprises a complementary, preferably partially complementary main chain and a blocking chain, and both the main chain and the blocking chain are single-stranded structures formed by connecting nucleotides, nucleotide analogs or modified nucleotides, or single-stranded structures in which some nucleotides are modified. Any number of first retarding elements, such as 1, 2, 3, 4, 5, 6, 7, 8 and, 9, 10 or more first retarding elements are covalently connected to the main chain. Any number of second blocking elements, such as 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more second blocking elements are modified to the main chain or the blocking chain with a covalent bond (such as LNA, PNA, etc.) or interact with the main chain, the blocking chain or the double chain formed by the main chain and the blocking chain with a non-covalent bond (such as cPIP, etc.).
当一个或多个第二阻滞元件以共价键修饰到主体链上时,一个或多个第二阻滞元件与阻挡链互补;当一个或多个第二阻滞元件以共价键修饰到阻挡链上时,一个或多个第二阻挡元件与主体链互补;当一个或多个第二阻滞元件以非共价键与主体链和/或阻挡链互作时,第二阻滞元件可与阻挡链或主体链互补。When one or more second retarding elements are modified to the main chain by covalent bonds, the one or more second retarding elements are complementary to the blocking chain; when one or more second retarding elements are modified to the blocking chain by covalent bonds, the one or more second retarding elements are complementary to the main chain; when one or more second retarding elements interact with the main chain and/or the blocking chain by non-covalent bonds, the second retarding elements may be complementary to the blocking chain or the main chain.
第二阻滞元件包含经修饰物修饰的核苷酸,当一个或多个第二阻滞元件的核苷酸部分位于主体链上并且修饰物部分以非共价键与主体链和/或阻挡链互作时,第二阻滞元件与阻挡链互补;当一个或多个第二阻滞元件的核苷酸部分位于阻挡链上并且修饰物部分以非共价键与主体链和/或阻挡链互作时,第二阻滞元件与主体链互补。The second blocking element comprises a nucleotide modified by a modifier, and when the nucleotide portion of one or more second blocking elements is located on the main chain and the modifier portion interacts with the main chain and/or the blocking chain by a non-covalent bond, the second blocking element is complementary to the blocking chain; when the nucleotide portion of one or more second blocking elements is located on the blocking chain and the modifier portion interacts with the main chain and/or the blocking chain by a non-covalent bond, the second blocking element is complementary to the main chain.
第二阻滞元件包含经修饰物修饰的核苷酸,本文所述“第二阻滞元件位于主体链或阻挡链上”是指第二阻滞元件所包含的核苷酸连接到主体链或阻挡链上,而非对修饰物的位置的限定。The second blocking element comprises nucleotides modified with a modifier. The phrase "the second blocking element is located on the main strand or the blocking strand" herein means that the nucleotides comprised by the second blocking element are connected to the main strand or the blocking strand, rather than limiting the position of the modifier.
在一些实施例中,一个或多个第一阻滞元件与一个或多个第二阻滞元件可不以碱基互补配对方式结合。在一些实施例中,一个或多个第二阻滞元件与阻挡链或主体链可以碱基互补配对方式结合。In some embodiments, the one or more first retarding elements may not bind to the one or more second retarding elements in a complementary base pairing manner. In some embodiments, the one or more second retarding elements may bind to the barrier strand or the main strand in a complementary base pairing manner.
一个或多个第一阻滞元件与一个或多个第二阻滞元件可紧邻设置或间隔设置。The one or more first blocking elements and the one or more second blocking elements may be disposed adjacent to each other or spaced apart from each other.
在一些实施方式中,一个或多个第一阻滞元件与一个或多个第二阻滞元件紧邻设置。在多核苷酸结合蛋白在主体链移动的方向上,至少一个第一阻滞元件紧邻设置在至少一个第二阻滞元件之前或之后或者与至少一个第二阻滞元件互补的区段之前或之后,二者之间不存在任何基团。In some embodiments, one or more first retarding elements are disposed adjacent to one or more second retarding elements. In the direction in which the polynucleotide binding protein moves in the main chain, at least one first retarding element is disposed adjacent to at least one second retarding element or to a segment complementary to at least one second retarding element, and no group exists between the two.
在一些实施方式中,一个或多个第二阻滞元件紧邻设置在一起。一个或多个第一阻滞元件可彼此紧邻设置,或与一个或多个第二阻滞元件紧邻设置,或者与在主体链上与第二阻滞元件互补的区段紧邻设置。In some embodiments, one or more second blocking elements are disposed adjacent to each other. One or more first blocking elements may be disposed adjacent to each other, or adjacent to one or more second blocking elements, or adjacent to a segment on the main chain that is complementary to a second blocking element.
本文所述“第一阻滞元件紧邻设置”的含义指主体链上的一个或多个第一阻滞元件之间没有任何 基团形成的空隙,而是直接连接在一起。本文所述“第一阻滞元件与第二阻滞元件紧邻设置”的含义指主体链上的第一阻滞元件与位于主体链上的第二阻滞元件之间、或主体链上的第一阻滞元件与在主体链上与位于阻挡链上的第二阻滞元件互补的区段之间没有任何基团形成的空隙,而是直接连接在一起。本文所述“第二阻滞元件紧邻设置”的含义指位于主体链或阻挡链上的一个或多个第二阻滞元件之间没有任何基团形成的空隙,而是直接连接在一起。The meaning of "the first blocking element is arranged in close proximity" in this article is that there is no The meaning of "the first blocking element and the second blocking element are arranged in close proximity" as described herein refers to that there is no gap formed by any group between the first blocking element on the main chain and the second blocking element located on the main chain, or between the first blocking element on the main chain and the segment on the main chain that is complementary to the second blocking element located on the blocking chain, but they are directly connected together. The meaning of "the second blocking element is arranged in close proximity" as described herein refers to that there is no gap formed by any group between one or more second blocking elements located on the main chain or the blocking chain, but they are directly connected together.
在一些实施方式中,一个或多个第一阻滞元件与一个或多个第二阻滞元件间隔设置。在多核苷酸结合蛋白在主体链移动的方向上,至少一个第一阻滞元件与至少一个第二阻滞元件或者与至少一个第二阻滞元件互补的区段之间存在一个或多个基团。In some embodiments, one or more first retarding elements are spaced apart from one or more second retarding elements. In the direction in which the polynucleotide binding protein moves along the main strand, one or more groups exist between at least one first retarding element and at least one second retarding element or a segment complementary to at least one second retarding element.
在一些实施方式中,一个或多个第二阻滞元件间隔设置。一个或多个第一阻滞元件可间隔设置,或与一个或多个第二阻滞元件间隔设置,或者与在主体链上与第二阻滞元件互补的区段间隔设置。In some embodiments, one or more second blocking elements are spaced apart. One or more first blocking elements may be spaced apart, or spaced apart from one or more second blocking elements, or spaced apart from a segment on the main chain that is complementary to the second blocking element.
本文所述“第一阻滞元件间隔设置”的含义指主体链上的一个或多个第一阻滞元件之间存在一个或多个基团,例如1、2、3、4、5、6、7、8、9、10或更多个基团。本文所述“第一阻滞元件与第二阻滞元件间隔设置”的含义指主体链上的第一阻滞元件与位于主体链上的第二阻滞元件之间、或主体链上的第一阻滞元件与在主体链上与位于阻挡链上的第二阻滞元件互补的区段之间存在一个或多个基团,例如1、2、3、4、5、6、7、8、9、10或更多个基团。本文所述“第二阻滞元件间隔设置”的含义指位于主体链或阻挡链上的一个或多个第二阻滞元件之间存在一个或多个基团,例如1、2、3、4、5、6、7、8、9、10或更多个基团。The meaning of "the first retarding element is arranged at intervals" as described herein refers to the presence of one or more groups, such as 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more groups, between one or more first retarding elements on the main chain. The meaning of "the first retarding element and the second retarding element are arranged at intervals" as described herein refers to the presence of one or more groups, such as 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more groups, between the first retarding element on the main chain and the second retarding element located on the main chain, or between the first retarding element on the main chain and the segment on the main chain that is complementary to the second retarding element located on the blocking chain. The meaning of "the second retarding element is arranged at intervals" as described herein refers to the presence of one or more groups, such as 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more groups, between one or more second retarding elements located on the main chain or the blocking chain.
如图2所示,一个或多个第一阻滞元件和一个或多个第二阻滞元件可以以任何组合或位置设置在衔接体上。在一些实施方式中,衔接体上包括在多核苷酸结合蛋白移动的方向上紧邻设置或间隔设置的1个第一阻滞元件和1个第二阻滞元件。在一些实施方式中,衔接体上包括在多核苷酸结合蛋白移动的方向上紧邻设置或间隔设置的1个第一阻滞元件、1个第二阻滞元件和1个第一阻滞元件。在一些实施方式中,衔接体上包括在多核苷酸结合蛋白移动的方向上紧邻设置或间隔设置的1个第一阻滞元件、1个第二阻滞元件、1个第一阻滞元件和1个第二阻滞元件。在一些实施方式中,衔接体上包括在多核苷酸结合蛋白移动的方向上紧邻设置或间隔设置的1个第二阻滞元件和1个第一阻滞元件。多核苷酸结合蛋白移动的方向通常是5’-3’,有些多核苷酸结合蛋白是按照3’-5’的方向移动的。As shown in Figure 2, one or more first retarding elements and one or more second retarding elements can be arranged on the adapter in any combination or position. In some embodiments, the adapter includes 1 first retarding element and 1 second retarding element that are arranged adjacent to or spaced in the direction of movement of the polynucleotide binding protein. In some embodiments, the adapter includes 1 first retarding element, 1 second retarding element and 1 first retarding element that are arranged adjacent to or spaced in the direction of movement of the polynucleotide binding protein. In some embodiments, the adapter includes 1 first retarding element, 1 second retarding element, 1 first retarding element and 1 second retarding element that are arranged adjacent to or spaced in the direction of movement of the polynucleotide binding protein. In some embodiments, the adapter includes 1 second retarding element and 1 first retarding element that are arranged adjacent to or spaced in the direction of movement of the polynucleotide binding protein. The direction of movement of the polynucleotide binding protein is usually 5'-3', and some polynucleotide binding proteins move in the 3'-5' direction.
在衔接体与多核苷酸结合蛋白接触之后,一个或多个第一阻滞元件和一个或多个第二阻滞元件能够联合作用来停滞多核苷酸结合蛋白。多核苷酸结合蛋白可以被停滞在第一阻滞元件或第二阻滞元件之前,优选停滞在第一个第一阻滞元件或第一个第二阻滞元件之前。After the adaptor contacts the polynucleotide binding protein, one or more first blocking elements and one or more second blocking elements can work together to arrest the polynucleotide binding protein. The polynucleotide binding protein can be arrested before the first blocking element or the second blocking element, preferably before the first first blocking element or the first second blocking element.
第一阻滞元件可具有停滞一个或多个多核苷酸结合蛋白的任意分子或任意分子的组合,优选地具有与核苷酸不同的结构或由与核苷酸不同的结构组成。更优选地,第一阻滞元件具有一个或多个有机寡阳离子、一个或多个iSpC3、一个或多个iSp18、一个或多个iSp9、一个或多个硝基吲哚、一个或多个肌苷、一个或多个吖啶、一个或多个2-氨基嘌呤、一个或多个2-6-二氨基嘌呤、一个或多个5-溴-脱氧尿嘧啶、一个或多个反向胸苷(反向dT)、一个或多个反向二脱氧胸苷(ddT)、一个或多个 二脱氧胞苷(ddC)、一个或多个5-甲基胞苷酸、一个或多个5-羟甲基胞苷、一个或多个2'-O-甲基RNA碱基、一个或多个异脱氧胞苷(异-dC)、一个或多个异脱氧鸟苷(异-dG)、一个或多个光裂解(PC)的基团或一个或多个己二醇连接。The first blocking element may have any molecule or any combination of molecules that arrest one or more polynucleotide binding proteins, preferably having a structure different from nucleotides or consisting of a structure different from nucleotides. More preferably, the first blocking element has one or more organic oligocations, one or more iSpC3, one or more iSp18, one or more iSp9, one or more nitroindoles, one or more inosine, one or more acridine, one or more 2-aminopurine, one or more 2-6-diaminopurine, one or more 5-bromo-deoxyuracil, one or more inverted thymidine (inverted dT), one or more inverted dideoxythymidine (ddT), one or more dideoxycytidine (ddC), one or more 5-methylcytidylic acid, one or more 5-hydroxymethylcytidine, one or more 2'-O-methyl RNA bases, one or more isodeoxycytidine (iso-dC), one or more isodeoxyguanosine (iso-dG), one or more photocleavable (PC) groups or one or more hexanediol linkages.
优选地,有机寡阳离子具有通式Bj,Bj是j‐聚体的有机寡阳离子部分,j=1‐50,其中B选自包括以下基团的组:Preferably, the organic oligocation has the general formula Bj, Bj is the organic oligocation part of a j-mer, j=1-50, wherein B is selected from the group comprising the following groups:
‐HPO3‐R1‐(X‐R2 n)n1‐X‐R3‐O‐,其中R1、R2 n和R3是相同或不同的C1‐C5低级亚烷基,X是NH或NC(NH2)2,n1=2‐20,-HPO 3 -R 1 -(X-R 2 n ) n1 -X-R 3 -O-, wherein R 1 , R 2 n and R 3 are the same or different C1-C5 lower alkylene groups, X is NH or NC(NH 2 ) 2 , n1=2-20,
‐HPO3‐R4‐CH(R5X1)‐R6‐O‐,其中R4是C1‐C5低级亚烷基,R5和R6是相同或不同的C1‐C5低级亚烷基,X1为腐胺、亚精胺或精胺残基,-HPO 3 -R 4 -CH(R 5 X 1 )-R 6 -O-, wherein R 4 is a C1-C5 lower alkylene group, R 5 and R 6 are the same or different C1-C5 lower alkylene groups, and X 1 is a putrescine, spermidine or spermine residue,
‐HPO3‐R7‐(aa)n2‐R8‐O‐,其中R7是C1‐C5低级亚烷基,R8是C1‐C5低级亚烷基、丝氨酸、天然氨基醇,(aa)n2是含有具有阳离子侧链的天然氨基酸的肽,n2=2‐20。‐HPO 3 ‐R 7 ‐(aa) n2 ‐R 8 ‐O‐, wherein R 7 is C1‐C5 lower alkylene, R 8 is C1‐C5 lower alkylene, serine, natural amino alcohol, (aa) n2 is a peptide containing a natural amino acid having a cationic side chain, and n2=2‐20.
更优选地,所述有机寡阳离子选自精胺。More preferably, the organic oligocation is selected from spermine.
更优选地,第一阻滞元件包含1个、2个、3个、4个、5个、6个、7个、8个或更多的精胺、iSpC3、iSp18或iSp9。最优选的,第一阻滞元件是3个、4个或5个iSpC3,或1个、2个、3个、4个或5个精胺。More preferably, the first retarding element comprises 1, 2, 3, 4, 5, 6, 7, 8 or more spermine, iSpC3, iSp18 or iSp9. Most preferably, the first retarding element is 3, 4 or 5 iSpC3, or 1, 2, 3, 4 or 5 spermine.
第二阻滞元件具有一个或多个提高双链稳定性的基团,比如增加双链的解链难度或提高双链Tm值的任意基团或任意基团的组合,优选地具有经修饰的核苷酸或由经修饰的核苷酸组成。优选地,第二阻滞元件包含一个或多个经锁核酸(LNA)修饰的核苷酸、一个或多个经肽核酸(PNA)修饰的核苷酸、一个或多个经甲氧基(OMe)修饰的核苷酸、一个或多个经双环核苷(BNA)修饰的核苷酸、一个或多个经甘油核酸(GNA)修饰的核苷酸、一个或多个经苏糖核酸(TNA)修饰的核苷酸、一个或多个经环吡咯咪唑聚酰胺(cPIP)修饰的核苷酸或一个或多个经双链结合蛋白修饰的核苷酸连接。The second blocking element has one or more groups that improve the stability of the double-stranded chain, such as any group or combination of any groups that increases the difficulty of unwinding the double-stranded chain or increases the Tm value of the double-stranded chain, preferably has a modified nucleotide or is composed of a modified nucleotide. Preferably, the second blocking element comprises one or more nucleotides modified by locked nucleic acid (LNA), one or more nucleotides modified by peptide nucleic acid (PNA), one or more nucleotides modified by methoxy (OMe), one or more nucleotides modified by bicyclic nucleoside (BNA), one or more nucleotides modified by glycerol nucleic acid (GNA), one or more nucleotides modified by threose nucleic acid (TNA), one or more nucleotides modified by cyclic pyrrolimidazole polyamide (cPIP) or one or more nucleotides modified by double-stranded binding protein.
更优选地,第二阻滞元件包含1个、2个、3个、4个、5个、6个、7个、8个、9个、10个、13个、15个、20个、23个、25个或更多的经LNA修饰的核苷酸、经PNA修饰的核苷酸、经OMe修饰的核苷酸。最优选的,第二阻滞元件是3个、4个、5个或6个经LNA修饰的核苷酸,或15个、20个、23个或25个经PNA修饰的核苷酸、经5个、10个、15个或20个经OMe修饰的核苷酸。More preferably, the second blocking element comprises 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 13, 15, 20, 23, 25 or more LNA-modified nucleotides, PNA-modified nucleotides, OMe-modified nucleotides. Most preferably, the second blocking element is 3, 4, 5 or 6 LNA-modified nucleotides, or 15, 20, 23 or 25 PNA-modified nucleotides, 5, 10, 15 or 20 OMe-modified nucleotides.
在一些实施方式中,在主体链上,一个或多个紧邻设置或间隔设置的第一阻滞元件、第二阻滞元件或第一阻滞元件与第二阻滞元件的组合的两端分别与第一区段的一端和第二区段的一端连接。第一区段的另一端在测序时先进入跨膜孔并引导整个衔接体进入跨膜孔,第二区段的另一端与分析物直接连接或通过核苷酸与分析物间接连接,进而对分析物进行表征。第一区段是由核苷酸和无碱基基团形成的单链区段,第二区段是由核苷酸连接形成的单链区段。In some embodiments, on the main chain, two ends of one or more first blocking elements, second blocking elements, or a combination of the first blocking element and the second blocking element that are arranged adjacently or spaced apart are connected to one end of the first segment and one end of the second segment, respectively. During sequencing, the other end of the first segment first enters the transmembrane pore and guides the entire adapter into the transmembrane pore, and the other end of the second segment is directly connected to the analyte or indirectly connected to the analyte through nucleotides, thereby characterizing the analyte. The first segment is a single-stranded segment formed by nucleotides and abasic groups, and the second segment is a single-stranded segment formed by nucleotides.
在一些实施方式中,在阻挡链上,一个或多个紧邻设置或间隔设置的第二阻滞元件的两端分别与第三区段的一端和第四区段的一端连接。第三区段的另一端为游离末端,用于在测序时与系链结合用于使衔接体靠近跨膜孔。第四区段的另一端与分析物直接连接或通过核苷酸与分析物间接连接,进而对分析物进行表征。在一些实施方式中,第四区段的另一端为游离末端,不与任何基团连接。 第三区段是由核苷酸和/或经修饰的核苷酸连接形成的单链区段,第四区段是由核苷酸形成的单链区段。主体链的第二区段可与阻挡链的第四链以碱基互补配对的方式结合形成双链或部分双链。部分双链是指同时包含双链和单链的结构。In some embodiments, on the blocking strand, two ends of one or more second blocking elements disposed adjacently or spaced apart are connected to one end of the third segment and one end of the fourth segment, respectively. The other end of the third segment is a free end, which is used to bind to the tether during sequencing to bring the adapter close to the transmembrane pore. The other end of the fourth segment is directly connected to the analyte or indirectly connected to the analyte through nucleotides, thereby characterizing the analyte. In some embodiments, the other end of the fourth segment is a free end and is not connected to any group. The third segment is a single-stranded segment formed by connecting nucleotides and/or modified nucleotides, and the fourth segment is a single-stranded segment formed by nucleotides. The second segment of the main strand can be combined with the fourth strand of the blocking strand in a base complementary pairing manner to form a double strand or a partial double strand. A partial double strand refers to a structure that contains both a double strand and a single strand.
衔接体还包含第三链,第三链是由核苷酸形成的单链。第三链也称为荧光链。第三链可与主体链的第一区段以碱基互补配对的方式结合形成双链或部分双链。The adapter also includes a third strand, which is a single strand formed by nucleotides. The third strand is also called a fluorescent strand. The third strand can be combined with the first segment of the main strand in a base complementary pairing manner to form a double strand or a partial double strand.
在一些实施方式中,主体链、阻挡链和第三链形成Y型衔接体。Y型衔接体的结构为一个柄连接两条臂,其中主体链的第二区段与阻挡链的第四区段结合形成Y衔接体的柄,柄的一端连接两条臂,另一端与分析物连接。一条臂由阻挡链的游离端组成,另一条臂由主体链的第一区段与第三链结合形成。In some embodiments, the main strand, the blocking strand and the third strand form a Y-shaped adapter. The structure of the Y-shaped adapter is a handle connecting two arms, wherein the second segment of the main strand is combined with the fourth segment of the blocking strand to form the handle of the Y-shaped adapter, one end of the handle is connected to the two arms, and the other end is connected to the analyte. One arm is composed of the free end of the blocking strand, and the other arm is formed by the first segment of the main strand combined with the third strand.
本发明的衔接体优选地与系链(tether)结合,所述系链为由多个核苷酸组成的单链多核苷酸,还包含不能形成互补双链的无碱基基团。系链的一端与衔接体互补结合,另一端锚定到膜或孔上,从而将衔接体所连接的分析物围绕在孔周围。The adapter of the present invention is preferably combined with a tether, which is a single-stranded polynucleotide composed of multiple nucleotides and also contains a baseless group that cannot form a complementary double strand. One end of the tether is complementary to the adapter and the other end is anchored to the membrane or the hole, thereby surrounding the analyte connected to the adapter around the hole.
本发明的衔接体可独立使用,也可与其他衔接体(比如发夹衔接体、类发夹衔接体或常规Y衔接体等)组合使用形成用于测序的构建体。在一些实施方案中,本发明的衔接体连接分析物例如多核苷酸的两端形成构建体用于表征多核苷酸。在一些实施方案中,本发明的衔接体连接在多核苷酸的5’端,多核苷酸的3’端连接本发明的衔接体以外的常规Y衔接体,形成构建体。在一些实施方案中,如需要对双链多核苷酸的两条链进行表征,可将本发明的衔接体连接双链多核苷酸的5’端,双链多核苷酸的3’端连接发夹衔接体或类发夹衔接体,形成构建体。类发夹衔接体为具有与常规发夹结构类似的环,但该环并不与常规发夹结构一样而是由一条单链线性分子自身回折形成,能够连接多核苷酸的两条链。对于类发夹衔接体的具体示例,参见CN113462764A。The adapter of the present invention can be used independently, or it can be used in combination with other adapters (such as hairpin adapters, hairpin-like adapters or conventional Y adapters, etc.) to form a construct for sequencing. In some embodiments, the adapter of the present invention connects the two ends of the analyte, such as a polynucleotide, to form a construct for characterizing the polynucleotide. In some embodiments, the adapter of the present invention is connected to the 5' end of the polynucleotide, and the 3' end of the polynucleotide is connected to a conventional Y adapter other than the adapter of the present invention to form a construct. In some embodiments, if it is necessary to characterize the two chains of a double-stranded polynucleotide, the adapter of the present invention can be connected to the 5' end of the double-stranded polynucleotide, and the 3' end of the double-stranded polynucleotide can be connected to a hairpin adapter or a hairpin-like adapter to form a construct. A hairpin-like adapter is a loop having a similar structure to a conventional hairpin, but the loop is not the same as a conventional hairpin structure but is formed by a single-stranded linear molecule folding back on itself, and can connect the two chains of a polynucleotide. For specific examples of hairpin-like adapters, see CN113462764A.
所述衔接体或系链中使用的无碱基基团为iSp18、iSpC3或iSp9等不能形成碱基对的基团,其也可以被称为“无碱基位点”或“无碱基核苷酸”。无碱基基团是在糖部分的1'位置处缺乏核碱基的核苷酸或核苷。The abasic groups used in the adapter or tether are groups such as iSp18, iSpC3 or iSp9 that cannot form base pairs, which may also be referred to as "abasic sites" or "abasic nucleotides." Abasic groups are nucleotides or nucleosides that lack a nucleobase at the 1' position of the sugar portion.
分析物Analyte
分析物选自多核苷酸、多肽、多糖和脂质中的一种或多种。分析物优选地为多核苷酸例如核酸,包括脱氧核糖核酸(DNA)和/或核糖核酸(RNA)。多核苷酸可以是单链或双链。多核苷酸可以是环状的。多核苷酸可以是适体,与微RNA杂交的探针或微RNA本身。多核苷酸可为任意长度。例如,多核苷酸可为至少10个,至少50个,至少100个,至少150个,至少200个,至少250个,至少300个,至少400个或至少500个核苷酸对的长度。多核苷酸可为1000或更多个核苷酸对,5000或更多个核苷酸对的长度或100000或更多个核苷酸对的长度。The analyte is selected from one or more of polynucleotides, polypeptides, polysaccharides and lipids. The analyte is preferably a polynucleotide such as a nucleic acid, including deoxyribonucleic acid (DNA) and/or ribonucleic acid (RNA). The polynucleotide can be single-stranded or double-stranded. The polynucleotide can be circular. The polynucleotide can be an aptamer, a probe hybridized with a microRNA, or the microRNA itself. The polynucleotide can be of any length. For example, the polynucleotide can be at least 10, at least 50, at least 100, at least 150, at least 200, at least 250, at least 300, at least 400 or at least 500 nucleotide pairs in length. The polynucleotide can be 1000 or more nucleotide pairs, 5000 or more nucleotide pairs in length or 100000 or more nucleotide pairs in length.
分析物可存在于任何适合的样品中。本发明通常在已知含有或怀疑含有分析物的样品上实施。本发明可以在含有一种或多种种类未知的分析物的样品上实施。或者,本发明可以在样品上实施以确认已知或预期存在于所述样品中的一种或多种分析物的种类。本领域技术人员可以预期的是,本发明的“提供分析物”是指提供包含分析物的样品,本发明的“测序接头与分析物连接”是指测序接头 与存在于样品中的分析物连接。The analyte may be present in any suitable sample. The present invention is generally implemented on a sample known to contain or suspected of containing the analyte. The present invention may be implemented on a sample containing one or more analytes of unknown species. Alternatively, the present invention may be implemented on a sample to confirm the species of one or more analytes known or expected to be present in the sample. It is contemplated by those skilled in the art that "providing an analyte" of the present invention refers to providing a sample containing the analyte, and "connecting a sequencing adapter to an analyte" of the present invention refers to connecting a sequencing adapter to a sequencing adapter. Binds to the analyte present in the sample.
多核苷酸Polynucleotide
多核苷酸可以是任何多核苷酸。多核苷酸如核酸是含有两个或更多个核苷酸的大分子。所述多核苷酸或核酸可包括任何核苷酸的任意组合。核苷酸可以是天然存在的或人工合成的。A polynucleotide can be any polynucleotide. A polynucleotide such as a nucleic acid is a macromolecule containing two or more nucleotides. The polynucleotide or nucleic acid can include any combination of nucleotides. Nucleotides can be naturally occurring or artificially synthesized.
核苷酸通常含有核碱基、糖和至少一个磷酸基团。所述核碱基和糖形成核苷。核苷酸可以是天然核苷酸或非天然核苷酸。Nucleotides generally contain a nucleobase, a sugar and at least one phosphate group. The nucleobase and sugar form a nucleoside. Nucleotides can be natural nucleotides or non-natural nucleotides.
核苷碱基通常为杂环的。核碱基包括但不限于:嘌呤和嘧啶,更具体地,腺嘌呤(A)、鸟嘌呤(G)、胸腺嘧啶(T)、尿嘧啶(U)和胞嘧啶(C)。Nucleoside bases are typically heterocyclic. Nucleobases include, but are not limited to, purines and pyrimidines, more specifically, adenine (A), guanine (G), thymine (T), uracil (U), and cytosine (C).
糖通常为戊糖。核苷酸糖包括但不限于,核糖和脱氧核糖。所述糖优选为脱氧核糖。The sugar is usually a pentose. Nucleotide sugars include, but are not limited to, ribose and deoxyribose. The sugar is preferably deoxyribose.
所述多核苷酸中的核苷酸通常是核糖核苷酸或脱氧核糖核苷酸。所述多核苷酸可包含以下核苷:腺苷,尿苷,鸟苷和胞苷。核苷酸优选为脱氧核糖核苷酸。所述多核苷酸优选包含下列核苷:脱氧腺苷(dA)、脱氧尿苷(dU)和/或胸苷(dT)、脱氧鸟苷(dG)和脱氧胞苷(dC)。The nucleotides in the polynucleotide are usually ribonucleotides or deoxyribonucleotides. The polynucleotide may contain the following nucleosides: adenosine, uridine, guanosine and cytidine. The nucleotides are preferably deoxyribonucleotides. The polynucleotide preferably contains the following nucleosides: deoxyadenosine (dA), deoxyuridine (dU) and/or thymidine (dT), deoxyguanosine (dG) and deoxycytidine (dC).
所述核苷酸通常含有单磷酸、二磷酸或三磷酸。磷酸酶可以被连接在核苷酸的5'或3'侧。The nucleotides typically contain monophosphate, diphosphate or triphosphate. The phosphatase can be attached to the 5' or 3' side of the nucleotide.
多核苷酸中的核苷酸可以以任何方式彼此连接。如在核酸中一样,核苷酸通常通过它们的糖和磷酸基团连接。如嘧啶二聚体中一样,所述核苷酸可通过它们的核碱基连接。The nucleotides in a polynucleotide can be linked to each other in any manner. As in nucleic acids, the nucleotides are usually linked by their sugar and phosphate groups. As in pyrimidine dimers, the nucleotides can be linked by their nucleobases.
所述多核苷酸可以是单链或双链的。至少所述多核苷酸的一部分优选为双链的。The polynucleotide may be single-stranded or double-stranded. At least a portion of the polynucleotide is preferably double-stranded.
多核苷酸可为核酸,例如脱氧核糖核酸(DNA)或核糖核酸(RNA)。A polynucleotide can be a nucleic acid, such as deoxyribonucleic acid (DNA) or ribonucleic acid (RNA).
多核苷酸优选是DNA、RNA或DNA或RNA杂交体。多核苷酸可以包含单链区和具有其它结构的区域,例如发夹环、三链体和/或四链体。DNA/RNA杂交体可以在同一条链上包含DNA和RNA。优选地,DNA/RNA杂交体包含与RNA链杂交的一条DNA链。The polynucleotide is preferably DNA, RNA or a DNA or RNA hybrid. The polynucleotide may comprise a single-stranded region and a region with other structures, such as a hairpin loop, a triplex and/or a quadruplex. A DNA/RNA hybrid may comprise DNA and RNA on the same strand. Preferably, the DNA/RNA hybrid comprises a DNA strand hybridized to an RNA strand.
多核苷酸可为任意长度。例如,多核苷酸可为至少10个,至少50个,至少100个,至少150个,至少200个,至少250个,至少300个,至少400个或至少500个核苷酸对的长度。多核苷酸可为1000或更多个核苷酸对,5000或更多个核苷酸对的长度或100000或更多个核苷酸对的长度。The polynucleotide can be of any length. For example, the polynucleotide can be at least 10, at least 50, at least 100, at least 150, at least 200, at least 250, at least 300, at least 400 or at least 500 nucleotide pairs in length. The polynucleotide can be 1000 or more nucleotide pairs in length, 5000 or more nucleotide pairs in length or 100000 or more nucleotide pairs in length.
多核苷酸可以存在于任何合适的样品中。本发明通常对已知含有或怀疑含有多核苷酸的样品进行。或者,本发明可对样品进行,以确定一种或多种已知或预期存在于样品中的多核苷酸的种类。The polynucleotides may be present in any suitable sample. The present invention is generally performed on samples known to contain or suspected of containing polynucleotides. Alternatively, the present invention may be performed on samples to determine the types of one or more polynucleotides known or expected to be present in the sample.
多核苷酸结合蛋白Polynucleotide binding protein
多核苷酸结合蛋白可以是能够与多核苷酸结合并且控制其通过孔的移动的任何蛋白质。在本领域中确定蛋白质是否与多核苷酸结合是很简单的。蛋白质通常与多核苷酸相互作用并且修饰其至少一种性质。蛋白质可以通过裂解多核苷酸以形成单独的核苷酸或如二核苷酸或三核苷酸的更短的核苷酸链来修饰多核苷酸。所述部分可通过将多核苷酸定位或移动到特异性位置(即控制其移动)来修饰多核苷酸。The polynucleotide binding protein can be any protein that can bind to the polynucleotide and control its movement through the hole. It is very simple to determine whether a protein binds to a polynucleotide in the art. Proteins usually interact with polynucleotides and modify at least one of their properties. Proteins can modify polynucleotides by cleaving polynucleotides to form a single nucleotide or a shorter nucleotide chain such as a dinucleotide or trinucleotide. The part can modify the polynucleotide by positioning or moving the polynucleotide to a specific position (i.e., controlling its movement).
在一些实施方案中,所述多核苷酸结合蛋白衍生自多核苷酸处理酶;所述多核苷酸处理酶选自聚合酶、解旋酶或核酸外切酶。更优选地,所述解旋酶选自He1308解旋酶、RecD解旋酶、XPD解旋酶、Dda解旋酶或ED1解旋酶。 In some embodiments, the polynucleotide binding protein is derived from a polynucleotide processing enzyme; the polynucleotide processing enzyme is selected from a polymerase, a helicase or a nuclease exonuclease. More preferably, the helicase is selected from a He1308 helicase, a RecD helicase, a XPD helicase, a Dda helicase or an ED1 helicase.
在一些实施方案中,多核苷酸结合蛋白是多核苷酸解链酶。多核苷酸解链酶是能够使双链多核苷酸解链成单链的酶。在一些实施方案中,多核苷酸解链酶能够使双链DNA解链成单链。在一些实施方案中,多核苷酸解链酶是具有解旋酶活性的酶。多核苷酸解链酶的实例包括例如本文所述的解旋酶。In some embodiments, the polynucleotide binding protein is a polynucleotide helicase. A polynucleotide helicase is an enzyme that can melt a double-stranded polynucleotide into a single strand. In some embodiments, a polynucleotide helicase can melt a double-stranded DNA into a single strand. In some embodiments, a polynucleotide helicase is an enzyme with helicase activity. Examples of polynucleotide helicases include, for example, helicases as described herein.
多核苷酸结合能力可以使用本领域中已知的任何方法来测量。例如,可以使蛋白质与多核苷酸接触,并且可以测量蛋白质与多核苷酸结合并沿所述多核苷酸移动的能力。蛋白质可以包括有助于多核苷酸结合和/或有助于其在高盐浓度和/或室温下的活性的修饰。蛋白质可进行修饰,使得其结合多核苷酸(即保持多核苷酸结合能力)但不充当解链酶(即当具备所有便于移动的必需组分(例如ATP和Mg2+)时不沿多核苷酸移动)。此类修饰是所属领域中已知的。例如,解旋酶中的Mg2+结合结构域的修饰通常产生不起解旋酶作用的变体。The polynucleotide binding ability can be measured using any method known in the art. For example, a protein can be contacted with a polynucleotide, and the ability of the protein to bind to the polynucleotide and move along the polynucleotide can be measured. The protein can include modifications that contribute to polynucleotide binding and/or contribute to its activity at high salt concentrations and/or room temperature. The protein can be modified so that it binds to the polynucleotide (i.e., retains the polynucleotide binding ability) but does not act as a helicase (i.e., does not move along the polynucleotide when all the necessary components (e.g., ATP and Mg 2+ ) are available for movement). Such modifications are known in the art. For example, modifications of the Mg 2+ binding domain in a helicase typically produce variants that do not function as a helicase.
酶可以共价附接至孔。可以使用任何方法将酶共价附接至孔。The enzyme may be covalently attached to the pore.Any method may be used to covalently attach the enzyme to the pore.
在链测序中,多核苷酸顺着或逆着外加电位易位通过孔。在双链多核苷酸上逐渐或逐步起作用的核酸外切酶可以在孔的顺侧使用以在外加电位下供给剩余的单链,或在反侧使用以在反向电位下供给剩余的单链。同样,还可以以类似的方式使用使双链DNA解链的解旋酶。还可以使用聚合酶。需要逆着外加电位发生链易位的测序应用也是有可能的,但是DNA必须首先在相反电位或无电位下被酶“捕获”。随着电位随后在结合后转换,链将以顺式到反式的方式通过孔并且通过电流保持呈延长的构型。单链DNA核酸外切酶或单链DNA依赖性聚合酶可充当分子马达,以将最近易位的单链以受控的逐步方式逆着外加电位从反式到顺式从孔中拉回来。In chain sequencing, polynucleotides are translocated through the hole along or against the applied potential. Exonucleases that gradually or stepwise work on double-stranded polynucleotides can be used on the cis side of the hole to supply the remaining single strand under the applied potential, or on the trans side to supply the remaining single strand under the reverse potential. Similarly, helicases that unwind the double-stranded DNA can also be used in a similar manner. Polymerases can also be used. Sequencing applications that require chain translocation against the applied potential are also possible, but DNA must first be "captured" by the enzyme at the opposite potential or without potential. As the potential is subsequently converted after binding, the chain will pass through the hole in a cis-to-trans manner and be in an extended configuration by current retention. Single-stranded DNA exonucleases or single-stranded DNA-dependent polymerases can serve as molecular motors to pull the recently translocated single strand back from the hole in a controlled, stepwise manner against the applied potential from trans to cis.
可以在本发明中使用任何解旋酶。解旋酶可以两种模式对孔起作用。首先,优选使用解旋酶来进行所述方法,使得在由外加电压造成的场的作用下,所述解旋酶使多核苷酸移动通过孔。在这种模式中,多核苷酸的5’端首先被捕获在孔中,并且解旋酶使多核苷酸移动到孔中,使其在场的作用下通过孔,直到其最终易位通过,到达膜的反侧为止。或者,优选这样进行所述方法,解链酶使多核苷酸逆着由外加电压造成的场而移动通过孔。在这种模式中,多核苷酸的3’端首先被捕获在孔中,并且解链酶使多核苷酸移动通过孔,使得其逆着外加场被拉出孔,直到其最终被推回到膜的顺侧为止。Any helicase can be used in the present invention. The helicase can act on the hole in two modes. First, the method is preferably carried out using a helicase so that the helicase moves the polynucleotide through the hole under the action of the field caused by the applied voltage. In this mode, the 5' end of the polynucleotide is first captured in the hole, and the helicase moves the polynucleotide into the hole so that it passes through the hole under the action of the field until it finally translocates through and reaches the reverse side of the membrane. Alternatively, the method is preferably carried out in this way, and the helicase moves the polynucleotide through the hole against the field caused by the applied voltage. In this mode, the 3' end of the polynucleotide is first captured in the hole, and the helicase moves the polynucleotide through the hole so that it is pulled out of the hole against the applied field until it is finally pushed back to the cis side of the membrane.
还可以沿相反的方向进行所述方法。多核苷酸的3’端可以首先被捕获在孔中,并且解链酶可以使多核苷酸移动到孔中,使得其在场的作用下通过孔,直到其最终易位通过,到达膜的反侧为止。The method can also be performed in the reverse direction. The 3' end of the polynucleotide can first be captured in the pore, and the helicase can move the polynucleotide into the pore, allowing it to pass through the pore under the influence of the field until it finally translocates through and reaches the opposite side of the membrane.
当解链酶不具备便于移动的必需组分或被修饰成阻止或防止其移动时,所述解链酶可以与多核苷酸结合并且在多核苷酸被外加场拉入孔中时充当刹车以减慢多核苷酸的移动。在非活性模式中,多核苷酸的3’或5’是否被捕获不重要,在充当刹车的酶的作用下将多核苷酸朝向反侧拉入孔中的是外加场。当在非活性模式中时,解链酶对多核苷酸的移动的控制可以用多种方式来描述,包括齿合、滑动和制动。还可以用这种方式来使用缺乏解链酶活性的解链酶变体。When the helicase does not possess the essential component that is convenient to move or is modified to stop or prevent it from moving, the helicase can be combined with the polynucleotide and serve as a brake to slow down the movement of the polynucleotide when the polynucleotide is pulled into the hole by the external field. In the inactive mode, it is not important whether the 3 ' or 5 ' of the polynucleotide is captured, and it is the external field that pulls the polynucleotide into the hole towards the opposite side under the effect of the enzyme that serves as the brake. When in the inactive mode, the control of the movement of the helicase to the polynucleotide can be described in a variety of ways, including toothing, sliding and braking. The helicase variant lacking helicase activity can also be used in this way.
多核苷酸与多核苷酸结合蛋白(例如,多核苷酸解链酶)和孔可以按任何次序接触。优选的是,当使多核苷酸与如解旋酶的多核苷酸结合蛋白(例如,多核苷酸解链酶)和孔接触时,多核苷酸首先与多 核苷酸结合蛋白(例如,多核苷酸解链酶)形成复合物。当跨孔施加电压时,多核苷酸/多核苷酸结合蛋白(例如,多核苷酸解链酶)复合物就会与孔形成复合物并且控制多核苷酸通过孔的移动。The polynucleotide may be contacted with the polynucleotide binding protein (e.g., a polynucleotide helicase) and the pore in any order. Preferably, when contacting the polynucleotide with a polynucleotide binding protein such as a helicase (e.g., a polynucleotide helicase) and the pore, the polynucleotide is first contacted with the polynucleotide binding protein (e.g., a polynucleotide helicase) and the pore. The nucleotide binding protein (eg, polynucleotide helicase) forms a complex. When a voltage is applied across the pore, the polynucleotide/polynucleotide binding protein (eg, polynucleotide helicase) complex forms a complex with the pore and controls the movement of the polynucleotide through the pore.
使用多核苷酸结合蛋白(例如,多核苷酸解链酶)的方法中的任何步骤通常在游离核苷酸或游离核苷酸类似物和促进多核苷酸结合蛋白(例如,多核苷酸解链酶)的作用的酶辅因子存在下进行。游离核苷酸可以是任何单独的核苷酸中的一种或多种。游离核苷酸包括但不限于:单磷酸腺苷(AMP)、二磷酸腺苷(ADP)、三磷酸腺苷(ATP)、单磷酸鸟苷(GMP)、二磷酸鸟苷(GDP)、三磷酸鸟苷(GTP)、单磷酸胸苷(TMP)、二磷酸胸苷(TDP)、三磷酸胸苷(TTP)、单磷酸尿苷(UMP)、二磷酸尿苷(UDP)、三磷酸尿苷(UTP)、单磷酸胞苷(CMP)、二磷酸胞苷(CDP)、三磷酸胞苷(CTP)、环单磷酸腺苷(cAMP)、环单磷酸鸟苷(cGMP)、单磷酸脱氧腺苷(dAMP)、二磷酸脱氧腺苷(dADP)、三磷酸脱氧腺苷(dATP)、单磷酸脱氧鸟苷(dGMP)、二磷酸脱氧鸟苷(dGDP)、三磷酸脱氧鸟苷(dGTP)、单磷酸脱氧胸苷(dTMP)、二磷酸脱氧胸苷(dTDP)、三磷酸脱氧胸苷(dTTP)、单磷酸脱氧尿苷(dUMP)、二磷酸脱氧尿苷(dUDP)、三磷酸脱氧尿苷(dUTP)、单磷酸脱氧胞苷(dCMP)、二磷酸脱氧胞苷(dCDP)以及三磷酸脱氧胞苷(dCTP)。游离核苷酸优选选自AMP、TMP、GMP、CMP、UMP、dAMP、dTMP、dGMP或dCMP。游离核苷酸优选是三磷酸腺苷(ATP)。酶辅因子是允许构建体起作用的因子。酶辅因子优选是二价金属阳离子。二价金属阳离子优选是Mg2+、Mn2+、Ca2+或Co2+。酶辅因子最优选是Mg2+Any step in the method using a polynucleotide binding protein (e.g., a polynucleotide helicase) is typically performed in the presence of free nucleotides or free nucleotide analogs and an enzyme cofactor that promotes the action of the polynucleotide binding protein (e.g., a polynucleotide helicase). The free nucleotides can be one or more of any individual nucleotides. Free nucleotides include, but are not limited to, adenosine monophosphate (AMP), adenosine diphosphate (ADP), adenosine triphosphate (ATP), guanosine monophosphate (GMP), guanosine diphosphate (GDP), guanosine triphosphate (GTP), thymidine monophosphate (TMP), thymidine diphosphate (TDP), thymidine triphosphate (TTP), uridine monophosphate (UMP), uridine diphosphate (UDP), uridine triphosphate (UTP), cytidine monophosphate (CMP), cytidine diphosphate (CDP), cytidine triphosphate (CTP), cyclic adenosine monophosphate (cAMP), cyclic guanosine monophosphate (cGMP), deoxyguanosine monophosphate (DAMP), deoxyguanosine monophosphate (DGP), deoxyguanosine monophosphate (DTP), deoxyguanosine monophosphate (DMP), deoxyguanosine diphosphate (DTP ... The free nucleotides are preferably selected from AMP, TMP, GMP, CMP, UMP, dAMP, dTMP, dGMP or dCMP. The free nucleotides are preferably adenosine triphosphate (ATP). The enzyme cofactor is a factor that allows the construct to work. The enzyme cofactor is preferably a divalent metal cation. The divalent metal cation is preferably Mg 2+ , Mn 2+ , Ca 2+ or Co 2+ . The enzyme cofactor is most preferably Mg 2+ .
停滞Stagnation
如果多核苷酸结合蛋白已经停止了沿分析物例如多核苷酸移动,则它是停滞的。联合阻滞元件用于停滞多核苷酸结合蛋白。多核苷酸结合蛋白可在阻滞元件之前被停滞。If the polynucleotide binding protein has stopped moving along the analyte, such as a polynucleotide, it is arrested. The combination of an arresting element is used to arrest the polynucleotide binding protein. The polynucleotide binding protein can be arrested before the arresting element.
将所述停滞的解旋酶和多核苷酸与跨膜孔接触并施加电势。在所施加的电势产生的场的作用下,所述多核苷酸移动穿过所述孔。所述多核苷酸结合蛋白通常太大而不能移动穿过所述孔。当多核苷酸的一部分进入所述孔并沿所施加的电势而产生的场移动,所述多核苷酸结合蛋白随着所述多核苷酸移动通过所述孔而通过所述孔移动穿过联合阻滞元件。The stalled helicase and polynucleotide are contacted with a transmembrane pore and an electric potential is applied. Under the action of the field generated by the applied electric potential, the polynucleotide moves through the pore. The polynucleotide-binding protein is usually too large to move through the pore. When a portion of the polynucleotide enters the pore and moves along the field generated by the applied electric potential, the polynucleotide-binding protein moves through the pore through the associated blocking element as the polynucleotide moves through the pore.
这使得多核苷酸结合蛋白在多核苷酸上的位置被控制。在将停滞的多核苷酸结合蛋白和多核苷酸与跨膜孔接触和施加电势之前,多核苷酸结合蛋白停留在它们被停滞的位置。甚至在存在必要的组分(例如ATP和Mg2+)以促进多核苷酸结合蛋白的移动,所述多核苷酸结合蛋白将不会移动穿过多核苷酸上的联合阻滞元件,直到有跨膜孔和所施加的电势的存在。This allows the position of the polynucleotide binding protein on the polynucleotide to be controlled. Before the stalled polynucleotide binding protein and polynucleotide are contacted with the transmembrane pore and an electric potential is applied, the polynucleotide binding protein stays at the position where they are stalled. Even in the presence of necessary components (e.g., ATP and Mg2+) to promote the movement of the polynucleotide binding protein, the polynucleotide binding protein will not move through the combined blocking element on the polynucleotide until there is a transmembrane pore and an applied electric potential.
membrane
可根据本发明使用任何膜。合适的膜被本领域中所熟知。所述膜优选地为两亲性层。两亲性层是由具有亲水性和亲脂性的两亲性分子(例如磷脂)形成的层。两亲分子可为合成的或天然存在的。非天然存在的两亲物和形成单层的两亲物在本领域中已知,包括,例如,嵌段共聚物。嵌段共聚物是聚合材料,其中两种或多种单体亚单位聚合在一起以产生单个的聚合物链。嵌段共聚物通常具有由每种单体亚单位提供的特性。然而,嵌段共聚物可具有由单独亚单位形成的聚合物所不具有的独特性质。可将嵌段共聚物设计成这样:一种单体亚单位是疏水性的(即亲脂性的),而其他的一种或多种亚单位在水性介质中时是亲水性的。在这种情况下,嵌段共聚物可具有两亲性质,并可形成模 拟生物膜的结构。嵌段共聚物可为二嵌段(由两种单体亚单位组成),但还可由多于两种单体亚单位构成,以形成更复杂的表现为两亲物的排列。所述共聚物可为三嵌段、四嵌段或五嵌段共聚物。Any membrane may be used in accordance with the present invention. Suitable membranes are well known in the art. The membrane is preferably an amphiphilic layer. An amphiphilic layer is a layer formed by amphiphilic molecules (e.g., phospholipids) having both hydrophilic and lipophilic properties. The amphiphilic molecules may be synthetic or naturally occurring. Non-naturally occurring amphiphiles and amphiphiles that form monolayers are known in the art and include, for example, block copolymers. Block copolymers are polymeric materials in which two or more monomer subunits are polymerized together to produce a single polymer chain. Block copolymers typically have properties provided by each monomer subunit. However, block copolymers may have unique properties that polymers formed by individual subunits do not have. Block copolymers may be designed such that one monomer subunit is hydrophobic (i.e., lipophilic) while the other subunit or subunits are hydrophilic when in an aqueous medium. In this case, the block copolymer may have amphiphilic properties and may form a membrane. The structure of the mimicking biofilm. The block copolymers can be diblock (composed of two monomer subunits), but can also be composed of more than two monomer subunits to form more complex arrangements that behave as amphiphiles. The copolymers can be triblock, tetrablock or pentablock copolymers.
古细菌双极性四醚脂质是天然存在的脂质,其被构造为使所述脂质形成单层膜。这些脂质通常被发现于在恶劣的生物环境中生存的嗜极生物、嗜热生物、嗜盐生物和嗜酸生物中。认为它们的稳定性来自最终双层的融合性质。很容易通过产生具有通用基序亲水-疏水-亲水的三嵌段聚合物,来构建模拟这些生物实体的嵌段共聚物材料。该材料可形成表现与脂双层相似且具有一系列从囊泡到层膜的相状态的单体膜。由这些三嵌段共聚物形成的膜具有一些优于生物脂质膜的优势。因为所述三嵌段共聚物是合成的,可仔细地控制所述精确构建,以提供形成膜以及与孔和其他蛋白质相互作用所需的正确的链长度和性质。Archaeal bipolar tetraether lipids are naturally occurring lipids that are constructed so that the lipids form monolayer membranes. These lipids are commonly found in extremophiles, thermophiles, halophiles, and acidophiles that survive in harsh biological environments. It is believed that their stability comes from the fusogenic nature of the final bilayer. It is easy to construct block copolymer materials that mimic these biological entities by creating triblock polymers with the general motif hydrophilic-hydrophobic-hydrophilic. The material can form a monomeric membrane that behaves similarly to a lipid bilayer and has a range of phase states from vesicles to laminae. The membranes formed by these triblock copolymers have some advantages over biological lipid membranes. Because the triblock copolymers are synthetic, the precise construction can be carefully controlled to provide the correct chain length and properties required to form a membrane and interact with pores and other proteins.
还可由不被分类为脂质亚材料的亚单位构建嵌段共聚物;例如疏水性聚合物可由硅氧烷或其他基于非烃化合物的单体制成。嵌段共聚物亲水性的亚部分还可具有低的蛋白质结合性质,这使得能够产生在暴露于未加工的生物样品时具有高度抵抗力的膜。该首基单位还可衍生自非典型的脂质首基。Block copolymers can also be constructed from subunits that are not classified as lipid submaterials; for example, hydrophobic polymers can be made from monomers based on siloxane or other non-hydrocarbon compounds. The hydrophilic subportion of the block copolymer can also have low protein binding properties, which enables the production of membranes that are highly resistant when exposed to raw biological samples. The head group unit can also be derived from atypical lipid head groups.
与生物脂质膜相比,三嵌段共聚物膜还具有增强的机械稳定性和环境稳定性,例如高得多的操作温度或pH范围。所述嵌段共聚物的合成性质提供了定制基于聚合物的膜用于大量应用的平台。Compared to biological lipid membranes, triblock copolymer membranes also have enhanced mechanical and environmental stability, such as much higher operating temperature or pH range.The synthetic nature of the block copolymers provides a platform to tailor polymer-based membranes for a wide range of applications.
可化学修饰或功能化所述两亲性分子以促进分析物的偶联。The amphiphilic molecules may be chemically modified or functionalized to facilitate coupling of the analyte.
两亲性层可为单层或双层。两亲性层通常为平面的。两亲性层可为非平面的例如弯曲的。The amphiphilic layer can be a single layer or a double layer. The amphiphilic layer is usually planar. The amphiphilic layer can be non-planar, for example curved.
两亲性层通常为脂双层。脂双层是细胞膜的模型,并为一系列的实验性研究充当极好的平台。例如,脂双层可用于使用单通道记录的膜蛋白的体外研究。或者,脂双层可用作生物传感器来检测一系列物质的存在。所述脂双层可为任何脂双层。合适的脂双层包括但不限于平面的脂双层、支撑的双层或脂质体。脂双层优选地为平面的脂双层。The amphiphilic layer is typically a lipid bilayer. The lipid bilayer is a model for a cell membrane and serves as an excellent platform for a range of experimental studies. For example, lipid bilayers can be used for in vitro studies of membrane proteins using single channel recording. Alternatively, lipid bilayers can be used as biosensors to detect the presence of a range of substances. The lipid bilayer can be any lipid bilayer. Suitable lipid bilayers include, but are not limited to, planar lipid bilayers, supported bilayers or liposomes. The lipid bilayer is preferably a planar lipid bilayer.
在另一个优选实施方案中,所述膜是固态层。固态层不是生物来源的。换言之,固态层不衍生自或者分离自生物环境,例如生物体或细胞,或者合成制造形式的生物学可用结构。固态层可以用有机或无机材料形成,包括但不限于,微电子材料,绝缘材料如Si3N4、Al2O3、和SiO2,有机和无机聚合物如聚酰胺,塑料如或弹性体如双组分加成型硅橡胶(two-component addition-cure silicone rubber)和玻璃。固态层可以用石墨烯形成。In another preferred embodiment, the film is a solid layer. The solid layer is not of biological origin. In other words, the solid layer is not derived from or separated from a biological environment, such as an organism or a cell, or a biologically usable structure in a synthetically manufactured form. The solid layer can be formed with organic or inorganic materials, including but not limited to microelectronic materials, insulating materials such as Si 3 N 4 , Al 2 O 3 , and SiO 2 , organic and inorganic polymers such as polyamides, plastics such as or elastomers such as two-component addition-cure silicone rubber and glass. The solid layer can be formed with graphene.
跨膜孔Transmembrane pore
跨膜孔是允许外加电势驱动的水合离子从膜的一侧流动到膜的另一侧的结构。Transmembrane pores are structures that allow hydrated ions to flow from one side of the membrane to the other side driven by an applied electrical potential.
跨膜孔优选地为跨膜蛋白孔。跨膜蛋白孔为允许水合离子(例如分析物)从膜的一侧流动到膜的另一侧的多肽或多肽的集合。在本发明中,跨膜蛋白孔能够形成允许外加电势驱动的水合离子从膜的一侧流动到另一侧的孔。跨膜蛋白孔优选地允许分析物(例如核苷酸)从膜(例如脂双层)的一侧流动到另一侧。跨膜蛋白孔允许多核苷酸(例如DNA或RNA)移动通过所述孔。The transmembrane pore is preferably a transmembrane protein pore. A transmembrane protein pore is a polypeptide or a collection of polypeptides that allows hydrated ions (e.g., analytes) to flow from one side of the membrane to the other side of the membrane. In the present invention, the transmembrane protein pore can form a hole that allows hydrated ions driven by an applied potential to flow from one side of the membrane to the other side. The transmembrane protein pore preferably allows analytes (e.g., nucleotides) to flow from one side of the membrane (e.g., a lipid bilayer) to the other side. The transmembrane protein pore allows polynucleotides (e.g., DNA or RNA) to move through the pore.
跨膜蛋白孔可为单体或寡聚体。所述孔优选地由数个重复亚基(例如6、7或8个亚基)构成。所述孔更优选地为七聚体或八聚体孔。 The transmembrane protein pore may be a monomer or an oligomer. The pore is preferably composed of several repeating subunits (e.g. 6, 7 or 8 subunits). The pore is more preferably a heptamer or octamer pore.
跨膜蛋白孔通常包含桶状体或通道,所述离子可通过所述桶状体或通道流动。所述孔的亚基通常围绕中心轴并且为跨膜β桶状体或通道或者跨膜α-螺旋束状体或通道提供链。Transmembrane protein pores typically comprise a barrel or channel through which the ions can flow. The subunits of the pore typically surround a central axis and provide strands for a transmembrane beta barrel or channel or a transmembrane alpha-helical bundle or channel.
跨膜蛋白孔的桶状体或通道通常包含促进与分析物(例如核苷酸、多核苷酸或核酸)的相互作用的氨基酸。这些氨基酸优选地位于所述桶状体或通道的缩窄处附近。跨膜蛋白孔通常包含一个或多个带正电的氨基酸(例如精氨酸、赖氨酸或组氨酸)或者芳香族氨基酸(例如酪氨酸或色氨酸)。这些氨基酸通常促进所述孔与核苷酸或多核苷酸或核酸之间的相互作用。The barrel or channel of a transmembrane protein pore typically comprises amino acids that promote interaction with an analyte (e.g., a nucleotide, a polynucleotide, or a nucleic acid). These amino acids are preferably located near the constriction of the barrel or channel. Transmembrane protein pores typically comprise one or more positively charged amino acids (e.g., arginine, lysine, or histidine) or aromatic amino acids (e.g., tyrosine or tryptophan). These amino acids typically promote interaction between the pore and a nucleotide, polynucleotide, or nucleic acid.
可用于本发明的跨膜蛋白孔可衍生自β-桶状体孔或α-螺旋束状体孔,β-桶状体孔包含由β-链形成的桶状体或通道。合适的β-桶状体孔包括但不局限于β-毒素,例如α-溶血素、炭疽毒素和杀白细胞素,以及细菌的外膜蛋白/孔蛋白(porin),例如包皮垢分支杆菌(Mycobacterium smegmatis)孔蛋白(Msp)(例如MspA)、外膜孔蛋白F(OmpF)、外膜孔蛋白G(OmpG)、外膜磷脂酶A和奈瑟氏菌(Neisseria)自转运脂蛋白(NalP)。α-螺旋束状体孔包含由α-螺旋形成的桶状体或通道。合适的α-螺旋束状体孔包括但不局限于内膜蛋白和α-外膜蛋白,例如WZA和ClyA毒素。跨膜孔可衍生自Msp或α-溶血素(α-HL)。Transmembrane protein pores useful in the present invention may be derived from β-barrel pores or α-helical bundle pores, β-barrel pores comprising a barrel or channel formed by β-strands. Suitable β-barrel pores include, but are not limited to, β-toxins, such as α-hemolysin, anthrax toxin and leukocidin, and bacterial outer membrane proteins/porins, such as Mycobacterium smegmatis porins (Msp) (e.g., MspA), outer membrane porin F (OmpF), outer membrane porin G (OmpG), outer membrane phospholipase A and Neisseria autotransporter lipoprotein (NalP). α-helical bundle pores comprise a barrel or channel formed by α-helices. Suitable α-helical bundle pores include, but are not limited to inner membrane proteins and α-outer membrane proteins, such as WZA and ClyA toxins. Transmembrane pores may be derived from Msp or α-hemolysin (α-HL).
跨膜蛋白孔优选地衍生自Msp,优选衍生自MspA。这类孔是低聚体,并且通常包含7、8、9或10个衍生自Msp的单体。所述孔可为衍生自Msp的包含相同单体的同低聚体(homo-oligomeric)孔。或者,所述孔可为衍生自Msp的含有至少一个与其他单体不同的单体的异寡聚体(hetero-oligomeric)孔。所述孔还可包含一个或多个构建体,所述构建体包含两个或多个共价连接的衍生自Msp的单体。The transmembrane protein pore is preferably derived from Msp, preferably derived from MspA. Such pores are oligomers and typically comprise 7, 8, 9 or 10 monomers derived from Msp. The pore may be a homo-oligomeric pore derived from Msp comprising the same monomers. Alternatively, the pore may be a hetero-oligomeric pore derived from Msp containing at least one monomer different from the other monomers. The pore may also comprise one or more constructs comprising two or more covalently linked monomers derived from Msp.
跨膜蛋白孔还优选地衍生自α-溶血素(α-HL)。野生型α-HL孔由7个相同的单体或亚基形成(即其为七聚体)。The transmembrane protein pore is also preferably derived from alpha-hemolysin (alpha-HL).The wild-type alpha-HL pore is formed by 7 identical monomers or subunits (ie it is a heptamer).
在一些实施方案中,所述跨膜蛋白孔被化学修饰。可用任何方式在任何位点化学修饰所述孔。优选地通过分子与一个或多个半胱氨酸的结合(半胱氨酸连接)、分子与一个或多个赖氨酸的结合、分子与一个或多个非天然氨基酸的结合、表位的酶修饰或者末端的修饰,对跨膜蛋白孔进行化学修饰。进行这类修饰的合适方法在本领域中是熟知的。可通过结合任何分子来化学修饰所述跨膜蛋白孔。例如,可通过结合染料或荧光团来化学修饰所述孔。In some embodiments, the transmembrane protein pore is chemically modified. The pore can be chemically modified at any site in any manner. Preferably, the transmembrane protein pore is chemically modified by binding of a molecule to one or more cysteines (cysteine ligation), binding of a molecule to one or more lysines, binding of a molecule to one or more non-natural amino acids, enzymatic modification of an epitope, or modification of a terminal end. Suitable methods for performing such modifications are well known in the art. The transmembrane protein pore can be chemically modified by binding any molecule. For example, the pore can be chemically modified by binding a dye or a fluorophore.
可化学修饰所述孔中任意数量的单体。优选地,如上所述化学修饰一个或多个例如2、3、4、5、6、7、8、9或10个所述单体。Any number of monomers in the pore may be chemically modified. Preferably, one or more, for example 2, 3, 4, 5, 6, 7, 8, 9 or 10, of the monomers are chemically modified as described above.
移动move
在本发明的方法中,使分析物例如所多核苷酸移动通过跨膜孔并被测序。使多核苷酸移动通过跨膜孔是指使多核苷酸从所述孔的一侧移动到另一侧。多核苷酸通过孔的移动可受电势或酶促作用或电位和酶促作用驱动或控制。移动可以是单向的,或可允许向后和向前移动。In the method of the present invention, the analyte, such as the polynucleotide, is moved through the transmembrane pore and sequenced. Moving the polynucleotide through the transmembrane pore refers to moving the polynucleotide from one side of the pore to the other side. The movement of the polynucleotide through the pore may be driven or controlled by an electric potential or an enzymatic action or both an electric potential and an enzymatic action. The movement may be unidirectional, or backward and forward movement may be allowed.
优选地使用多核苷酸结合蛋白来控制多核苷酸移动通过所述孔。Preferably a polynucleotide binding protein is used to control the movement of the polynucleotide through the pore.
分析物表征Analyte characterization
所述表征方法可以包括测量分析物例如多核苷酸的一个、两个、三个、四个或五个或更多个特 征。所述方法包括控制分析物移动穿过跨膜孔,当分析物相对于孔移动时,获取一个或多个测量值,其中测量值代表分析物的一个或多个特征。The characterization method may include measuring one, two, three, four or five or more characteristics of an analyte, such as a polynucleotide. The method includes controlling movement of an analyte through a transmembrane pore and obtaining one or more measurements as the analyte moves relative to the pore, wherein the measurements represent one or more characteristics of the analyte.
所述特征优选选自(i)多核苷酸的长度,(ii)多核苷酸的同一性,(iii)多核苷酸的序列,(iv)多核苷酸的二级结构,以及(v)多核苷酸是否被修饰。The characteristic is preferably selected from (i) the length of the polynucleotide, (ii) the identity of the polynucleotide, (iii) the sequence of the polynucleotide, (iv) the secondary structure of the polynucleotide, and (v) whether the polynucleotide is modified.
在一些实施方式中,所述分析物是多核苷酸。可以表征任意数量的多核苷酸。例如,本发明的方法可以涉及表征2、3、4、5、6、7、8、9、10、20、30、50、100个或更多个多核苷酸。所述多核苷酸可以是天然存在的或人工的。例如,该方法可以被用于检验所制造的寡核苷酸的序列。该方法一般在体外进行。In some embodiments, the analyte is a polynucleotide. Any number of polynucleotides can be characterized. For example, the method of the present invention can involve characterizing 2, 3, 4, 5, 6, 7, 8, 9, 10, 20, 30, 50, 100 or more polynucleotides. The polynucleotide can be naturally occurring or artificial. For example, the method can be used to test the sequence of the manufactured oligonucleotide. The method is generally performed in vitro.
本发明的方法包含移动所述单链多核苷酸通过所述跨膜孔,使得所述单链多核苷酸的一部分核苷酸与所述孔相互作用。The method of the present invention comprises moving the single-stranded polynucleotide through the transmembrane pore such that a portion of the nucleotides of the single-stranded polynucleotide interact with the pore.
所述方法可使用如上所述的任何合适的膜进行,优选脂双层系统,其中孔被插入脂双层中。所述方法通常使用如下膜进行:(i)包含孔的人造双层,(ii)分离的天然存在的含孔脂双层,或(iii)有孔插入其中的细胞。所述方法优选地使用人造脂双层进行。除了所述孔之外,所述双层可包含其他跨膜蛋白和/或膜内蛋白以及其他分子。下文参考本发明的测序实施方案详述了合适的装置和条件。本发明的方法通常在体外进行。The method can be performed using any suitable membrane as described above, preferably a lipid bilayer system, wherein the pore is inserted into the lipid bilayer. The method is typically performed using a membrane that includes (i) an artificial bilayer containing a pore, (ii) an isolated naturally occurring lipid bilayer containing a pore, or (iii) a cell having a pore inserted therein. The method is preferably performed using an artificial lipid bilayer. In addition to the pore, the bilayer may include other transmembrane proteins and/or intramembrane proteins and other molecules. Suitable apparatus and conditions are described in detail below with reference to the sequencing embodiment of the present invention. The method of the present invention is typically performed in vitro.
本发明提供了表征多核苷酸的方法,所述方法包括:The present invention provides a method for characterizing a polynucleotide, the method comprising:
(a1)提供具有多核苷酸的构建体,其中多核苷酸在其一端或两端与本发明的包含联合阻滞元件的衔接体连接;以及将构建体与多核苷酸结合蛋白接触,使得多核苷酸结合蛋白在衔接体上的阻滞元件的联合作用下停滞在衔接体上;或(a1) providing a construct having a polynucleotide, wherein the polynucleotide is connected to an adaptor comprising a combined blocking element of the present invention at one or both ends thereof; and contacting the construct with a polynucleotide binding protein, so that the polynucleotide binding protein is arrested on the adaptor under the combined action of the blocking element on the adaptor; or
(a2)使多核苷酸结合蛋白装载到本发明的包含联合阻滞元件的衔接体上,多核苷酸结合蛋白在衔接体上的阻滞元件的联合作用下停滞在衔接体上;以及使装载有多核苷酸结合蛋白的衔接体连接到多核苷酸;(a2) loading the polynucleotide binding protein onto the adapter comprising the combined blocking element of the present invention, the polynucleotide binding protein being arrested on the adapter under the combined action of the blocking element on the adapter; and connecting the adapter loaded with the polynucleotide binding protein to the polynucleotide;
(b)将步骤(a)中提供的装载有多核苷酸结合蛋白的多核苷酸与跨膜孔接触;以及(b) contacting the polynucleotide loaded with the polynucleotide binding protein provided in step (a) with the transmembrane pore; and
(c)跨跨膜孔施加电势,使得多核苷酸结合蛋白移动穿过第一阻滞元件、第二阻滞元件的部分区域,并控制多核苷酸穿过跨膜孔的移动;(c) applying an electric potential across the transmembrane pore to allow the polynucleotide binding protein to move through the first blocking element and a partial region of the second blocking element, and controlling the movement of the polynucleotide through the transmembrane pore;
(d)随着多核苷酸相对于所述跨膜孔移动,获取一个或多个测量值,其中测量值代表多核苷酸的一个或多个特征,并由此表征多核苷酸。(d) obtaining one or more measurements as the polynucleotide moves relative to the transmembrane pore, wherein the measurements represent one or more characteristics of the polynucleotide, and thereby characterize the polynucleotide.
在一些实施方式中,当衔接体仅包含紧邻设置的一个第一阻滞元件与一个第二阻滞元件时,多核苷酸结合蛋白在阻滞元件的联合作用下停滞在衔接体的第一阻滞元件之前,然后在电势的作用下移动穿过衔接体的主体链。在主体链上的移动顺序依次为:第一阻滞元件、与第二阻滞元件互补的区段、第二区段,然后通过与主体链连接的多核苷酸链。在一些实施方式中,多核苷酸结合蛋白不穿过衔接体的阻挡链。在一些实施方式中,多核苷酸结合蛋白还穿过衔接体的阻挡链,当第二阻滞元件位于阻挡链上时,在阻挡链上的移动顺序依次为:第四区段、第二阻滞元件的核苷酸。多核苷酸结合蛋白不穿过第二阻滞元件的修饰物。 In some embodiments, when the adapter only comprises a first retarding element and a second retarding element that are arranged adjacent to each other, the polynucleotide binding protein stops in front of the first retarding element of the adapter under the combined action of the retarding elements, and then moves through the main chain of the adapter under the action of the electric potential. The order of movement on the main chain is: the first retarding element, the segment complementary to the second retarding element, the second segment, and then through the polynucleotide chain connected to the main chain. In some embodiments, the polynucleotide binding protein does not pass through the blocking chain of the adapter. In some embodiments, the polynucleotide binding protein also passes through the blocking chain of the adapter, and when the second retarding element is located on the blocking chain, the order of movement on the blocking chain is: the fourth segment, the nucleotide of the second retarding element. The polynucleotide binding protein does not pass through the modifier of the second retarding element.
这些方法是可能的,因为跨膜蛋白孔可用于区分具有相似结构的核苷酸,这是基于它们对通过所述孔的电流具有不同的效应。可根据各个核苷酸与所述孔相互作用时它们的电流振幅,在单分子水平上鉴定各个核苷酸。如果电流以对某种核苷酸特异性的方式流经所述孔(即如果检测到与该核苷酸相关的特征性电流流过所述孔),那么该核苷酸就在所述孔中存在。连续鉴定多核苷酸中的核苷酸,使得能够估计或确定所述多核苷酸的序列。These methods are possible because transmembrane protein pores can be used to distinguish nucleotides with similar structures based on their different effects on the current passing through the pore. Individual nucleotides can be identified at the single molecule level based on their current amplitude when they interact with the pore. If the current flows through the pore in a manner specific for a certain nucleotide (i.e., if a characteristic current associated with the nucleotide is detected flowing through the pore), then the nucleotide is present in the pore. Continuous identification of nucleotides in a polynucleotide enables the sequence of the polynucleotide to be estimated or determined.
因此,所述方法涉及为了对所述多核苷酸进行测序,当所述多核苷酸中的一部分核苷酸逐个通过所述桶状体或通道时,对所述核苷酸进行跨膜孔传感。如上所述,这是链测序。Thus, the method involves sensing a portion of the nucleotides in the polynucleotide across a membrane pore as they pass through the barrel or channel one by one in order to sequence the polynucleotide. As described above, this is strand sequencing.
在一些实施方案中,所述方法包括提供系链用于使所述构建体靠近所述跨膜孔;所述系链包括捕获区和锚定区,所述捕获区用于捕获所述构建体的衔接体,所述锚定区用于与所述跨膜孔或所述跨膜孔所在的膜锚定结合。In some embodiments, the method includes providing a tether for bringing the construct into proximity with the transmembrane pore; the tether includes a capture region and an anchor region, wherein the capture region is used to capture the adaptor of the construct, and the anchor region is used to anchor and bind to the transmembrane pore or the membrane in which the transmembrane pore is located.
可使用该方法对所述多核苷酸的全部或仅一部分进行测序。所述多核苷酸可为任意长度。例如,所述多核苷酸可为至少10、至少50、至少100、至少150、至少200、至少250、至少300、至少400或至少500个核苷酸对的长度。所述多核苷酸可为1000或更多个核苷酸对,5000或更多个核苷酸对或者100000或更多个核苷酸对的长度。所述多核苷酸可为天然存在的或人造的。例如,所述方法可用于验证制造的寡核苷酸的序列。所述方法通常在体外进行。The method can be used to sequence all or only a portion of the polynucleotide. The polynucleotide can be of any length. For example, the polynucleotide can be at least 10, at least 50, at least 100, at least 150, at least 200, at least 250, at least 300, at least 400 or at least 500 nucleotide pairs in length. The polynucleotide can be 1000 or more nucleotide pairs, 5000 or more nucleotide pairs or 100000 or more nucleotide pairs in length. The polynucleotide can be naturally occurring or artificial. For example, the method can be used to verify the sequence of the oligonucleotide manufactured. The method is usually performed in vitro.
所述单链多核苷酸可在所述膜的任一侧与所述孔相互作用。所述单链多核苷酸可以以任何方式在任何位点与所述孔相互作用。The single stranded polynucleotide may interact with the pore on either side of the membrane.The single stranded polynucleotide may interact with the pore at any site in any manner.
在所述单链多核苷酸中的核苷酸与所述孔相互作用的过程中,所述核苷酸以该核苷酸特异性的方式影响流经所述孔的电流。例如,特定的核苷酸将降低流经所述孔的电流,这一降低持续特定的平均时长并且达到特定的程度。换言之,流经所述孔的电流对于特定的核苷酸是特征性的。可进行对照实验,以确定特定的核苷酸对流经所述孔的电流的影响。然后,可以将对测试样品进行本发明的方法所获得的结果与获自这类对照实验的结果进行比较,以确定或估计所述多核苷酸的序列。In the process of the nucleotide in the single-stranded polynucleotide interacting with the hole, the nucleotide affects the current flowing through the hole in a manner specific to the nucleotide. For example, a specific nucleotide will reduce the current flowing through the hole, and this reduction lasts for a specific average duration and reaches a specific degree. In other words, the current flowing through the hole is characteristic for a specific nucleotide. Control experiments can be performed to determine the influence of a specific nucleotide on the current flowing through the hole. Then, the results obtained by performing the method of the present invention on the test sample can be compared with the results obtained from such control experiments to determine or estimate the sequence of the polynucleotide.
所述测序方法可使用任何合适的膜/孔系统进行,在所述膜/孔系统中孔被插入膜中。所述方法通常使用包含天然存在的或合成的脂质的膜进行。所述膜通常是在体外形成。优选地不使用分离的天然存在的含孔膜、或表达孔的细胞进行所述方法。所述方法优选地使用人造膜进行。除了所述孔之外,所述膜可包含其他跨膜蛋白和/或膜内蛋白以及其他分子。The sequencing method can be performed using any suitable membrane/pore system in which the pore is inserted into the membrane. The method is typically performed using a membrane comprising naturally occurring or synthetic lipids. The membrane is typically formed in vitro. The method is preferably not performed using an isolated naturally occurring pore-containing membrane, or a cell expressing a pore. The method is preferably performed using an artificial membrane. In addition to the pore, the membrane may contain other transmembrane proteins and/or intramembrane proteins and other molecules.
当分析物为多核苷酸外的其它分析物,例如多肽时,在表征多肽的方法中,首先将多肽与核酸连接获得核酸-多肽连接物,然后使本发明的衔接体与核酸-多肽连接物连接,从而对多肽进行表征。表征多肽的方法的其他步骤与表征多核苷酸的步骤类似。试剂盒 When the analyte is an analyte other than a polynucleotide, such as a polypeptide, in the method for characterizing the polypeptide, the polypeptide is first connected to a nucleic acid to obtain a nucleic acid-polypeptide conjugate, and then the adaptor of the present invention is connected to the nucleic acid-polypeptide conjugate, thereby characterizing the polypeptide. The other steps of the method for characterizing the polypeptide are similar to the steps for characterizing the polynucleotide. Kit
本发明还提供了用于制备用于表征分析物例如多核苷酸的试剂盒。所述试剂盒包含(a)本发明的衔接体,和(b)多核苷酸结合蛋白,和/或(c)跨膜孔,以及任选的系链。The present invention also provides a kit for preparing a method for characterizing an analyte such as a polynucleotide, wherein the kit comprises (a) an adaptor of the present invention, and (b) a polynucleotide binding protein, and/or (c) a transmembrane pore, and optionally a tether.
所述试剂盒优选地还包含一个或多个与跨膜孔相互作用时产生特征性电流的标志物。这类标志物在上文详细描述。所述试剂盒优选还包含将所述多核苷酸偶联到膜上的工具(means)。上文描述了将所述多核苷酸偶联到膜上的工具。所述偶联的工具优选地包含反应基团。合适的基团包括但不 限于巯基、胆固醇、脂质和生物素基团。所述试剂盒还可包含膜的组分,例如形成脂双层所需的磷脂。The kit preferably also comprises one or more markers that produce a characteristic current when interacting with the transmembrane pore. Such markers are described in detail above. The kit preferably also comprises a means for coupling the polynucleotide to the membrane. The means for coupling the polynucleotide to the membrane are described above. The means for coupling preferably comprises a reactive group. Suitable groups include but are not limited to Limited to sulfhydryl, cholesterol, lipid and biotin groups. The kit may also contain components of membranes, such as phospholipids required for lipid bilayer formation.
任何上文详述的关于本发明方法的实施方案同样适用于本发明的试剂盒。Any of the embodiments detailed above with respect to the methods of the invention are equally applicable to the kits of the invention.
本发明的试剂盒可额外地包含一种或多种可使上述任何实施方案得以实施的其他试剂或仪器。这类试剂或仪器包括下述的一种或多种:合适的缓冲液(水性溶液)、用于从受试者取样的工具(例如包含针的管或器具)、用于扩增和/或表达多核酸的工具,如上文定义的膜或电压钳或膜片钳装置。试剂可以以干燥状态存在于所述试剂盒中,这样可以用流体样品重悬所述试剂。任选地,所述试剂盒还可包括使所述试剂盒可用于本发明方法的说明书,或者关于所述方法可用于哪些患者的详细说明。任选地,所述试剂盒可包括核苷酸。The kit of the present invention may additionally comprise one or more other reagents or instruments that enable any of the above embodiments to be implemented. Such reagents or instruments include one or more of the following: a suitable buffer (aqueous solution), a tool for sampling from a subject (e.g., a tube or apparatus comprising a needle), a tool for amplifying and/or expressing polynucleic acids, such as a membrane or voltage clamp or patch clamp device as defined above. The reagents may be present in the kit in a dry state so that the reagents can be resuspended with a fluid sample. Optionally, the kit may also include instructions for making the kit useful in the method of the present invention, or detailed instructions on which patients the method may be used for. Optionally, the kit may include nucleotides.
实施例Example
以下各实施例中未具体注明的实验操作细节可以参考本文为所引用的参考文献,所采用的实验试剂和仪器设备均为常规商业可得的试剂或仪器,所采用的序列由生物公司合成。For experimental operation details not specifically noted in the following examples, references may be made to the references cited herein. The experimental reagents and instruments used are all conventional commercially available reagents or instruments, and the sequences used are synthesized by a biological company.
实施例1:含阻滞元件的E1接头的制备以及质检Example 1: Preparation and quality inspection of E1 linker containing blocking element
分别合成主体链、荧光链和阻挡链,将主体链、荧光链和阻挡链分别以1:1.1:1.1的比例进行退火处理,形成如图1所示的Y型接头。退火处理具体为从95℃缓慢降温到25℃,降温幅度不超过0.1℃/s。退火处理体系包括160mM HEPES 7.0,200mM NaCl,退火体系中主体链的浓度为4-8uM。The main chain, fluorescent chain and blocking chain were synthesized separately, and the main chain, fluorescent chain and blocking chain were annealed at a ratio of 1:1.1:1.1 to form a Y-type connector as shown in Figure 1. The annealing treatment was specifically to slowly cool from 95°C to 25°C, with the cooling amplitude not exceeding 0.1°C/s. The annealing treatment system included 160mM HEPES 7.0, 200mM NaCl, and the concentration of the main chain in the annealing system was 4-8uM.
取500nM Y型接头、6倍物质的量的解旋酶E1(具有M1G/E94C/C109A/C136A/A360C突变的SEQ ID NO.:14)和1.5mM TMAD(偶氮二甲酰胺)混合并室温孵育30分钟,制备得到测序接头复合物QMX1-QMX10。将测序接头复合物加入DNAPac PA200柱,用洗脱缓冲液进行纯化,以将没有结合到测序接头复合物上的酶从柱子上洗脱掉。然后用10倍柱体积的缓冲液A和缓冲液B的混合物对测序接头复合物进行洗脱。然后汇集主洗脱峰,测量其浓度,并用TBE PAGE凝胶160V下运行40分钟。其中,缓冲液A:20mMNa-CHES,250mM NaCl,4%(W/V)甘油,pH 8.6;缓冲液B:20mM Na-CHES,1MNaCl,4%(W/V)甘油,pH 8.6。500nM Y-type adapter, 6 times the amount of substance of helicase E1 (SEQ ID NO.: 14 with M1G/E94C/C109A/C136A/A360C mutations) and 1.5mM TMAD (azodicarbonamide) were mixed and incubated at room temperature for 30 minutes to prepare sequencing adapter complexes QMX1-QMX10. The sequencing adapter complex was added to a DNAPac PA200 column and purified with elution buffer to elute the enzyme that was not bound to the sequencing adapter complex from the column. The sequencing adapter complex was then eluted with a mixture of buffer A and buffer B with 10 times the column volume. The main elution peaks were then pooled, their concentrations measured, and run on a TBE PAGE gel at 160V for 40 minutes. Among them, buffer A: 20mM Na-CHES, 250mM NaCl, 4% (W/V) glycerol, pH 8.6; buffer B: 20mM Na-CHES, 1M NaCl, 4% (W/V) glycerol, pH 8.6.
本实施例的接头所使用的的主体链、荧光链和阻挡链的序列如下:The sequences of the main chain, fluorescent chain and blocking chain used in the linker of this example are as follows:
主体链序列:Y1(如SEQ ID NO.:1所示)、Y1-3C3(如SEQ ID NO.:2所示)、Y1-5C3(如SEQ ID NO.:3所示)、Y1-5C3-cPIP(如SEQ ID NO.:4所示)Main chain sequence: Y1 (as shown in SEQ ID NO.: 1), Y1-3C3 (as shown in SEQ ID NO.: 2), Y1-5C3 (as shown in SEQ ID NO.: 3), Y1-5C3-cPIP (as shown in SEQ ID NO.: 4)
荧光链序列:Y2(如SEQ ID NO.:8所示)Fluorescent chain sequence: Y2 (as shown in SEQ ID NO.: 8)
阻挡链序列:B(如SEQ ID NO.:9所示)、B-LNA(如SEQ ID NO.:10所示)、B-PNA(如SEQ ID NO.:11所示)、B-cPIP(如SEQ ID NO.:12所示)Blocking chain sequence: B (as shown in SEQ ID NO.: 9), B-LNA (as shown in SEQ ID NO.: 10), B-PNA (as shown in SEQ ID NO.: 11), B-cPIP (as shown in SEQ ID NO.: 12)
质检:取0.2pmol的测序接头复合物加入到测序buffer体系中(含有10mM HEPES 7.0、500mM KCl、100mM MgCL2、50mM ATP),其中cPIP组加入6倍物质的量的cPIP,放置于40℃,室温孵育30min,并用TBE PAGE凝胶160V下运行40分钟,用Cy5模式进行扫胶,之后进行灰度计算:未结合酶条带/(结合酶条带+未结合酶条带)比例即为其酶脱落率,结果示出在下表1中。Quality inspection: 0.2 pmol of sequencing adapter complex was added to the sequencing buffer system (containing 10 mM HEPES 7.0, 500 mM KCl, 100 mM MgCL2, 50 mM ATP), where 6 times the amount of cPIP was added to the cPIP group, placed at 40°C, incubated at room temperature for 30 min, and run TBE PAGE gel at 160 V for 40 min, scanned with Cy5 mode, and then calculated the grayscale: the ratio of unbound enzyme band/(bound enzyme band + unbound enzyme band) is the enzyme detachment rate. The results are shown in Table 1 below.
表1:QMX1-QMX10的酶脱落率结果
Table 1: Enzyme shedding rate results of QMX1-QMX10
结果解读:1号和2号样品,当仅阻挡链上有PNA何LNA修饰的核苷酸作为阻滞元件,而主体链上没有阻滞元件时,其质检酶脱落率分别为100%和100%。3和4号样品,当仅主体链上有3C3和5C3作为阻滞元件,而阻挡链上没有阻滞元件时,其质检酶脱落率分别为70.9%和60.9%。当主体链上分别有3C3和5C3作为阻滞元件且阻挡链链上有增强双链Tm值的LNA修饰时,其质检酶脱落率分别下降到51.8%和50.1(7和8号样品)。而当主体链上分别有3C3和5C3作为阻滞元件且阻挡链上有PNA修饰时,其质检酶脱落率分别下降到36.1%和28.7%(5和6号样品)。9号样品为主体链上存在5C3,但不存在cPIP的样品;10号样品为主体链上存在5C3作为阻滞元件,且存在与双链互作的cPIP作为另一阻滞元件的样品。从9号和10号样本可以看出,在cPIP存在的情况下,酶质检脱落率由60.5%下降到51.8%。Interpretation of results: For samples 1 and 2, when only PNA and LNA modified nucleotides were used as retarding elements on the blocking strand, and there was no retarding element on the main strand, the quality inspection enzyme shedding rates were 100% and 100%, respectively. For samples 3 and 4, when only 3C3 and 5C3 were used as retarding elements on the main strand, and there was no retarding element on the blocking strand, the quality inspection enzyme shedding rates were 70.9% and 60.9%, respectively. When 3C3 and 5C3 were used as retarding elements on the main strand, and the blocking strand was modified with LNA to enhance the double-stranded Tm value, the quality inspection enzyme shedding rates dropped to 51.8% and 50.1, respectively (samples 7 and 8). When 3C3 and 5C3 were used as retarding elements on the main strand, and there was PNA modification on the blocking strand, the quality inspection enzyme shedding rates dropped to 36.1% and 28.7%, respectively (samples 5 and 6). Sample No. 9 is a sample with 5C3 on the main chain but no cPIP; Sample No. 10 is a sample with 5C3 on the main chain as a blocking element and cPIP interacting with the double chain as another blocking element. It can be seen from samples No. 9 and No. 10 that in the presence of cPIP, the enzyme quality inspection shedding rate dropped from 60.5% to 51.8%.
实施例2:含阻滞元件的E2接头的制备以及质检Example 2: Preparation and quality inspection of E2 linker containing blocking element
分别合成主体链、荧光链和阻挡链,将主体链、荧光链和阻挡链分别以1:1.1:1.1的比例进行退火处理,形成Y型接头。退火处理具体为从95℃缓慢降温到25℃,降温幅度不超过0.1℃/s。退火处理体系包括160mM HEPES 7.0,200mM NaCl,退火体系中主体链的浓度为4-8uM。The main chain, fluorescent chain and blocking chain were synthesized separately, and the main chain, fluorescent chain and blocking chain were annealed at a ratio of 1:1.1:1.1 to form a Y-type connector. The annealing treatment was specifically to slowly cool from 95°C to 25°C, with the cooling amplitude not exceeding 0.1°C/s. The annealing treatment system included 160mM HEPES 7.0, 200mM NaCl, and the concentration of the main chain in the annealing system was 4-8uM.
取500nM Y型接头、5倍物质的量的解旋酶E2(具有D99C/A366C/C308T/C419D/E286K/F246Y/S293N/V422H突变的SEQ ID NO.:15)和1.5mM TMAD(偶氮二甲酰胺)混合并室温孵育30分钟,制备得到测序接头复合物QMX11-QMX14。将测序接头复合物复合物加入DNAPac PA200柱,用洗脱缓冲液进行纯化,以将没有结合到测序接头复合物上的酶从柱子上洗脱掉。然后用10倍柱体积的缓冲液A和缓冲液B的混合物对测序接头复合物进行洗脱。然后汇集主洗脱峰,测量其浓度,并用TBE PAGE凝胶160V下运行40分钟。其中,缓冲液A:20mMNa-CHES,250mM NaCl,4%(W/V)甘油,pH 8.6;缓冲液B:20mM Na-CHES,1MNaCl,4%(W/V)甘油,pH 8.6。500 nM Y-type adapter, 5 times the amount of substance of helicase E2 (SEQ ID NO.: 15 with D99C/A366C/C308T/C419D/E286K/F246Y/S293N/V422H mutations) and 1.5 mM TMAD (azodicarbonamide) were mixed and incubated at room temperature for 30 minutes to prepare sequencing adapter complexes QMX11-QMX14. The sequencing adapter complex complex was added to a DNAPac PA200 column and purified with elution buffer to elute the enzyme that was not bound to the sequencing adapter complex from the column. The sequencing adapter complex was then eluted with a mixture of buffer A and buffer B with 10 times the column volume. The main elution peak was then pooled, its concentration was measured, and a TBE PAGE gel was run at 160V for 40 minutes. Among them, buffer A: 20mM Na-CHES, 250mM NaCl, 4% (W/V) glycerol, pH 8.6; buffer B: 20mM Na-CHES, 1M NaCl, 4% (W/V) glycerol, pH 8.6.
本实施例的接头所使用的的主体链、荧光链和阻挡链的序列如下:The sequences of the main chain, fluorescent chain and blocking chain used in the linker of this example are as follows:
主体链序列:Y1-Sp(如SEQ ID NO.:5所示)、Y1-2Sp(如SEQ ID NO.:6所示) Main chain sequence: Y1-Sp (as shown in SEQ ID NO.: 5), Y1-2Sp (as shown in SEQ ID NO.: 6)
荧光链序列:Y2(如SEQ ID NO.:8所示)Fluorescent chain sequence: Y2 (as shown in SEQ ID NO.: 8)
阻挡链序列:B(如SEQ ID NO.:9所示)、B-LNA(如SEQ ID NO.:10所示)Blocking chain sequence: B (as shown in SEQ ID NO.: 9), B-LNA (as shown in SEQ ID NO.: 10)
质检:取0.2pmol的测序接头复合物加入到测序buffer体系中(含有10mM HEPES 7.0,500mM KCl,100mM MgCL2,50mM ATP),放置于34℃,室温孵育30min,并用TBE PAGE凝胶160V下运行40分钟,用Cy5模式进行扫胶,之后进行灰度计算:未结合酶条带/(结合酶条带+未结合酶条带)比例即为其酶脱落率,结果示出在下表2中。Quality inspection: Take 0.2 pmol of sequencing adapter complex and add it to the sequencing buffer system (containing 10mM HEPES 7.0, 500mM KCl, 100mM MgCL2, 50mM ATP), place it at 34℃, incubate it at room temperature for 30min, run TBE PAGE gel at 160V for 40min, scan the gel in Cy5 mode, and then calculate the grayscale: the ratio of unbound enzyme band/(bound enzyme band + unbound enzyme band) is the enzyme detachment rate. The results are shown in Table 2 below.
表2:QMX11-QMX14的酶脱落率结果
Table 2: Enzyme shedding rate results of QMX11-QMX14
结果解读:11和12号样品分别使用1个Sp和2个Sp作为主体链上的阻滞元件,而阻挡链上没有阻滞元件时,其接头质检酶脱落率分别为35.2%和18.1%;样品13和样品14,当主体链上有1个Sp和2个Sp作为主体链上的阻滞元件,且阻挡链上有LNA修饰时,其质检酶脱落率分别下降到13.5%和8.6%。Interpretation of results: When samples 11 and 12 used 1 Sp and 2 Sp as blocking elements on the main chain, respectively, and there was no blocking element on the blocking chain, their linker quality inspection enzyme shedding rates were 35.2% and 18.1%, respectively; for samples 13 and 14, when there were 1 Sp and 2 Sp as blocking elements on the main chain and LNA modification on the blocking chain, their quality inspection enzyme shedding rates dropped to 13.5% and 8.6%, respectively.
实施例3:含单一阻滞元件的接头与含联合阻滞的接头的芯片混库测试Example 3: Chip mixed library test of linkers containing single blocking elements and linkers containing combined blocking elements
通过末端修复方式制备长为10kb的四类文库,并且分别用实施例2中制备的QMX-11(Y1-Sp/Y2/B/E2)、QMX-12(Y1-2Sp/Y2/B/E2),QMX-13(Y1-Sp/Y2/B-LNA/E2)、QMX-14(Y1-2Sp/Y2/B-LNA/E2)与文库进行连接建库。将等物质的量的QMX-11、QMX-12、QMX-13、QMX-14文库均匀加载到三张不同的齐碳科技有限公司QNome-9604上进行测序。测序过程中所使用的系链的序列如SEQ ID NO:13所示。其混测数据通量对比如下表3所示。Four types of libraries with a length of 10 kb were prepared by end repair, and QMX-11 (Y1-Sp/Y2/B/E2), QMX-12 (Y1-2Sp/Y2/B/E2), QMX-13 (Y1-Sp/Y2/B-LNA/E2), and QMX-14 (Y1-2Sp/Y2/B-LNA/E2) prepared in Example 2 were connected to the library for construction. The same amount of QMX-11, QMX-12, QMX-13, and QMX-14 libraries were evenly loaded onto three different QNome-9604s of Qi Carbon Technology Co., Ltd. for sequencing. The sequence of the tether used in the sequencing process is shown in SEQ ID NO: 13. The comparison of the mixed test data throughput is shown in Table 3 below.
表3:三张芯片的混测数据通量
Table 3: Mixed test data throughput of three chips
结果解读:阻滞效果越好,则混合测试时其数据量会越多,即测序信号条数越多。从表中可以看到仅存在Sp或2Sp作为单一阻滞元件时,其测得文库数据量在混合测试芯片中均显著低于同时存在1Sp/LNA或2Sp/LNA等联合阻滞元件的文库。因此,包含两种联合阻滞元件的接头在测序过程中的阻滞效果显著优于仅包含单一阻滞元件的接头。Interpretation of results: The better the blocking effect, the more data will be obtained during the mixed test, that is, the more sequencing signals will be obtained. From the table, it can be seen that when only Sp or 2Sp is used as a single blocking element, the amount of library data measured in the mixed test chip is significantly lower than that of the library with combined blocking elements such as 1Sp/LNA or 2Sp/LNA. Therefore, the blocking effect of the connector containing two combined blocking elements during sequencing is significantly better than that of the connector containing only a single blocking element.
实施例4:含联合阻滞元件的接头与商业化接头的芯片混库测试 Example 4: Chip mixed library test of adapters containing joint blocking elements and commercial adapters
用实施例1的方法制备商业化含4C18的接头复合物QMX-15(Y1-4C18/Y2/B/E2),该接头复合物中的接头由主体链Y1-4C18(如SEQ ID NO.:7所示)、荧光链(如SEQ ID NO.:8所示)和阻挡链(如SEQ ID NO.:9所示)形成。The commercial 4C18-containing linker complex QMX-15 (Y1-4C18/Y2/B/E2) was prepared using the method of Example 1. The linker in the linker complex was formed by a main chain Y1-4C18 (as shown in SEQ ID NO.: 7), a fluorescent chain (as shown in SEQ ID NO.: 8) and a blocking chain (as shown in SEQ ID NO.: 9).
通过末端修复方式制备长为10kb的文库,并且分别用实施例2中制备QMX-13和商业化含4C18阻滞元件的接头QMX-15(Y1-4C18/Y2/B/E2)与文库进行连接建库。将等物质的量的QMX-13和QMX-15文库均匀加载到三张不同的齐碳科技有限公司QNome-9604上进行测序。测序过程中所使用的系链的序列如SEQ ID NO:13所示。其混测数据通量对比如下表4所示。A 10 kb library was prepared by end repair, and the QMX-13 prepared in Example 2 and the commercial linker QMX-15 (Y1-4C18/Y2/B/E2) containing the 4C18 blocking element were connected to the library. Equal amounts of QMX-13 and QMX-15 libraries were evenly loaded onto three different QNome-9604s of Qi Carbon Technology Co., Ltd. for sequencing. The sequence of the tether used in the sequencing process is shown in SEQ ID NO: 13. The comparison of the mixed test data throughput is shown in Table 4 below.
表4:三张芯片的混测数据通量
Table 4: Mixed test data throughput of three chips
结果解读:采用含联合阻滞元件的接头与商业化含4C18阻滞元件的接头进行混测的数据显示,本发明的含联合阻滞元件的接头在测序过程中的阻滞效果不输商业化含4C18的接头,甚至比商业化接头的阻滞效果更好。 Interpretation of results: The data of mixed tests using adapters containing combined blocking elements and commercial adapters containing 4C18 blocking elements showed that the blocking effect of the adapters containing combined blocking elements of the present invention during sequencing is not inferior to that of commercial adapters containing 4C18, and is even better than that of commercial adapters.

Claims (22)

  1. 一种用于表征分析物的衔接体,所述衔接体包含一个或多个第一阻滞元件和一个或多个与所述第一阻滞元件不同的第二阻滞元件,并且在所述衔接体与多核苷酸结合蛋白接触之后,所述一个或多个第一阻滞元件和所述一个或多个第二阻滞元件能够联合作用来停滞所述多核苷酸结合蛋白。A linker for characterizing an analyte, the linker comprising one or more first blocking elements and one or more second blocking elements different from the first blocking elements, and after the linker is contacted with a polynucleotide binding protein, the one or more first blocking elements and the one or more second blocking elements can work together to arrest the polynucleotide binding protein.
  2. 根据权利要求1所述的衔接体,其中所述衔接体包含互补的主体链和阻挡链,所述一个或多个第一阻滞元件以共价键连接到所述主体链上;所述一个或多个第二阻滞元件以共价键修饰到所述主体链或所述阻挡链上或以非共价键与所述主体链和/或所述阻挡链互作;The adapter according to claim 1, wherein the adapter comprises a complementary main chain and a blocking chain, the one or more first blocking elements are covalently linked to the main chain; the one or more second blocking elements are covalently modified to the main chain or the blocking chain or interact with the main chain and/or the blocking chain by non-covalent bonds;
    优选地,所述一个或多个第二阻滞元件与所述阻挡链或所述主体链互补。Preferably, the one or more second blocking elements are complementary to the barrier chain or the main chain.
  3. 根据权利要求1或2所述的衔接体,其中所述一个或多个第一阻滞元件具有与核苷酸不同的结构;所述一个或多个第二阻滞元件具有用于提高双链稳定性的结构。The adapter according to claim 1 or 2, wherein the one or more first blocking elements have a structure different from that of nucleotides; and the one or more second blocking elements have a structure for improving double-stranded stability.
  4. 根据权利要求1至3中任一项所述的衔接体,其中所述第一阻滞元件包含一个或多个选自由有机寡阳离子、iSpC3、iSp18、iSp9、硝基吲哚、肌苷、吖啶、2-氨基嘌呤、2-6-二氨基嘌呤、5-溴-脱氧尿嘧啶、反向胸苷(反向dT)、反向二脱氧胸苷(ddT)、二脱氧胞苷(ddC)、5-甲基胞苷酸、5-羟甲基胞苷、2'-O-甲基RNA碱基、异脱氧胞苷(异-dC)、异脱氧鸟苷(异-dG)、光裂解(PC)的基团或己二醇组成的群组;所述第二阻滞元件包含一个或多个选自由经锁核酸(LNA)、肽核酸(PNA)、甲氧基(OMe)、双环核苷(BNA)、甘油核酸(GNA)、苏糖核酸(TNA)、环吡咯咪唑聚酰胺(cPIP)或双链结合蛋白修饰的核苷酸组成的群组。The adapter according to any one of claims 1 to 3, wherein the first blocking element comprises one or more selected from the group consisting of organic oligocations, iSpC3, iSp18, iSp9, nitroindole, inosine, acridine, 2-aminopurine, 2-6-diaminopurine, 5-bromo-deoxyuracil, inverted thymidine (inverted dT), inverted dideoxythymidine (ddT), dideoxycytidine (ddC), 5-methylcytidylic acid, 5-hydroxymethylcytidine, 2'-O-methyl RNA base, isodeoxycytidine (iso-dC), isodeoxyguanosine (iso-dG), a photocleavable (PC) group or hexanediol; the second blocking element comprises one or more selected from the group consisting of nucleotides modified with locked nucleic acid (LNA), peptide nucleic acid (PNA), methoxy (OMe), bicyclic nucleoside (BNA), glycerol nucleic acid (GNA), threose nucleic acid (TNA), cyclopyrrolimidazole polyamide (cPIP) or double-stranded binding protein.
  5. 根据权利要求4所述的衔接体,其中所述有机寡阳离子具有通式Bj,Bj是j‐聚体的有机寡阳离子部分,j=1‐50,其中B选自包括以下基团的组:The adapter according to claim 4, wherein the organic oligocation has the general formula Bj, Bj is the organic oligocation part of the j-mer, j=1-50, wherein B is selected from the group including the following groups:
    ‐HPO3‐R1‐(X‐R2 n)n1‐X‐R3‐O‐,其中R1、R2 n和R3是相同或不同的C1‐C5低级亚烷基,X是NH或NC(NH2)2,n1=2‐20,-HPO 3 -R 1 -(X-R 2 n ) n1 -X-R 3 -O-, wherein R 1 , R 2 n and R 3 are the same or different C1-C5 lower alkylene groups, X is NH or NC(NH 2 ) 2 , n1=2-20,
    ‐HPO3‐R4‐CH(R5X1)‐R6‐O‐,其中R4是C1‐C5低级亚烷基,R5和R6是相同或不同的C1‐C5低级亚烷基,X1为腐胺、亚精胺或精胺残基,-HPO 3 -R 4 -CH(R 5 X 1 )-R 6 -O-, wherein R 4 is a C1-C5 lower alkylene group, R 5 and R 6 are the same or different C1-C5 lower alkylene groups, and X 1 is a putrescine, spermidine or spermine residue,
    ‐HPO3‐R7‐(aa)n2‐R8‐O‐,其中R7是C1‐C5低级亚烷基,R8是C1‐C5低级亚烷基、丝氨酸、天然氨基醇,(aa)n2是含有具有阳离子侧链的天然氨基酸的肽,n2=2‐20;‐HPO 3 ‐R 7 ‐(aa) n2 ‐R 8 ‐O‐, wherein R 7 is C1‐C5 lower alkylene, R 8 is C1‐C5 lower alkylene, serine, natural amino alcohol, (aa) n2 is a peptide containing a natural amino acid having a cationic side chain, and n2=2‐20;
    优选地,所述有机寡阳离子选自精胺(Sp)。Preferably, the organic oligocation is selected from spermine (Sp).
  6. 根据权利要求1至5中任一项所述的衔接体,其中所述第一阻滞元件与所述第二阻滞元件不互补。The adapter according to any one of claims 1 to 5, wherein the first blocking element is not complementary to the second blocking element.
  7. 根据权利要求1至6中任一项所述的衔接体,其中所述衔接体还包含第三链,所述第三链与所述主体链部分互补;优选地,所述主体链与所述阻挡链的部分互补并且所述主体链与所述阻挡链的互补端用于与所述分析物直接或间接连接。 The adapter according to any one of claims 1 to 6, wherein the adapter further comprises a third chain, which is partially complementary to the main chain; preferably, the main chain is partially complementary to the blocking chain and the complementary ends of the main chain and the blocking chain are used to directly or indirectly connect to the analyte.
  8. 根据权利要求1至7中任一项所述的衔接体,其中所述分析物选自多核苷酸、多肽、脂质或多糖,优选多核苷酸,所述多核苷酸是完全双链多核苷酸、部分双链多核苷酸或单链多核苷酸。The adapter according to any one of claims 1 to 7, wherein the analyte is selected from a polynucleotide, a polypeptide, a lipid or a polysaccharide, preferably a polynucleotide, and the polynucleotide is a fully double-stranded polynucleotide, a partially double-stranded polynucleotide or a single-stranded polynucleotide.
  9. 一种用于表征分析物的构建体,所述构建体包含分析物和权利要求1-8中任一项所述的衔接体,其中所述衔接体与所述分析物的任一端或两端直接或间接连接。A construct for characterizing an analyte, the construct comprising the analyte and the adapter according to any one of claims 1 to 8, wherein the adapter is directly or indirectly linked to either or both ends of the analyte.
  10. 一种用于表征分析物的复合物,所述复合物包含多核苷酸结合蛋白,和权利要求1-8中任一所述的衔接体或权利要求9所述的构建体;A complex for characterizing an analyte, the complex comprising a polynucleotide binding protein, and the adaptor according to any one of claims 1 to 8 or the construct according to claim 9;
    其中所述多核苷酸结合蛋白在所述第一阻滞元件和所述第二阻滞元件的联合作用下停滞在所述衔接体上;wherein the polynucleotide binding protein is arrested on the adaptor under the combined action of the first blocking element and the second blocking element;
    优选地,所述多核苷酸结合蛋白衍生自多核苷酸处理酶;所述多核苷酸处理酶选自聚合酶、解旋酶或核酸外切酶。Preferably, the polynucleotide binding protein is derived from a polynucleotide handling enzyme; the polynucleotide handling enzyme is selected from a polymerase, a helicase or an exonuclease.
  11. 一种控制多核苷酸结合蛋白在分析物上装载的方法,所述方法包括:A method for controlling the loading of a polynucleotide binding protein onto an analyte, the method comprising:
    提供具有分析物的构建体,其中所述分析物在其一端或两端与衔接体直接或间接连接,所述衔接体包含一个或多个第一阻滞元件和一个或多个与所述第一阻滞元件不同的第二阻滞元件;以及将所述构建体与所述多核苷酸结合蛋白接触,使得所述多核苷酸结合蛋白在所述一个或多个第一阻滞元件和所述一个或多个第二阻滞元件的联合作用下停滞在所述衔接体上;或Providing a construct having an analyte, wherein the analyte is directly or indirectly connected to an adaptor at one or both ends thereof, the adaptor comprising one or more first retarding elements and one or more second retarding elements different from the first retarding elements; and contacting the construct with the polynucleotide binding protein, such that the polynucleotide binding protein is arrested on the adaptor under the combined action of the one or more first retarding elements and the one or more second retarding elements; or
    使多核苷酸结合蛋白装载到衔接体上,所述衔接体包含一个或多个第一阻滞元件和一个或多个与所述第一阻滞元件不同的第二阻滞元件,所述多核苷酸结合蛋白在所述一个或多个第一阻滞元件和所述一个或多个第二阻滞元件的联合作用下停滞在所述衔接体上;以及使所述装载有多核苷酸结合蛋白的衔接体连接到所述分析物。The polynucleotide binding protein is loaded onto a linker, wherein the linker comprises one or more first blocking elements and one or more second blocking elements different from the first blocking elements, and the polynucleotide binding protein is arrested on the linker under the combined action of the one or more first blocking elements and the one or more second blocking elements; and the linker loaded with the polynucleotide binding protein is connected to the analyte.
  12. 根据权利要求11所述的方法,其中所述衔接体如权利要求1至8中任一项所定义。The method according to claim 11, wherein the adapter is as defined in any one of claims 1 to 8.
  13. 根据权利要求11或12所述的方法,其中所述多核苷酸结合蛋白衍生自多核苷酸处理酶;所述多核苷酸处理酶选自聚合酶、解旋酶或核酸外切酶。The method according to claim 11 or 12, wherein the polynucleotide binding protein is derived from a polynucleotide processing enzyme; the polynucleotide processing enzyme is selected from a polymerase, a helicase or a nuclease.
  14. 一种控制分析物穿过跨膜孔的移动的方法,所述方法包括:A method of controlling movement of an analyte through a transmembrane pore, the method comprising:
    (a)实施权利要求11至13中任一项所述的方法;(a) carrying out the method according to any one of claims 11 to 13;
    (b)将步骤(a)中提供的装载有所述多核苷酸结合蛋白的分析物与所述跨膜孔接触;以及(b) contacting the analyte loaded with the polynucleotide binding protein provided in step (a) with the transmembrane pore; and
    (c)跨所述跨膜孔施加电势,使得所述多核苷酸结合蛋白移动穿过所述一个或多个第一阻滞元件、任选地所述一个或多个第二阻滞元件的部分区域,并控制所述分析物穿过所述跨膜孔的移动。(c) applying an electric potential across the transmembrane pore to allow the polynucleotide binding protein to move through the one or more first retardation elements, optionally a portion of the one or more second retardation elements, and control the movement of the analyte through the transmembrane pore.
  15. 根据权利要求14所述的方法,其中所述第二阻滞元件包含经共价修饰的核苷酸或经非共价修饰的核苷酸;当所述多核苷酸结合蛋白移动穿过所述一个或多个第二阻 滞元件时,所述多核苷酸结合蛋白移动穿过所述一个或多个第二阻滞元件的核苷酸,而不穿过所述一个或多个第二阻滞元件的共价或非共价修饰物。The method according to claim 14, wherein the second blocking element comprises a covalently modified nucleotide or a non-covalently modified nucleotide; when the polynucleotide binding protein moves through the one or more second blocking elements When the polynucleotide binding protein is bound to one or more second retardation elements, the polynucleotide binding protein moves through the nucleotides of the one or more second retardation elements without passing through the covalent or non-covalent modifications of the one or more second retardation elements.
  16. 根据权利要求14或15所述的方法,其中所述方法包括提供系链用于使所述装载有所述多核苷酸结合蛋白的分析物靠近所述跨膜孔;所述系链包括捕获区和锚定区,所述捕获区用于捕获所述衔接体,所述锚定区用于与所述跨膜孔或所述跨膜孔所在的膜锚定结合。A method according to claim 14 or 15, wherein the method comprises providing a tether for bringing the analyte loaded with the polynucleotide binding protein close to the transmembrane pore; the tether comprises a capture region and an anchor region, the capture region being used to capture the adapter, and the anchor region being used to anchor and bind to the transmembrane pore or the membrane where the transmembrane pore is located.
  17. 一种表征分析物的方法,所述方法包括:A method of characterizing an analyte, the method comprising:
    (a)实施权利要求14至16中任一项所述的方法;以及(a) carrying out the method of any one of claims 14 to 16; and
    (b)随着所述分析物相对于所述跨膜孔移动,获取一个或多个测量值,其中所述测量值代表所述分析物的一个或多个特征,并由此表征所述分析物。(b) obtaining one or more measurements as the analyte moves relative to the transmembrane pore, wherein the measurements represent one or more characteristics of the analyte and thereby characterize the analyte.
  18. 根据权利要求17所述的方法,其中所述跨膜孔是蛋白孔或固态孔,和/或,所述膜是两亲层或固态层。The method according to claim 17, wherein the transmembrane pore is a protein pore or a solid-state pore, and/or the membrane is an amphiphilic layer or a solid-state layer.
  19. 根据权利要求17或18所述的方法,其中所述分析物选自多核苷酸、多肽、脂质或多糖,优选多核苷酸,所述多核苷酸是完全双链多核苷酸、部分双链多核苷酸或单链多核苷酸。The method according to claim 17 or 18, wherein the analyte is selected from polynucleotides, polypeptides, lipids or polysaccharides, preferably polynucleotides, and the polynucleotides are fully double-stranded polynucleotides, partially double-stranded polynucleotides or single-stranded polynucleotides.
  20. 一种表征分析物的试剂盒,所述试剂盒包含:A kit for characterizing an analyte, the kit comprising:
    (a)权利要求1至8中任一项所述的衔接体,和(b)多核苷酸结合蛋白,和/或(c)跨膜孔。(a) the adaptor of any one of claims 1 to 8, and (b) a polynucleotide binding protein, and/or (c) a transmembrane pore.
  21. 有机寡阳离子作为用于停滞多核苷酸结合蛋白的阻滞元件的用途,所述有机寡阳离子具有通式Bj,Bj是j‐聚体的有机寡阳离子部分,j=1‐50,其中B选自包括以下基团的组:Use of an organic oligocation as a blocking element for arresting a polynucleotide binding protein, the organic oligocation having the general formula Bj, Bj is the organic oligocation part of a j-mer, j=1-50, wherein B is selected from the group comprising the following groups:
    ‐HPO3‐R1‐(X‐R2 n)n1‐X‐R3‐O‐,其中R1、R2 n和R3是相同或不同的C1‐C5低级亚烷基,X是NH或NC(NH2)2,n1=2‐20,-HPO 3 -R 1 -(X-R 2 n ) n1 -X-R 3 -O-, wherein R 1 , R 2 n and R 3 are the same or different C1-C5 lower alkylene groups, X is NH or NC(NH 2 ) 2 , n1=2-20,
    ‐HPO3‐R4‐CH(R5X1)‐R6‐O‐,其中R4是C1‐C5低级亚烷基,R5和R6是相同或不同的C1‐C5低级亚烷基,X1为腐胺、亚精胺或精胺残基,-HPO 3 -R 4 -CH(R 5 X 1 )-R 6 -O-, wherein R 4 is a C1-C5 lower alkylene group, R 5 and R 6 are the same or different C1-C5 lower alkylene groups, and X 1 is a putrescine, spermidine or spermine residue,
    ‐HPO3‐R7‐(aa)n2‐R8‐O‐,其中R7是C1‐C5低级亚烷基,R8是C1‐C5低级亚烷基、丝氨酸、天然氨基醇,(aa)n2是含有具有阳离子侧链的天然氨基酸的肽,n2=2‐20;‐HPO 3 ‐R 7 ‐(aa) n2 ‐R 8 ‐O‐, wherein R 7 is C1‐C5 lower alkylene, R 8 is C1‐C5 lower alkylene, serine, natural amino alcohol, (aa) n2 is a peptide containing a natural amino acid having a cationic side chain, and n2=2‐20;
    优选地,所述有机寡阳离子选自精胺。Preferably, the organic oligocation is selected from spermine.
  22. 权利要求1至8中任一项所述的衔接体、权利要求9所述的构建体、权利要求10所述的复合物、权利要求11-19中任一项所述方法、或权利要求20所述的试剂盒在制备用于表征分析物的产品或在表征分析物中的应用。 Use of the adapter of any one of claims 1 to 8, the construct of claim 9, the complex of claim 10, the method of any one of claims 11-19, or the kit of claim 20 in the preparation of a product for characterizing an analyte or in characterizing an analyte.
PCT/CN2023/136344 2022-12-13 2023-12-05 Adapter containing blocking elements used in combination, construct, method and use WO2024125342A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202211595915.2A CN116200477A (en) 2022-12-13 2022-12-13 Adaptors, constructs, methods and uses comprising a combination blocker
CN202211595915.2 2022-12-13

Publications (1)

Publication Number Publication Date
WO2024125342A1 true WO2024125342A1 (en) 2024-06-20

Family

ID=86508415

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2023/136344 WO2024125342A1 (en) 2022-12-13 2023-12-05 Adapter containing blocking elements used in combination, construct, method and use

Country Status (2)

Country Link
CN (1) CN116200477A (en)
WO (1) WO2024125342A1 (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116200477A (en) * 2022-12-13 2023-06-02 成都齐碳科技有限公司 Adaptors, constructs, methods and uses comprising a combination blocker

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107109485A (en) * 2014-10-10 2017-08-29 因维蒂公司 Method for the general blocking oligomer system and improved hybrid capture of multiple capture reaction
CN111615560A (en) * 2018-01-05 2020-09-01 牛津纳米孔技术公司 Methods for selecting polynucleotides based on enzyme interaction duration
CN114262735A (en) * 2021-12-15 2022-04-01 成都齐碳科技有限公司 Adaptors for characterising polynucleotides and uses thereof
CN114269947A (en) * 2019-05-22 2022-04-01 牛津纳米孔科技公开有限公司 Method
CN116200477A (en) * 2022-12-13 2023-06-02 成都齐碳科技有限公司 Adaptors, constructs, methods and uses comprising a combination blocker

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107109485A (en) * 2014-10-10 2017-08-29 因维蒂公司 Method for the general blocking oligomer system and improved hybrid capture of multiple capture reaction
CN111615560A (en) * 2018-01-05 2020-09-01 牛津纳米孔技术公司 Methods for selecting polynucleotides based on enzyme interaction duration
CN114269947A (en) * 2019-05-22 2022-04-01 牛津纳米孔科技公开有限公司 Method
CN114262735A (en) * 2021-12-15 2022-04-01 成都齐碳科技有限公司 Adaptors for characterising polynucleotides and uses thereof
CN116200477A (en) * 2022-12-13 2023-06-02 成都齐碳科技有限公司 Adaptors, constructs, methods and uses comprising a combination blocker

Also Published As

Publication number Publication date
CN116200477A (en) 2023-06-02

Similar Documents

Publication Publication Date Title
US11649490B2 (en) Method of target molecule characterisation using a molecular pore
US11236385B2 (en) Method for characterising a double stranded nucleic acid using a nano-pore and anchor molecules at both ends of said nucleic acid
CN116334200B (en) Method for determining polynucleotide characteristics using nanopores and complexes for use in the method
EP3619224B1 (en) Transmembrane pore consisting of two csgg pores
JP6637040B2 (en) Method
AU2012264497B2 (en) Coupling method
US20240076729A9 (en) Method
CN109196116B (en) Method for characterizing target polynucleotide
EP3108010A1 (en) Sample preparation method
WO2024125342A1 (en) Adapter containing blocking elements used in combination, construct, method and use
WO2023143247A1 (en) Linker for characterizing sequencing of target polynucleotide, construct, method, and use
WO2024109455A1 (en) Rna-dna chimeric adapter and use thereof
CN118574941A (en) method
WO2024131530A1 (en) Preparation method for sequencing library
CN115747211A (en) Design and application of sequencing joint for nanopore sequencing
WO2024236325A1 (en) Method and products for characterizing a polynucleotide using a nanopore

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 23902539

Country of ref document: EP

Kind code of ref document: A1