[go: up one dir, main page]

WO2025011502A1 - Method for preparing circular rna and nucleic acid sequence for method - Google Patents

Method for preparing circular rna and nucleic acid sequence for method Download PDF

Info

Publication number
WO2025011502A1
WO2025011502A1 PCT/CN2024/104101 CN2024104101W WO2025011502A1 WO 2025011502 A1 WO2025011502 A1 WO 2025011502A1 CN 2024104101 W CN2024104101 W CN 2024104101W WO 2025011502 A1 WO2025011502 A1 WO 2025011502A1
Authority
WO
WIPO (PCT)
Prior art keywords
less
tag
fragment
rna
nucleotides
Prior art date
Application number
PCT/CN2024/104101
Other languages
French (fr)
Chinese (zh)
Inventor
张卫国
董翊洁
陈华
Original Assignee
仁景(苏州)生物科技有限公司
仁景国际香港有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 仁景(苏州)生物科技有限公司, 仁景国际香港有限公司 filed Critical 仁景(苏州)生物科技有限公司
Publication of WO2025011502A1 publication Critical patent/WO2025011502A1/en

Links

Classifications

    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61KPREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
    • A61K31/00Medicinal preparations containing organic active ingredients
    • A61K31/70Carbohydrates; Sugars; Derivatives thereof
    • A61K31/7088Compounds having three or more nucleosides or nucleotides
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61PSPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
    • A61P11/00Drugs for disorders of the respiratory system
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61PSPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
    • A61P25/00Drugs for disorders of the nervous system
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61PSPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
    • A61P3/00Drugs for disorders of the metabolism
    • A61P3/04Anorexiants; Antiobesity agents
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61PSPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
    • A61P3/00Drugs for disorders of the metabolism
    • A61P3/08Drugs for disorders of the metabolism for glucose homeostasis
    • A61P3/10Drugs for disorders of the metabolism for glucose homeostasis for hyperglycaemia, e.g. antidiabetics
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61PSPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
    • A61P31/00Antiinfectives, i.e. antibiotics, antiseptics, chemotherapeutics
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61PSPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
    • A61P35/00Antineoplastic agents
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61PSPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
    • A61P7/00Drugs for disorders of the blood or the extracellular fluid
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61PSPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
    • A61P9/00Drugs for disorders of the cardiovascular system
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K14/00Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • C07K14/435Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/10Processes for the isolation, preparation or purification of DNA or RNA
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/11DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
    • C12N15/113Non-coding nucleic acids modulating the expression of genes, e.g. antisense oligonucleotides; Antisense DNA or RNA; Triplex- forming oligonucleotides; Catalytic nucleic acids, e.g. ribozymes; Nucleic acids used in co-suppression or gene silencing
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/11DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
    • C12N15/52Genes encoding for enzymes or proenzymes
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/79Vectors or expression systems specially adapted for eukaryotic hosts
    • C12N15/85Vectors or expression systems specially adapted for eukaryotic hosts for animal cells
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N5/00Undifferentiated human, animal or plant cells, e.g. cell lines; Tissues; Cultivation or maintenance thereof; Culture media therefor
    • C12N5/10Cells modified by introduction of foreign genetic material

Definitions

  • the present invention relates to methods and nucleic acid sequences for preparing and purifying circular RNA.
  • linear mRNA drugs face major challenges in vivo, such as low stability and short half-life. Since circular RNA does not contain free ends, it can effectively avoid degradation by exonuclease in cells, thus having significantly higher stability than linear mRNA molecules (see non-patent literature 1), providing an effective candidate solution to the above challenges.
  • Circular RNA is a type of covalently closed circular RNA molecule. Since the genome of plant viroids was first discovered as a circular RNA molecule in 1976, hundreds of natural circular RNAs have been identified in viruses (such as hepatitis D virus HDV) and eukaryotic cells (such as mammalian cells), and it has been found that these molecules perform a variety of different biological functions in vivo, including but not limited to encoding proteins, regulating gene transcription, acting as microRNA sponges, protein scaffolds, etc. (see non-patent literature 2).
  • viruses such as hepatitis D virus HDV
  • eukaryotic cells such as mammalian cells
  • the ribozyme method is one of the main ways to synthesize circular RNA in the current field of nucleic acid drugs. It uses the PIE system (Permuted Intron-Exon System) established by modifying type I or type II self-splicing introns to prepare circular RNA. It is reported that the splicing of type I introns does not depend on any protein.
  • PIE system Permuted Intron-Exon System
  • splicing can be achieved through a two-step transesterification reaction: in the first step, guanosine attacks the 5' splicing site (5'SS), and the 3' hydroxyl group of guanosine undergoes an ester exchange reaction at this site, causing splicing between the 5' exon and the 5'intron; in the second step, the 3'-OH of the intermediate formed after the first reaction attacks the 3' splicing site (3'SS), and a second ester exchange reaction is carried out at this site, resulting in the exons on both sides of the intron being directly connected, and the 5' end of the intron itself is cyclized with the 3' end and then sheared off.
  • 5'SS 5' splicing site
  • 3' hydroxyl group of guanosine undergoes an ester exchange reaction at this site, causing splicing between the 5' exon and the 5'intron
  • the 3'-OH of the intermediate formed after the first reaction attacks the 3' s
  • the natural type I self-splicing intron is divided into two, and the arrangement order of "exon 1-intron fragment 1-intron fragment 2-exon 2" is artificially replaced with the arrangement of "intron fragment 2-exon 2-exon 1-intron fragment 1", and the circular RNA obtained by cyclization of the "exon 2-exon 1" fragment can be prepared in the presence of Mg 2+ and guanosine.
  • non-patent documents 3 and 4 respectively reported the PIE system established by transforming the type I self-splicing intron of Anabaena (fish algae) pre-Trna or T4 phage Td gene through the above replacement method.
  • Non-patent document 5 also reports the optimization of the PIE system based on Anabaena pre-tRNA type I intron ribozyme, which can significantly improve the in vitro cyclization efficiency by adding external homology arms, internal homology arms, etc., and shorten the length of the cyclizable RNA fragment to 1.
  • the length of circular RNA was increased to ⁇ 5kb, laying a good foundation for the synthesis of circular RNA drugs.
  • RNA byproducts including but not limited to: introns on both sides that are cut off after the circularization reaction, linear RNA precursors that have not yet been circularized, and linear and circular high molecular weight polymerized RNA produced by polymerization of multiple linear precursors.
  • introns on both sides that are cut off after the circularization reaction
  • linear RNA precursors that have not yet been circularized linear and circular high molecular weight polymerized RNA produced by polymerization of multiple linear precursors.
  • the prior art has reported that the presence of these byproducts has various adverse effects on the final circular RNA biological products, which not only reduces the purity and titer of the products, but also significantly increases the immunogenicity in vivo (see non-patent document 2). Therefore, there is an urgent need for a method that can efficiently prepare high-purity circular RNA molecules, especially a preparation method that can meet the production level of industrialization.
  • the existing preparation methods for separating byproducts from circular RNA mainly utilize the difference in molecular weight between the two, or the difference in physical properties between linear and circular structures, such as size exclusion chromatography (SEC) or high performance liquid chromatography (HPLC) (see non-patent literature 2 to 6, patent literature 1).
  • SEC size exclusion chromatography
  • HPLC high performance liquid chromatography
  • the conventional practice in the industry is to perform RNase R digestion before SEC or HPLC to remove the above impurities.
  • SEC and HPLC themselves have common problems such as low loading capacity, high cost, complex process, and low resolution. After adding additional enzyme treatment steps, it is even more difficult to simultaneously achieve improvements in yield, efficiency, and purity, let alone industrial-level circular RNA production.
  • Patent document 2 reports a method for purifying linear mRNA with a poly(A) structure using column chromatography using oligo dT as an affinity ligand. Although patent document 2 also mentions that poly(A) sequences can be artificially added to circular RNA molecules, the patent method adds poly(A) to the circular region of the RNA molecule, so theoretically it is impossible to simultaneously separate all self-splicing byproducts and target circular RNA molecules in one column chromatography step, especially it is impossible to separate linear RNA precursors, nicked RNA and target circular RNA molecules that also have poly(A) sequences and have not yet formed a circle.
  • Patent document 3 reports another method for purifying circular RNA, which includes the steps of adding a poly(A) tail to linear RNA mixed with circular RNA, and then removing the RNA containing the poly(A) tail using oligo(dT)25-coupled magnetic beads.
  • the main purpose of this method is to separate natural circular RNA and its corresponding linear transcript fragments, rather than various byproducts in the synthesis process of circular RNA.
  • Patent document 4 reports a method of incorporating at least one purification tag into the 3' or 5' end of a linear precursor RNA, thereby enabling the use of antisense oligo affinity purification of circular RNA, but Patent document 4 has clearly excluded the use of poly(A) and its variants as purification tags as a whole.
  • poly(A) is an essential element constituting the circular RNA of Patent document 4. If it is also inserted as a tag into the 3' or 5' end of the linear RNA precursor, it is obvious that according to the design principle of affinity purification, it will not be possible to distinguish between the desired isolated circular RNA and the uncircularized impurities.
  • Patent document 4 also clearly states that "as a result of self-splicing, the corresponding circularized RNA no longer contains 3' and/or 5' end purification tags". Therefore, Patent document 4 actually provides technical teachings that are completely opposite to the present invention.
  • Patent Document 4 also describes the use of SEQ ID NO: 208 (tctttaccctcgtcttgacg) and 209 (tatgctgttatccgtcgatt) as oligos for affinity purification of circular RNA.
  • FIG. 6 of the document shows that the circular RNA The enrichment effect occurs after the step of separating the target circular RNA from the impurities that are not completely circularized.
  • adding a purification tag to the 3' or 5' end does not directly improve the circularization reaction efficiency and circular RNA purity.
  • Non-patent literature 1 Chen et al., Frontiers in Bioengineering and Biotechnology (2021). 9: 787881;
  • Non-patent literature 2 Liu et al., Molecular Cell (2022), 82: 1-15;
  • Non-patent literature 3 Been et al., Nucleic Acids Research (1992). 20: 5357-5364;
  • Non-patent literature 6 Wesselhoeft et al., Molecular Cell (2019). 74: 508-520;
  • Patent document 1 WO2019/236673A1;
  • Patent document 2 CN114381454A;
  • Patent document 3 CN110283895A;
  • Patent document: 4 WO2023/073228A1.
  • the inventors have found through in-depth research that by inserting a tag sequence into the two end arms of the linear RNA precursor used to prepare circular RNA, the specific binding of the tag sequence to the corresponding affinity ligand can be utilized to separate the prepared target circular RNA molecule from various RNA byproducts generated during the preparation process using affinity chromatography.
  • the method of the present invention can use an affinity chromatography process that is easy to operate and industrialized, and can achieve industrial production while reducing production costs.
  • the circular RNA prepared by the method of the present invention substantially does not contain linear RNA byproducts, providing excellent active pharmaceutical ingredients for the development of circular RNA drugs.
  • the present invention also screened the linear RNA precursor and its encoding DNA for suitable insertion of purification tags. The results showed that after the purification tag was inserted into a specific region of the encoding DNA of the linear RNA precursor, the yield of the linear RNA precursor produced by in vitro transcription (IVT) of the encoding DNA would not be reduced.
  • the present invention provides a method for preparing circular RNA, the method comprising:
  • Step A generating a linear RNA precursor, wherein the linear precursor comprises a 5' end arm, a 3' self-splicing site, a circularization region, a 5' self-splicing site and a 3' end arm in a manner operably connected to each other in sequence from 5' to 3' direction, wherein a first tag is inserted into the 5' end arm and a second tag is inserted into the 3' end arm;
  • Step B placing the linear RNA precursor under conditions suitable for self-splicing of the 3' self-splicing site and the 5' self-splicing site, to obtain a mixture comprising a linear RNA fragment with the first tag and/or the second tag and a circular RNA obtained by cyclization of the cyclization region;
  • Step C contacting the mixture with an affinity chromatography medium capable of simultaneously binding the first tag and the second tag for a period of time sufficient to allow the affinity chromatography medium to bind to the linear RNA fragment containing at least one of the tags;
  • Step D separating the affinity chromatography medium and the mixture after contacting the affinity chromatography medium, collecting the supernatant, and obtaining the circular RNA.
  • the functional variant of the poly(A) tag used in the method of the present invention is to insert one or more non-A bases into the poly(A) tag, preferably insert 1 to 20 non-A bases, and more preferably insert 1 to 10 non-A bases.
  • functional variants of the poly(A) tag used in the methods of the present invention include the following:
  • the element a is composed of more than 20 consecutive adenine nucleotides
  • the element b is composed of more than 3 and less than 20 consecutive adenine nucleotides
  • the element c is composed of a nucleotide selected from uracil nucleotides, cytosine nucleotides, and guanine nucleotides
  • the element d is composed of more than 2 and less than 20 nucleotides
  • the nucleotides are arbitrarily selected from adenine nucleotides, uracil nucleotides, cytosine nucleotides, and guanine nucleotides
  • the element d does not contain more than 3 consecutive adenine nucleotides
  • the 5' and 3' terminal nucleotides are not adenine nucleotides
  • the Poly(A) tag contains two or more of the element b, the element c or the element d at the same time
  • the sequences of every two elements b may be the same or different
  • the sequences of every two elements c may be the same or different
  • the sequences of every two elements d may be the same or different.
  • the element a and the element b, the element c and the element d, the elements b, the elements c, and the elements d are not adjacent to each other.
  • element a used in the method of the present invention consists of more than 20 and less than 80 consecutive adenine nucleotides, preferably consists of 30 to 70, 35 to 65, 40 to 60, or 45 to 55 consecutive adenine nucleotides, and more preferably consists of 60 consecutive adenine nucleotides.
  • the element b used in the method of the present invention consists of 3 to 10, 10 to 19, 12 to 15, 14 nt to 17, or 16 to 19, preferably 19 consecutive adenine nucleotides. In some embodiments, the number of element b used in the method of the present invention is 2 to 10, preferably 2 to 5, and more preferably 3.
  • the element c used in the method of the present invention is a guanine nucleotide. In some embodiments, the number of the element c used in the method of the present invention is 2 to 10, 3 to 8, 4 to 6, or 2 to 5, preferably 2.
  • the element d used in the method of the present invention consists of 3 to 18, 5 to 16, 4 to 10, or 6 to 12 nucleotides, preferably 6 nucleotides.
  • the element d used in the method of the present invention can be independently selected from any one of GAUAUC, GUAUAC, GAAUCU, GCAUAUGACU or GAUAUCGUAUAC.
  • the number of elements d used in the method of the present invention is 0 to 5, preferably 1 to 3, and more preferably 1.
  • the total number of element c and element d used in the method of the present invention is 2 to 15, preferably 3 to 5, and more preferably 3.
  • the functional variant of the poly(A) tag used in the method of the present invention has any one structure selected from the following structures: element a-element c-element b-element c-element b-element c-element b-element c-element b, element b-element c-element b-element c-element a-element d-element b-element c-element b-element c-element b, element b-element c-element b-element c-element b-element d-element a-element c, element a-element d-element b-element c-element b-element c-element b-element b, or, element b-element c-element b-element c-element b-element d-element a.
  • the functional variant of the poly (A) tag used in the method of the present invention may further comprise an element e, wherein the element e is composed of one or two consecutive adenine nucleotides, wherein the element e is located at the 3' end of the Poly (A) tag sequence and is adjacent to the element d or the element c.
  • the linear RNA precursor in the method of the present invention comprises a 5' external homology arm and a 3' intron fragment in the 5' end arm in the 5' to 3' direction
  • the first tag is inserted in the 5' external homology arm, or inserted in the 5' terminal region of the 3' intron fragment near the 5' external homology arm
  • the 5' terminal region is preferably 20 nucleotides at the 5' end of the 3' intron fragment, more preferably 15 nucleotides, and further preferably 10 nucleotides, or inserted at the 5' end of the 5' external homology arm.
  • the linear RNA precursor in the method of the present invention also comprises a 5' intron fragment and a 3' external homologous arm in the 3' end arm in the 5' to 3' direction, and the second tag is inserted in the 3' external homologous arm, or inserted in the 3' terminal region of the 5' intron fragment close to the 3' external homologous arm, the 3' terminal region is preferably 20 nucleotides at the 3' end of the 5' intron fragment, more preferably 15 nucleotides, further preferably 10 nucleotides, or inserted downstream of the 3' end of the 3' external homologous arm.
  • the first tag is inserted at any position in the 5' external homology arm, and the second tag is inserted downstream of the 3' end of the 3' external homology arm (for example, the position immediately adjacent to the last nucleotide residue at the 3' end of the 3' external homology arm).
  • the first tag is inserted at any position in the 5' external homology arm, and the second tag is inserted at any position in the 3' external homology arm.
  • the first tag is inserted at any position in the 5' external homology arm, and the second tag is inserted at any position in the 3' terminal region of the 5' intron fragment.
  • the first tag is inserted at any position in the 5' external homology arm except the first nucleotide residue at the 5' end. In some preferred embodiments, the first tag is inserted at any position in the 5' external homology arm except the first nucleotide residue at the 5' end, and the second tag is inserted at any position in the 3' external homology arm. In some preferred embodiments, the first tag is inserted at any position in the 5' external homology arm except the first nucleotide residue at the 5' end, and the second tag is inserted at any position in the 3' external homology arm. In some preferred embodiments, the first tag is inserted at any position in the 5' external homology arm except the first nucleotide residue at the 5' end, and the second tag is inserted at any position in the 3' terminal region of the 5' intron fragment.
  • the first tag is inserted at the 5' outer homology arm from the first nucleotide residue at the 5' end, in the 5' to 3' direction, at the 1st, 2nd, 3rd, 4th, 5th, 6th, 7th, 8th, 9th, 10th, 11th, 12th, 13th, 14th, 15th, 16th, 17th, 18th, 19th, or 20th residue position.
  • the first tag is inserted at the 5' outer homology arm from the first nucleotide residue at the 5' end, at the 2nd, 3rd, 4th, 5th, 6th, 7th, 8th, 9th, 10th, 11th, 12th, 13th, 14th, 15th, 16th, 17th, 18th, 19th, or 20th residue position.
  • the first tag is inserted at the 5' outer homology arm from the first nucleotide residue at the 5' end, at the 2nd, 3rd, 4th, 5th, 6th, 7th, 8th, 9th, 10th, or 11th residue position.
  • inserted at position n or "inserted at residue position n” means that the 5'-most and 3'-most nucleotide residues of the described insertion sequence are covalently linked to the residues at positions n and n+1, respectively, before insertion.
  • inserted at position 1 means that the inserted sequence is located between the original residues at positions 1 and 2.
  • the second tag is inserted in the 3' outer homology arm from the last nucleotide residue at the 3' end, in the 3' to 5' direction at the 1st, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19 or 20th residue position.
  • the second tag is inserted in the 3' outer homology arm from the last nucleotide residue at the 3' end, in the 3' to 5' direction at the 1st, 2, 3, 4, 5, 6, 7, 8, 9, 10 or 11th residue position.
  • the second tag is inserted in the 5' intron fragment from the last nucleotide residue at the 3' end, in the 3' to 5' direction at the 1st, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19 or 20th residue position.
  • the second tag is inserted into the 5' intron fragment at the 1st, 2nd, 3rd, 4th, 5th, 6th, 7th, 8th, 9th, 10th or 11th residue position in the 3' to 5' direction starting from the last nucleotide residue at the 3' end.
  • the circularization region comprises a 3' coding region fragment, a translation initiation element, and a 5' coding region fragment in a manner that is operably connected to each other in the 5' to 3' direction.
  • the circularization region comprises a 3' exon fragment, a 5' internal homology arm, an insert fragment, a 3' internal homology arm, and a 5' exon fragment in a manner that is operably connected to each other in the 5' to 3' direction.
  • the circularization region comprises a first spacer between the insert fragment and the 5' internal homology arm, and a second spacer between the insert fragment and the 3' internal homology arm.
  • the insert fragment comprises a translation initiation element, or comprises a translation initiation element and a coding region.
  • the translation initiation element is an IRES sequence.
  • the IRES sequence can be selected from, but not limited to, the following IRES sequences: Taura syndrome virus, blood-sucking assassin bug virus, Theile's encephalomyelitis virus, simian virus 40, fire ant virus 1, cereal aphid virus, reticuloendotheliosis virus, Forman polio virus 1, soybean looper virus, Kashmir bee virus, human rhinovirus 2, glass leafhopper virus-1, human immunodeficiency virus type 1, glass leafhopper virus-1, lice P virus, hepatitis C virus, hepatitis A virus, GB hepatitis virus, foot-and-mouth disease virus, human enterovirus 71, equine rhinovirus, tea looper virus Like virus, encephalomyocarditis virus (EMCV), Drosophila C
  • EMCV encephalomyo
  • the inserted fragment comprises the coding sequence of a structural gene or its functional fragment or the sequence of a non-coding RNA or its complementary sequence
  • the structural gene is selected from a polypeptide, a protein subunit, a protein active center, a protein or a protein hybrid of a non-natural catalytic group, a recombinant protein active subunit or active center/, a recombinant artificial enzyme or other biological effector units mainly composed of amino acids
  • the non-coding RNA is selected from microRNA (miRNA), small interfering RNA (siRNA), PIWI protein-interacting RNA (piRNA), transfer RNA-derived small RNA (tsRNA), small nuclear RNA (snRNA), small nucleolar RNA (snoRNA), long non-coding RNA (lncRNA), pseudogene, ceRNA (competing endogenous RNAs), microRNA sponge or other types of non-mRNA RNA.
  • the length of the 5' and 3' outer homology arms is each independently greater than 5 nt, greater than 10 nt, greater than 15 nt, greater than 20 nt, greater than 25 nt, greater than 30 nt, greater than 40 nt, greater than 50 nt, greater than 60 nt, less than 5 nt, less than 10 nt, less than 15 nt, less than 20 nt, less than 25 nt, less than 30 nt, less than 40 nt, less than 50 nt, less than 60 nt, 5-60 nt, 10-55 nt, 15-50 nt, 20-45 nt, 25-40 nt, 30-35 nt, 10 nt, 15 nt, 20 nt, 25 nt, 30 nt, 35 nt, 40 nt, 45 nt, or 50 nt.
  • the length of the 5' and 3' intron fragments is each independently greater than 5 nt, greater than 10 nt, greater than 15 nt, greater than 20 nt, greater than 25 nt, greater than 30 nt, greater than 40 nt, greater than 50 nt, greater than 60 nt, less than 5 nt, less than 10 nt, less than 15 nt, less than 20 nt, less than 25 nt, less than 30 nt, less than 40 nt, less than 50 nt, less than 60 nt, less than 70 nt, less than 80 nt, less than 90 nt, less than 100 nt, less than 150 nt, less than 20 nt, less than 25 nt, less than 30 nt, less than 40 nt, less than 50 nt, less than 60 nt, 0nt, less than 90nt, less than 100nt, less than 150nt, less than 150 nt, less than 20 n
  • the lengths of the 5' and 3' internal homology arms are each independently greater than 5 nt, greater than 10 nt, greater than 15 nt, greater than 20 nt, greater than 25 nt, greater than 30 nt, greater than 40 nt, greater than 50 nt, greater than 60 nt, less than 5 nt, less than 10 nt, less than 15 nt, less than 20 nt, less than 25 nt, less than 30 nt, less than 40 nt, Less than 50nt, less than 60nt, 5-60nt, 10-55nt, 15-50nt, 20-45nt, 25-40nt, 30-35nt, 10nt, 15nt, 20nt, 25nt, 30nt, 35nt, 40nt, 45nt, or 50nt.
  • the length of the insert is greater than 50 nt, greater than 100 nt, greater than 150 nt, greater than 200 nt, greater than 250 nt, greater than 300 nt, greater than 400 nt, greater than 500 nt, greater than 600 nt, greater than 1k nt, greater than 1.5k nt, greater than 2k nt, greater than 3k nt, less than 50 nt, less than 100 nt, less than 150 nt, less than 200 nt, less than 250 nt, less than 300 nt, less than 400 nt, less than 500 nt, less than 600 nt, less than 600 nt, less than 1k nt, less than 1.5k nt, less than 2k nt, less than 3k nt, 50-5k nt, 50-5k nt, 50-4k nt, 50-3k nt, 50-2k nt, 50-1.5k nt, 50-1k
  • the 3' intron fragment and the 5' intron fragment are both from type II introns, preferably from type II introns of Clostridium such as Clostridium tetani, or type II introns of Bacillus such as Bacillus thuringiensis.
  • the 3' intron fragment and the 5' intron fragment are from the type II intron contained in the nucleotide sequence shown in SEQ ID NO:5 or 6.
  • the 3' exon fragment and the 5' exon fragment are from the 3' terminal region and the 5' terminal region of the natural exon, respectively, preferably from the cyanobacterium Anabaena pre-tRNA gene or the T4 phage Td gene.
  • the 3' exon fragment has a sequence as shown in SEQ ID NO:7
  • the 5' exon fragment has a sequence as shown in SEQ ID NO:8
  • the 5' internal homology arm has a sequence as shown in SEQ ID NO:9
  • the 3' internal homology arm has a sequence as shown in SEQ ID NO:10.
  • the first spacer is the same as or different from the second spacer.
  • the affinity chromatography medium used in the method of the present invention is operably linked to an affinity ligand capable of specifically binding to the first tag and/or the second tag.
  • the affinity ligand is selected from the group consisting of poly X1, poly X1 - X2 , poly X1 - X2 - X3 , poly X1- X2 - X3 - X4 , wherein X1 , X2 , X3 , X4 are independently any one of A, G, C, T, U.
  • the affinity ligand is selected from any one of the group consisting of Oligo dT, Oligo dC, Oligo dG, Oligo dU.
  • the affinity chromatography medium used in the method of the present invention is selected from any one of the group consisting of magnetic beads, dextran molecules, polyacrylamide macromolecules, macromolecular cellulose molecules, chitosan materials, modified polylactic acid materials, PET materials, inorganic silicate materials or other high molecular polymers.
  • the methods of the present invention do not include the step of adding RNase R to the mixture comprising circular RNA to remove linear RNA.
  • the method of the present invention comprises adding RNase R to a mixture comprising circular RNA
  • the amount of RNase R added is reduced to 30-50%, preferably 40% or 50%, of the amount used for purification without the poly(A) tag or its functional variant.
  • the present invention provides a circular RNA prepared using the preparation method of the present invention, which substantially does not contain non-circular RNA molecules.
  • the circular RNA prepared using the preparation method of the present invention has a purity of greater than 90%, greater than 91%, greater than 92%, greater than 93%, greater than 94%, greater than 95%, greater than 96%, greater than 97%, greater than 98%, greater than 99%, greater than 99.5%, greater than 99.9% or higher.
  • the circular RNA prepared using the method of the present invention has a purity of more than 90%, more than 91%, more than 92%, more than 93%, more than 94%, more than 95%, more than 96%, more than 97%, more than 98%, more than 99%, more than 99.5%, more than 99.9% or more.
  • the content of non-ideal nucleic acids e.g., various intermediate complexes that are not completely cyclized but still retain elements such as ribozymes, etc.
  • the content of non-ideal nucleic acids is no more than 10%, 9%, 8%, 7%, 6%, 5%, 4%, 3%, 2%, 1%, 0.5%, 0.1% or less of the total RNA.
  • the present invention provides a linear RNA precursor for use in the methods of the present invention.
  • the present invention provides a nucleic acid sequence that can be transcribed to produce the linear RNA precursor of the present invention under conditions suitable for transcription.
  • the nucleic acid sequence of the present invention also comprises a regulatory sequence necessary for transcription to produce the linear RNA precursor in an operably linked manner, including but not limited to a promoter, a terminator, a transcription factor binding site, an untranslated region (UTR), an enhancer, a palindrome, a cis-acting element, a trans-acting element, a TATA box, a CAAT box, an operator, and a transposon.
  • the present invention provides a vector comprising the nucleic acid sequence of the present invention.
  • the vector of the present invention is a linear DNA, a plasmid, a viral nucleic acid fragment or a cell genomic DNA fragment.
  • the present invention provides an engineered cell comprising the linear RNA precursor, circular RNA, nucleic acid sequence, or vector of the present invention.
  • the present invention provides a composition comprising the linear RNA precursor, circular RNA, nucleic acid sequence, vector, or engineered cell of the present invention.
  • the present invention also provides the use of the linear RNA precursor, nucleic acid sequence, vector, or engineered cell of the present invention to prepare circular RNA.
  • the present invention also provides the use of the linear RNA precursor, circular RNA, nucleic acid sequence, vector, or engineered cell of the present invention for preparing a drug, a cytotoxic agent, or an immunomodulatory preparation.
  • the therapeutic drug, cytotoxic agent, or immunomodulatory preparation is selected from viruses, pluripotent or multipotent stem cells, iPS, engineered immune cells, antibodies or antibody fragments, antibodies or antibody fragments conjugated to drugs, chemotherapeutic agents, immunosuppressive or modulatory agents, anti-infective drugs, anticancer agents, hypoglycemic drugs, cardiovascular and cerebrovascular disease therapeutic drugs, degenerative neurological disease drugs, obesity therapeutic drugs, hematological disease therapeutic drugs, respiratory disease therapeutic drugs, or retroviral disease therapeutic drugs.
  • the present invention also provides a method for administering circular RNA, which comprises administering an effective amount of the circular RNA of the present invention to an organism in need thereof, or administering an effective amount of the circular RNA of the present invention using the nucleic acid sequence, vector, engineered cell or the like.
  • the circular RNA is prepared from cells or compositions.
  • FIG1 schematically shows a nucleic acid sequence of a linear RNA precursor for preparing circular RNA according to the present application, wherein the following elements are sequentially included in the 5′ to 3′ direction: a 5′ external homology arm, a 3′ intron fragment, a 3′ exon fragment, a 5′ internal homology arm, a first spacer, a miRNA binding region (micro RNA binding sites), a second spacer, a 3′ internal homology arm, a 5′ exon fragment, a 5′ intron fragment, and a 3′ external homology arm, wherein the locations of two self-splicing sites are schematically shown by vertical arrows;
  • FIG2 shows a schematic diagram of a linear RNA precursor for preparing circular RNA according to the present application
  • FIG3 shows a schematic diagram of another linear RNA precursor for preparing circular RNA according to the present application
  • FIG4 shows a schematic diagram of another linear RNA precursor for preparing circular RNA according to the present application.
  • Figures 5A and 5B show gel electrophoresis images of the cyclization reaction products of the linear RNA precursor according to some embodiments of the present invention and the products after RNase R digestion.
  • M represents the Riboruler low molecular weight RNA reference
  • CK represents the comparative example 1 without poly (A) insertion
  • FIG6 shows the relative IVT yields of the cyclization reaction products of the linear RNA precursors according to Preparation Examples 6 to 24 of the present application.
  • the IVT yield of each preparation example is expressed as a multiple of the control IVT yield and subjected to One-way ANOVA statistical analysis. **** indicates P ⁇ 0.0001;
  • FIG7 shows the relative IVT yields of the cyclization reaction products of the linear RNA precursors according to Preparation Examples 25 to 29 of the present application.
  • the IVT yield of each preparation example is expressed as a multiple of the control IVT yield and subjected to One-way ANOVA statistical analysis. *** indicates P ⁇ 0.001, **** indicates P ⁇ 0.0001;
  • Figures 8A to 8D show HPLC graphs of the cyclization reaction products of the linear RNA precursor according to the present application before and after RNase R enzymatic cleavage treatment.
  • the term “comprising” or “including” means that the sequence, composition and method include the described components or steps, but do not exclude other components or steps. "Mainly composed of", when used to define sequences, compositions and methods, should mean excluding any other components or other steps that are obviously important for the technical effect it should achieve. "Composed of" should mean excluding other components and steps not mentioned.
  • linear RNA precursor or “linear precursor” is used interchangeably and refers to an RNA precursor that is not covalently closed in a circle but can generate circular RNA during the circularization process, which is generally transcribed from a template DNA.
  • the linear RNA precursor comprises a complete circular RNA sequence that has not yet been circularized, and a self-splicing sequence (e.g., a self-splicing site, an intron ribozyme fragment, and a homology arm, etc.) required for the circularization of the RNA sequence.
  • a self-splicing sequence e.g., a self-splicing site, an intron ribozyme fragment, and a homology arm, etc.
  • the linear RNA precursor of the present invention can be used to produce circular RNA by incubation in the presence of magnesium ions and guanosine nucleotides or nucleosides at a temperature (e.g., between 20° C. and 60° C.) at which RNA circularization occurs.
  • a temperature e.g., between 20° C. and 60° C.
  • the linear RNA precursor includes the following elements: 5' external homology arm, first tag, 3' intron fragment, 3' self-splicing site, 3' exon fragment, 5' internal homology arm, inserted fragment, 3' internal homology arm, 5' exon fragment, 5' self-splicing site, 5' intron fragment, second tag and 3' external homology arm.
  • the 3' intron fragment, 3' self-splicing site, 3' exon fragment located 5' upstream of the inserted fragment, and the 5' exon fragment, 5' self-splicing site, 5' intron fragment located 3' downstream of the inserted fragment play a key role in the process of forming circular RNA by splicing of linear RNA precursor.
  • the 3' self-splicing site connecting the 3' intron fragment and the 3' exon fragment is broken under GTP initiation, and the break site can further trigger the break of the 5' self-splicing site connecting the 5' intron fragment and the 5' exon fragment, so that the 5' exon fragment and the 3' exon fragment are connected at the self-splicing site to form a circular RNA.
  • the presence of the external homology arm and the internal homology arm in pairs can improve the self-splicing efficiency and the cyclization efficiency in the above-mentioned cyclization reaction.
  • the nucleotides in the linear RNA precursor of the present invention may be unmodified natural nucleotides or partially or completely modified non-natural nucleotides.
  • the linear RNA precursor of the present invention comprises only naturally occurring nucleotides.
  • the linear RNA precursor of the present invention comprises one or more modifications that can increase stability, such as 2'-O-methyl, fluorine or O-methoxyethyl conjugates, thiophosphate backbones or 2',4'-cyclic 2'-O-ethyl modifications (Holdt et al., Front Physiol., 9: 1262 (2016); Krutzfeldt et al., Nature, 438 (7068): 685-9 (2005); Crooke et al., Cell Metab 27 (4): 714-739 (2016)), and/or one or more modifications that can reduce the innate immunogenicity of the circular RNA molecule in the host, such as at least one N6-methyladenosine (m 6 A)
  • linear RNA fragment refers to all RNA products derived from the linear RNA precursor after the linear RNA precursor undergoes a cyclization reaction, except for the target circular RNA molecule produced thereby, including but not limited to: the self-splicing sequences on both sides that are sheared off after the cyclization reaction, and the various splicing intermediate sequences produced during the cyclization reaction. Since the linear RNA precursor molecules may not all undergo a cyclization reaction, the "linear RNA fragment” may also include the linear RNA precursor sequence remaining in the cyclization reaction product.
  • the length of the "linear RNA fragment” is not necessarily less than the linear RNA precursor from which it is derived, because the term also covers linear and circular high molecular weight polymerized RNA produced by polymerization of multiple identical or different sequences mentioned above.
  • the term “linear RNA fragment” has substantially the same meaning as the "byproduct” or “impurity” produced by the cyclization reaction.
  • circular RNA As used herein, the terms “circular RNA” or “circRNA” are used interchangeably and refer to polyribonucleotides that form a closed circular structure through covalent bonds. It has been reported that circular RNA is a 3-5' covalently closed RNA ring without a 5' end cap and a 3' end poly (A) tail. Due to the lack of free ends required for exonuclease-mediated degradation, it has the property of resisting RNA enzyme degradation and has a longer life or half-life than ordinary linear RNA products (e.g., mature mRNA).
  • ordinary linear RNA products e.g., mature mRNA
  • Circular RNA can be produced by a splicing process, and cyclization mainly occurs at the annotated exon boundaries using conventional splicing sites (Starke et al., 2015; Szabo et al., 2015).
  • a circular RNA is a single-stranded RNA molecule.
  • the circular RNA molecule herein can have any structure suitable for circular RNA known in the art, but does not contain the same sequence as the first tag and the second tag present in the circular RNA precursor.
  • RNA nicking refers to the phenomenon of single-point breakage of the RNA chain.
  • metal ions especially Mg2+
  • the presence of metal ions (especially Mg2+) in the reaction system causes random breakage of circular RNA, thereby generating nicked RNA, which has the same molecular weight as circular RNA and is easily digested by RNase.
  • intron fragment refers to, for example, a sequence having 75% or higher similarity to a natural type I or type II intron ribozyme (Ribozyme or intronribozyme) or its main active fragment, or a small ribozyme (e.g., satellite RNA, mainly including hammerhead ribozymes, hairpin ribozymes, hepatitis D virus (HDV) RNA, Varkud satellite (VS) ribozyme, and GlmS riboswitch).
  • satellite RNA mainly including hammerhead ribozymes, hairpin ribozymes, hepatitis D virus (HDV) RNA, Varkud satellite (VS) ribozyme, and GlmS riboswitch.
  • an exemplary type I intron ribozyme can be a natural intron self-cleaving ribozyme sequence of Anabaena (whose accession number is included in the GenBank database (GenBank: AY768517), with a total sequence length of 313 bases), or a main active fragment of the sequence (e.g., a sequence as described in SEQ ID NO.2 in CN115786374A, with a total sequence length of 246 bases), or an equivalent of the above ribozyme with various base substitutions or truncations.
  • Exemplary type II ribozymes can be derived from yeast mitochondrial DNA (as described in the following references: Zimmerly et al., Mob DNA, 2015), the Ll.LtrB intron of Lactococcus lactis, the TeI3c/4c type II intron of Thermosynechococcus elongatus (as described in the following references: Monat et al., PloS One, 2020; Costa et al., Sscience, 2016), type II introns of Clostridium such as Clostridium tetani, type II introns of Bacillus such as Bacillus thuringiensis, or the commercial ribozyme tool Targetron.
  • the term "3' intron fragment” refers to a sequence having 75% or higher similarity to the 3' terminal region of the Anabaena ribozyme
  • the term “5' intron fragment” refers to a sequence having 75% or higher similarity to the 5' terminal region of the Anabaena ribozyme, as long as the two intron fragments can be close to each other in the spatial structure of the RNA to form a complex with complete ribozyme activity (as described in the following reference: Wesselhoeft et al., Nature Communication, 2018).
  • a person skilled in the art can determine the sequences that can serve as the "3' intron fragment” and the "5' intron fragment” in the natural intron ribozyme.
  • exon fragment may refer to an exon recognized and spliced by an intron ribozyme in the intron-exon (PIE) system of a ribozyme, or a fragment of a signal sequence recognized by an intron ribozyme.
  • PIE intron-exon
  • a "3' exon fragment” may refer to a fragment of a signal sequence recognized by an intron ribozyme.
  • a "5' exon fragment” can represent a sequence having 75% or higher similarity with the 5' end region of the exon recognized and cleaved by the Anabaena ribozyme, as long as the two exon fragments are reversed in direction, can be recognized by the ribozyme and undergo self-splicing (as described in the following reference: Wesselhoeft et al., Nature Communication, 2018).
  • a person skilled in the art can determine the sequences in the exons of natural genes that can serve as "3' exon fragments" and "5' exon fragments”.
  • self-splicing site or “splice site” refers to the position between dinucleotides where cleavage occurs during the RNA circularization reaction and generates free ends for circularization.
  • circularization region refers to a region that is included in the formed circular RNA but not in the sheared RNA after linear RNA is prepared into circular RNA in vitro or circular RNA is formed in vivo by splint-mediated method, permuted intron-exon method, RNA ligase-mediated method or other methods, and is irrelevant to the RNA circularization method, the structure and function of the circularization intermediate.
  • the term "tag” refers to a sequence that can be used to bind to a complementary oligonucleotide (which can be referred to as an "affinity ligand") on an affinity chromatography matrix, so that molecules containing the tag are specifically adsorbed to the affinity chromatography matrix and separated from molecules that do not contain the tag.
  • the nucleotides constituting the tag can be naturally occurring nucleotides or artificially modified non-natural nucleotides.
  • first, second or similar expressions used here and in the text of this article are only intended to distinguish the two elements in a specific category, and do not indicate the importance, order, etc. of the elements; the “first” and “second” elements can be the same or different referents/concepts.
  • first tag and the “second tag” can represent the same tag (e.g., polyA) or different tags (e.g., polyA or its functional variants).
  • poly (A) tag has a meaning recognized and understood by a person of ordinary skill in the art, for example, it refers to a sequence composed of polyadenine nucleotides, which is generally located near the 3' terminal region or the 3' end (also commonly referred to as the poly (A) tail) in a linear messenger RNA molecule. It is generally believed that the poly (A) sequence at the 3' end generally protects the mRNA from 3' end degradation in a linear mRNA molecule and plays an important role in cap-dependent protein translation. Patent document 4 also reports that the poly (A) sequence in circular RNA has the effect of reducing its immunogenicity.
  • the poly (A) tag of the present invention is generally composed of about 20 to up to about 100 adenine nucleotides, preferably 30 to 90, more preferably 40 to 80, further preferably 45 to 70, and most preferably 50 to 65 consecutive adenine nucleotides.
  • the poly (A) tag of the invention is composed of 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58 , 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100 consecutive adenine nucleotides.
  • Poly(A) tag may also refer to a class of poly(A) tags comprising at least one non-A base, including but not limited to PCT application PCT/CN2023/079037, Chinese patent application CN112805386A, U.S. patent application Various poly (A) sequences described in US10717982B2. The above application or patent text is hereby incorporated in its entirety.
  • poly (A) sequences are also referred to herein as "functional variants of poly (A)", meaning that the above sequence consisting of continuous adenine nucleotides is interrupted by individual non-A nucleotides, but still has substantially the same functional activity (such as the effect on 3' terminal stability and/or translation activity) as the sequence consisting of continuous adenine nucleotides.
  • the "functional variant of poly(A)" in the present invention includes the following different situations:
  • element a is composed of multiple consecutive adenine (A) nucleotides, and the length range is ⁇ 20nt;
  • element b is composed of multiple consecutive A nucleotides, and in some embodiments, the length range of element b is 3nt ⁇ b ⁇ 20nt;
  • element c is composed of a non-A nucleotide, and the nucleotide is selected from thymine (T), cytosine (C), and guanine (G) nucleotides;
  • element d is composed of any two or more consecutive nucleotides, and the nucleotides are selected from A, T, C, and G nucleotides, wherein the nucleotides at the 5' and 3' ends of element d are not A nucleotides, and element d does not contain more than 3 consecutive A nucleotides, and the length range of element d is 2nt ⁇ d ⁇ 20nt;
  • element e is composed of one or two consecutive A, and
  • element a and element b in the poly(A) tag are not adjacent, element c and element d are not adjacent, elements b are not adjacent to each other, elements c are not adjacent to each other, and elements d are not adjacent to each other.
  • the Poly(A) tag further comprises a unique element e, which consists of one or two consecutive A's and is located at the 3' end of the Poly(A) tag and adjacent to element d or element c.
  • Poly(A) and its functional variants can be a segment of RNA or a hybrid molecule of DNA and RNA.
  • sequence structure of the Poly(A) tag is selected from:
  • binding refers to the reversible complementary pairing between nucleotide bases through intermolecular interaction forces (e.g., hydrogen bonds), or complementary pairing through chemical bonds, so that nucleic acid molecules can be reversibly paired under specific conditions or reactions.
  • a certain affinity can be achieved between nucleic acids and matrix materials (e.g., organic polymer materials or magnetic beads) carrying nucleic acids, or between matrix materials carrying nucleic acids.
  • a certain length (e.g., 50 nt) of polyA oligosingle-stranded RNA or a long single-stranded RNA containing the oligosingle-stranded polyA can "bind" to a dextran matrix carrying oligo dT under certain conditions by means of complementary base pairing, and optionally, the hydrogen bond between polyA and oligo dT can be opened by changing environmental conditions (e.g., temperature change, strong ion conditions, or strong acid-base conditions), thereby making the "binding" reversible, and further making the oligosingle-stranded RNA or the long single-stranded RNA containing the oligosingle-stranded polyA detached from the dextran matrix; in addition, optionally, an irreversible chemical bond can be formed between polyA and oligo dT under certain conditions, thereby making the RNA irreversibly bound to the dextran matrix.
  • environmental conditions e.g., temperature change, strong i
  • affinity chromatography medium refers to materials with oligonucleotides complementary to purification tags and affinity chromatography matrices.
  • Ideal oligonucleotides can be designed according to specific needs. For example, in the case where the tag is poly(A), the ideal oligonucleotide on the medium can be oligo dT.
  • Affinity chromatography matrices can be selected from stable materials that are unreactive with nucleotides or inert with nucleotide molecules, and there is no obvious mutual repulsion or mutual attraction between the molecular surface and the nucleotide molecules.
  • the matrix can be dextran molecules, polyacrylamide macromolecules, macromolecular cellulose molecules, chitosan materials, modified polylactic acid materials, PET materials, inorganic silicate materials, metal materials with coating layers, etc.
  • the oligonucleotides and affinity chromatography matrices are cross-linked by covalent bonds.
  • the oligonucleotides on the medium can also be replaced by other molecules, as long as such molecules have a preferred intermolecular force with the nucleotides.
  • the tag is an oligonucleotide composed of at least one of U, C and A
  • hypoxanthine can be cross-linked to the matrix to form a desired affinity chromatography medium.
  • the term "separation” refers to the separation of circular RNA products from linear RNA precursors, linear RNA fragments, various intermediate complexes formed by self-splicing processes, etc., thereby simply obtaining the effect of purification.
  • Techniques for purifying target polynucleotides and polypeptides are well known in the art and include, for example, ion exchange chromatography, affinity chromatography, and sedimentation according to density. Generally, a substance is purified when it is present in a sample in an amount greater than its naturally occurring amount relative to other components of the sample.
  • the circular RNA separated by affinity chromatography or affinity adsorption has a purity of greater than 90%, greater than 91%, greater than 92%, greater than 93%, greater than 94%, greater than 95%, greater than 96%, greater than 97%, greater than 98%, greater than 99%, greater than 99.5%, greater than 99.9% or higher, or has a purity of 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, 99.5%, 99.9% or higher, wherein the content of non-ideal nucleic acids (e.g., various intermediate complexes that are not completely circularized and still retain elements such as ribozymes, etc.) therein is no more than 10%, 9%, 8%, 7%, 6%, 5%, 4%, 3%, 2%, 1%, 0.5%, 0.1% or less.
  • non-ideal nucleic acids e.g., various intermediate complexes that are not completely circularized and still retain elements such as ribozy
  • Linear nucleic acid molecules are said to have a "5'-terminus” (5' end) and a "3'-terminus” (3' end) because nucleic acid phosphodiester bonds are present at the 5' carbon and 3' carbon of the sugar portion of the substituted mononucleotide.
  • the 5' terminal nucleotide of the nucleic acid molecule is a nucleotide that can form a new phosphodiester bond with its ribose 5' carbon
  • the 3' terminal nucleotide is a nucleotide that can form a new phosphodiester bond with its ribose 3' carbon.
  • the term “5'upstream” refers to the portion of a nucleic acid fragment that precedes the fragment in the 5' to 3' direction and is not within the fragment.
  • the term “3' end downstream” refers to the position of a specific nucleic acid fragment in the 5' to 3' direction, but not in the fragment.
  • the "5' end upstream" of a specific nucleic acid fragment located at the 100bp to 199bp may include any position from the 0bp to the 99bp, and its "3' end downstream” may include any position from the 200bp to the 1000bp.
  • the term “5' terminal region” refers to a number of consecutive nucleotide residues in the 5' to 3' direction from the 5' terminal nucleotide of a specific nucleic acid fragment
  • the term “3' terminal region” refers to a number of consecutive nucleotide residues in the 3' to 5' direction from the 3' terminal nucleotide of a specific nucleic acid fragment.
  • the "5' terminal region" of a specific nucleic acid fragment located at 100bp to 199bp can represent 100bp to 119bp, 100bp to 114bp, 100bp to 109bp, and its "3' terminal region” can represent 170bp to 199bp, 175bp to 199bp, 180bp to 199bp.
  • the specific nucleic acid lengths and positions listed in the explanations here are exemplary, and these terms can be supplemented, extended, expanded, replaced, synonymous, and equivalent without violating the concept and spirit of the present disclosure.
  • structural gene refers to a gene that can encode various polypeptides or proteins
  • functional fragments of structural genes may refer to fragments of broken structural genes that can ultimately be expressed as polypeptides in whole or in part.
  • functional fragments of structural genes include protein subunits, protein active centers, protein hybrids of proteins or non-natural catalytic groups, recombinant protein active subunits or active centers, recombinant artificial enzymes or other biological effect units mainly composed of amino acids, etc.
  • non-coding RNA may refer to RNA that does not encode polypeptide or protein products, including rRNA, tRNA, snRNA, snoRNA, microRNA, micronRNA sponge, miRNA, lncRNA, circRNA, piRNA and other RNAs with known functions, and also include RNAs with unknown functions.
  • the term "spacer” refers to a nucleic acid sequence between a functional sequence (e.g., a promoter or a ribosome entry sequence) and another functional sequence (e.g., a structural gene or a fragment thereof, or a non-coding RNA or a DNA transcribed from a non-coding RNA), which is used to space the binding factors of the two functional sequences at a certain distance, or to prevent the two functional sequences from affecting each other's spatial structure, or to space the binding factors of a functional sequence at a certain distance from the spatial structure of the other functional sequence itself, which does not encode a polypeptide, protein or non-coding RNA with a biological effect, or the transcription product or translation product at least does not degrade the biological activity of the polypeptide, protein or non-coding RNA.
  • the spacer can be a scrambled, mononucleotide repeated, or polynucleotide repeated sequence of a specific length (e.
  • RNase R is a ribonuclease exonuclease derived from the Escherichia coli RNR superfamily, which can cut and degrade linear RNA molecules from the 3' to 5' direction, but basically does not digest circular RNA, lasso structure or double-linked RNA molecules with 7nt missing at the 3' protruding end.
  • RNase R can include polypeptides, hybrid enzymes, polypeptide analogs, etc. with the same enzyme activity that are artificially modified or recombinantly produced based on RNase R.
  • immunogenicity refers to the potential to induce an immune response to a substance.
  • an immune response may be induced.
  • cyclization efficiency refers to a measure of the resulting circular polyribonucleotide compared to its linear starting material.
  • translation efficiency refers to the rate or amount of protein or peptide produced from ribonucleotide transcripts. In some embodiments, translation efficiency can be expressed as the amount of protein or peptide produced per a given amount of transcript encoding a protein or peptide.
  • nucleotide refers to a ribonucleotide, a deoxyribonucleotide, a modified form thereof, or an analog thereof.
  • Nucleotides include substances including purines (e.g., adenine, hypoxanthine, guanine, and derivatives and analogs thereof) and pyrimidines (e.g., cytosine, uracil, thymine, and derivatives and analogs thereof).
  • Nucleotide analogs include nucleotides having modifications in the chemical structure of bases, sugars, and/or phosphates, including but not limited to, 5'-position pyrimidine modifications, 8'-position purine modifications, modifications at the cytosine exocyclic amine, and substitutions of 5-bromo-uracil; and 2'-position sugar modifications, including but not limited to sugar-modified ribonucleotides, wherein 2'-OH is substituted by a group such as H, OR, R, halo, SH, SR, NH2, NHR, NR2, or CN, wherein R is an alkyl moiety as defined herein.
  • Nucleotide analogs are also intended to include nucleotides having bases such as inosine, quercetin, xanthine; sugars such as 2'-methylribose; non-natural phosphodiester linkages such as methylphosphonate, phosphorothioate, and peptide linkages. Nucleotide analogs include 5-methoxyuridine, 1-methylpseudouridine, and 6-methyladenosine.
  • nucleic acid and “polynucleotide” are used interchangeably herein to describe a polymer of any length (e.g., greater than about 2 bases, greater than about 10 bases, greater than about 100 bases, greater than about 500 bases, greater than 1000 bases, or up to about 10,000 or more bases), composed of nucleotides (e.g., deoxyribonucleotides or ribonucleotides), and can be produced enzymatically or synthetically (e.g., as described in U.S. Pat. No.
  • Naturally occurring nucleic acids are composed of nucleotides, including guanine, cytosine, adenine, thymine, and uracil (G, C, A, T, and U, respectively).
  • ribonucleic acid and "RNA” refer to a polymer composed of ribonucleotides.
  • deoxyribonucleic acid and “DNA” refer to a polymer composed of deoxyribonucleotides.
  • a polynucleotide sequence has "homology” when it is identical or shares sequence identity with a reverse complementary sequence or “complementary” sequence.
  • the percentage of sequence identity between a homologous region and the reverse complementary sequence of the corresponding homologous region can be any percentage of sequence identity that allows hybridization to occur.
  • the internal duplex forming region of the polynucleotide of the present invention is capable of forming a duplex with another internal duplex forming region and does not form a duplex with an external duplex forming region.
  • Transcription refers to the formation or synthesis of RNA molecules by RNA polymerase using a DNA molecule as a template.
  • the present invention is not limited to the RNA polymerase used for transcription.
  • a T7-type RNA polymerase may be used.
  • Enzymes refers to the formation of polypeptide molecules by ribosomes based on RNA templates.
  • the term “about” should be understood to be within the normal tolerance range of the field, such as within 2 standard deviations of the mean value. “About” can be understood to be within 10%, 9%, 8%, 7%, 6%, 5%, 4%, 3%, 2%, 1%, 0.9%, 0.8%, 0.7%, 0.6%, 0.5%, 0.4%, 0.3%, 0.2%, 0.1%, 0.09%, 0.08%, 0.07%, 0.06%, 0.05%, 0.04%, 0.03%, 0.02% or 0.01% of the value. Unless otherwise obvious from the context, all numerical values provided herein are modified by the term "about”.
  • the term "encoding" refers broadly to any process in which information in a polymeric macromolecule is used to direct the production of a second molecule that is different from the first molecule.
  • the second molecule may have a chemical structure that is different in chemical nature from the first molecule.
  • Combination or “co-administration” refers to the co-administration of a therapeutic agent provided herein and one or more additional therapeutic agents sufficiently close in time that the therapeutic agent provided herein can enhance the effect of the one or more additional therapeutic agents, or vice versa.
  • the terms “treat” and “prevent” and words derived therefrom do not necessarily mean 100% or complete treatment or prevention. Rather, there are varying degrees of treatment or prevention that one of ordinary skill in the art recognizes as having potential benefits or therapeutic effects.
  • the treatment or prevention provided by the methods disclosed herein may include treating or preventing one or more disorders or symptoms of a disease.
  • prevention may include delaying the onset of a disease or a symptom or disorder thereof.
  • the term "expressed sequence” may refer to a nucleic acid sequence encoding a product such as a peptide or polypeptide, a regulatory nucleic acid, or a non-coding nucleic acid.
  • An exemplary expressed sequence encoding a peptide or polypeptide may comprise a plurality of nucleotide triplets, each of which may encode an amino acid and is referred to as a "codon”.
  • an antibody includes, but is not limited to, glycoprotein immunoglobulins that specifically bind to an antigen.
  • an antibody may comprise at least two heavy (H) chains and two light (L) chains interconnected by disulfide bonds, or an antigen-binding molecule thereof.
  • Each H chain comprises a heavy chain variable region (abbreviated herein as VH) and a heavy chain constant region.
  • the heavy chain constant region comprises three constant domains, CH1, CH2, and CH3.
  • Each light chain comprises a light chain variable region (abbreviated herein as VL) and a light chain constant region.
  • the light chain constant region comprises one constant domain, CL.
  • the VH and VL regions can be further subdivided into hypervariable regions, called complementary determining regions (CDRs), interspersed with more conservative regions, called framework regions (FRs).
  • CDRs complementary determining regions
  • FRs framework regions
  • Each VH and VL comprises three CDRs and four FRs, arranged in the following order from the amino terminus to the carboxyl terminus: FR1, CDR1, FR2, CDR2, FR3, CDR3, and FR4.
  • the variable regions of the heavy and light chains contain binding domains that interact with the antigen.
  • the constant regions of the antibodies may mediate the binding of the immunoglobulin to host tissues or factors, including various cells of the immune system (e.g., effector cells) and the first component of the classical complement system.
  • Antibodies may include, for example, monoclonal antibodies.
  • Immunoglobulins can be derived from any known isotype, including but not limited to IgA, secretory IgA, IgG and IgM.
  • IgG subclasses are also well known to those skilled in the art, including but not limited to human IgG1, IgG2, IgG3 and IgG4.
  • Immunotype refers to the Ab class or subclass (e.g., IgM or IgG1) encoded by the heavy chain constant region gene.
  • the term "antibody” includes naturally occurring and non-naturally occurring antibodies; monoclonal and polyclonal antibodies; chimeric and humanized antibodies; human or non-human antibodies; fully synthetic antibodies; and single-chain antibodies.
  • Non-human antibodies can be humanized by recombinant methods to reduce their immunogenicity in humans.
  • antibody also includes an antigen-binding fragment or antigen-binding portion of any of the above-mentioned immunoglobulins, and includes monovalent and divalent fragments or portions, as well as single-chain antibodies.
  • Antigen binding molecule refers to any molecule that comprises an antigen binding portion (e.g., CDR) of an antibody from which the molecule is derived.
  • Antigen binding molecules may include antigen complementary determining regions (CDRs).
  • Examples of antibody fragments include, but are not limited to, Fab, Fab', F(ab')2 and Fv fragments, dAbs, linear antibodies, scFv antibodies, and multispecific antibodies formed by antigen binding molecules.
  • Peptibodies i.e., Fc fusion molecules comprising peptide binding domains
  • the antigen binding molecules bind to antigens on tumor cells.
  • the antigen binding molecules bind to antigens on cells involved in hyperproliferative diseases or to viral or bacterial antigens. In some embodiments, the antigen binding molecules bind to BCMA. In other embodiments, the antigen binding molecules are antibody fragments that specifically bind to antigens, including one or more complementary determining regions (CDRs) thereof. In other embodiments, the antigen binding molecules are single-chain variable fragments (scFv).
  • Cancer refers to a broad variety of diseases characterized by the uncontrolled growth of abnormal cells in the body. Unregulated cell division and growth lead to the formation of malignant tumors that invade adjacent tissues and may also metastasize to distant parts of the body via the lymphatic system or bloodstream. "Cancer” or “cancerous tissue” may include tumors. Examples of cancers that may be treated by the methods disclosed herein include, but are not limited to, cancers of the immune system, including lymphomas, leukemias, myelomas, and other white blood cell malignancies.
  • the methods disclosed herein can be used to reduce the size of tumors originating from, for example, bone cancer, pancreatic cancer, skin cancer, head and neck cancer, cutaneous or intraocular malignant melanoma, uterine cancer, ovarian cancer, rectal cancer, anal cancer, stomach cancer, testicular cancer, uterine cancer, multiple myeloma, Hodgkin's disease, non-Hodgkin's lymphoma (NHL), primary mediastinal large B-cell lymphoma (PMBC), diffuse large B-cell lymphoma (DLBCL), follicular lymphoma (FL), transformed follicular lymphoma, splenic marginal zone lymphoma (SMZL), esophageal cancer, small intestine cancer, endocrine system cancer, thyroid cancer, parathyroid cancer adenocarcinoma, adrenal cancer, urethral cancer, penile cancer, chronic or acute leukemia, acute myeloid leukemia, chronic mye
  • the methods disclosed herein can be used to reduce the size of tumors derived from, for example, sarcomas and carcinomas, fibrosarcomas, myxosarcoma, liposarcoma, chondrosarcoma, osteogenic sarcoma, Kaposi's sarcoma, soft tissue sarcomas and other sarcomas, synovioma, mesothelioma, Ewing's tumor, leiomyosarcoma, rhabdomyosarcoma, colon cancer, pancreatic cancer, breast cancer, ovarian cancer, prostate cancer, hepatocellular carcinoma, lung cancer, colorectal cancer, squamous cell carcinoma, basal cell carcinoma, adenocarcinoma (e.g., pancreatic, colon, ovarian, lung, breast, stomach, prostate, cervical cancer, or esophageal adenocarcinoma), sweat gland cancer, sebaceous gland cancer, pa
  • NK cells include natural killer (cytotoxic) lymphocytes that represent the main components of the innate immune system. NK cells repel tumors and cells infected by viruses. It works through the process of apoptosis or programmed cell death. They are called “natural killers” because they do not need to be activated to kill cells. T cells play a major role in cell-mediated immunity (without antibody involvement).
  • the term “genetic engineering” or “engineering” refers to a method of modifying the genome of a cell, including but not limited to deleting a coding or non-coding region or a portion thereof or inserting a coding region or a portion thereof.
  • the modified cell is a lymphocyte, such as a T cell, which can be obtained from a patient or a donor.
  • the cell can be modified to express an exogenous construct incorporated into the cell genome, such as a chimeric antigen receptor (CAR) or a T cell receptor (TCR).
  • CAR chimeric antigen receptor
  • TCR T cell receptor
  • sequence identity or, for example, “a sequence that is 50% identical to ...” refers to the degree to which the sequences are identical on a nucleotide-by-nucleotide or amino acid-by-amino acid basis over the comparison window.
  • the "percentage of sequence identity" can be calculated by comparing two optimally aligned sequences over the comparison window, determining the number of positions where the same nucleic acid base (e.g., A, T, C, G, I) or the same amino acid residue (e.g., Ala, Pro, Ser, Thr, Gly, Val, Leu, Ile, Phe, Tyr, Trp, Lys, Arg, His, Asp, Glu, Asn, Gln, Cys and Met) appears in the two sequences to obtain the number of matching positions, dividing the number of matching positions by the total number of positions in the comparison window (i.e., the window size), and multiplying the result by 100 to obtain the percentage of sequence identity.
  • the same nucleic acid base e.g., A, T, C, G, I
  • the same amino acid residue e.g., Ala, Pro, Ser, Thr, Gly, Val, Leu, Ile, Phe, Tyr, Trp, Lys, Arg, His
  • treatment refers to both therapeutic treatment and prophylactic or preventive or preventive measures, wherein the purpose is to prevent or slow down (mitigate) an undesirable pathological change or condition.
  • beneficial or desired clinical results include, but are not limited to, alleviation of symptoms, reduction in disease severity, delay or slowing of disease progression, improvement or alleviation of the disease state, and remission (whether partial or complete), whether detectable or undetectable.
  • the term "therapeutically effective amount” refers to an amount of a compound of the invention that can: (i) treat or prevent a disease or condition described herein, (ii) ameliorate or eliminate one or more diseases or conditions described herein, or (iii) prevent or delay the onset of one or more symptoms of a disease or condition described herein.
  • non-coding RNA mainly includes microRNA (miRNA), small interfering RNA (siRNA), PIWI-interacting RNA (piRNA), tRNA-derived small RNA (tsRNA), small nuclear RNA (snRNA), small nucleolar RNA (snoRNA), long non-coding RNA (lncRNA), circular RNA (circRNA), pseudogenes, etc.
  • miRNA is a short ncRNA of about 22 to 23 nucleotides. Its coding gene is transcribed by RNA polymerase II and regulates the expression of mRNA by binding to the 3′ untranslated region (3′ UTR) of mRNA.
  • lncRNA is a non-coding RNA with a length of more than 200 nucleotides. Its biogenesis process is similar to that of mRNA. lncRNA plays an important role in a variety of biological processes such as cell cycle regulation, chromatin modification, and mRNA translation.
  • CircRNA belongs to lncRNA, which is mainly produced by exon or intron sequences. It is a single-stranded circular RNA molecule that can bind to miRNA as a competitive endogenous RNA to regulate transcription or affect parental gene expression.
  • piRNA is a small RNA with a length of about 21 to 35 nucleotides, which is processed from long single-stranded transcripts.
  • RNA polymerase II The genomic sites of these transcripts are aggregated throughout the genome and transcribed by RNA polymerase II. There are about 20,000 piRNAs in the human genome, which are mainly expressed in gonadal cells.
  • tsRNA is derived from the cleavage of tRNA by nucleases. It is usually 18 to 40 nucleotides in length and plays an important role in regulating translation, maintaining the stability of mRNA, gene silencing, reverse transcription, etc.
  • genomic DNA refers to the heritable information of a host organism.
  • the genomic DNA includes all genetic material of a cell or organism, including nuclear DNA (chromosomal DNA), extrachromosomal DNA, and organelle (e.g., mitochondrial) DNA.
  • chromosomal DNA chromosomal DNA
  • extrachromosomal DNA e.g., extrachromosomal DNA
  • organelle e.g., mitochondrial
  • genomic DNA refers to the chromosomal DNA of the nucleus.
  • recombinant is used to describe an organism or cell (eg, a microorganism), it is used to convey that the organism or cell contains at least one "transgene,” “transgenic,” or “recombinant” polynucleotide, typically described later.
  • a polynucleotide that is "exogenous" with respect to an individual organism is a polynucleotide that has been introduced into that organism by any means other than sexual hybridization.
  • promoter or "RNase binding site” is a polynucleotide region that initiates transcription of a coding sequence.
  • the promoter is located near the transcription start site of a gene, on the same strand of DNA and upstream (towards the 5' region of the sense strand).
  • Some Promoters are constitutive in that they are active under all circumstances in the cell, whereas other promoters are regulated to become active in response to a specific stimulus (e.g., inducible promoters).
  • promoter activity and its grammatical equivalents refer to the extent of expression of a nucleotide sequence operably linked to the promoter whose activity is being measured.
  • Promoter activity can be measured directly by determining the amount of RNA transcript produced, such as by Northern blot analysis, or indirectly by determining the amount of product encoded by a linked nucleic acid sequence, such as a reporter nucleic acid sequence linked to the promoter.
  • plasmid refers to an extrachromosomal element that often carries genes that are not part of the core metabolic machinery of the cell, and is usually in the form of a circular double-stranded DNA molecule. These elements can be autonomous replication sequences, genome integration sequences, phage or nucleotide sequences of any origin, linear, circular or supercoiled single-stranded or double-stranded DNA or RNA. Typically, a plasmid contains a replication origin that is functional in a host cell (e.g., E. coli), and a selectable marker for detecting host cells containing the plasmid.
  • a host cell e.g., E. coli
  • transformation refers to the introduction of one or more exogenous polynucleotides into a host cell by using physical or chemical methods.
  • expression is the process of revealing the information encoded within a gene. If the gene encodes a protein, expression includes transcribing the DNA into mRNA, processing the mRNA (if necessary) into a mature mRNA product, and translating the mature mRNA into a protein.
  • the experimental methods in the following examples, unless otherwise specified, are all conventional methods, and are carried out according to the techniques or conditions described in the literature in the art or according to the product instructions.
  • the materials, reagents, instruments, etc. used in the following examples, unless otherwise specified, can all be obtained from commercial sources.
  • the quantitative tests in the following examples, unless otherwise specified, are the average values of three repeated experiments.
  • the nucleotide sequences in the sequence table are written from left to right in the order of 5' to 3' end, and the amino acid sequences are written from left to right in the order of amino terminal to carboxyl terminal.
  • reagents, instruments and cell lines used in the present invention are commercially available.
  • RNase R (Cat. No. E224, nearshore protein);
  • Oligo d(T)25 magnetic beads (Cat. No. S1419S, NEB).
  • Nanodrop spectrophotometer (model Nanodrop ONE c , Thermo Fisher Scientific),
  • RNA template double strand for preparing a linear RNA precursor.
  • the template double strand was incorporated into the pUC57 vector (company name, product number) by gene synthesis, and placed downstream of the 3' end of the T7 promoter sequence (TAATACGACTCACTATA) to obtain the template plasmid pUC-PIE-miR21 for synthesizing a linear RNA precursor
  • a 20 ⁇ l system (1X reaction buffer, 7.5 mM each of ATP, GTP, CTP and UTP, 1 ⁇ l of T7 RNA polymerase mix, 1 ⁇ g of linear template plasmid pUC-PIE-miR21) was used for in vitro transcription at 37°C for 4 h to prepare the linear RNA precursor shown in SEQ ID NO: 11.
  • the precursor contains the elements listed in Table 1 in the 5' to 3' direction.
  • SEQ ID NO: 11 The underlines in SEQ ID NO: 11 indicate the 5' outer homology arm near the 5' end and the 3' outer homology arm near the 3' end, respectively. Homology arm.
  • the first cytosine (C) in the 5' outer homology arm is numbered as position 1, and the guanine (G) adjacent to it is the transcription start point of the T7 promoter (TAATACGACTCACTATA), which is called position 0.
  • the last nucleotide in SEQ ID NO:11 in the 5' to 3' direction is called position 807, and the 3' end nucleotide adjacent to it can be called position 808 accordingly.
  • the nucleotide residues 1 to 151 located 5' upstream of "#" constitute its 5' end arm
  • the nucleotide residues 673 to 807 located 3' downstream of "*" constitute its 3' end arm
  • the circular RNA formed by it will be cyclized by the sequence represented by the nucleotides 152 to 672 located between "#" and "*", that is, the cyclized region specifically includes: 3' exon fragment (positions 152 to 202), 5' internal homology arm (positions 203 to 227), insert fragment (positions 228 to 635), 3' internal homology arm (positions 636 to 657), 5' exon fragment (positions 658 to 672).
  • FIG1 schematically shows the linear RNA precursor, and shows the name and start and end position numbers of each element above the corresponding position.
  • Example 2 Linear RNA precursors with purification tags in one or both end arms and IVT yield and circularization efficiency test
  • linear RNA precursors having nucleotide sequences as shown in SEQ ID NO: 26 to SEQ ID NO: 49 were prepared as Preparation Examples 1 to 24, respectively.
  • the linear RNA precursors of Preparation Examples 1 to 24 respectively contained a poly (A) with a length of 50 nt as a purification tag at different positions of SEQ ID NO: 11.
  • SEQ ID NO: 11 prepared in Example 1 was used as a comparative example (CK).
  • poly A is located at the 3' end of the first nucleotide residue of the linear RNA precursor and is directly connected to the first nucleotide residue.
  • Preparation Examples 1 to 5, 7, 11, 24 and Comparative Example 1 were subjected to cyclization reaction under the same conditions.
  • 2 mm GTP was added and transferred to 55°C for incubation for 15 minutes, and then the incubated mixture was purified with RNA Cleanup Kit according to the manufacturer's instructions to obtain the cyclization reaction product before RNase R treatment (-).
  • the RNA content of the above cyclization reaction product was determined using the Nanodrop method. 20 U RNase R and 2 ⁇ g RNA were added to a 100 ⁇ l reaction system, incubated at 37°C for 15 minutes, and the cyclization reaction product (+) after RNase R treatment was obtained.
  • RNase R is a 3’-5’ exonuclease that can only degrade linear RNA, not circular RNA.
  • Figure 5 shows that after RNase R digestion, obvious bands corresponding to circular RNA can be observed in Preparation Examples 1 to 5, 7, 11, 24 and Comparative Example 1 (CK), indicating that the insertion of poly(A) did not affect the cyclization of these linear RNA precursors. However, in the lanes corresponding to Preparation Examples 1 to 5, 7, 11, and 24, no obvious secondary bands were observed except for the main band, whether before (-) or after (+) RNase R digestion.
  • this example shows that, without being constrained by any theoretical mechanism, inserting a purification tag into the two end arms of a linear RNA precursor (especially the region of the 5' external homology arm, the 5' intron or the 3' external homology arm) generally does not significantly interfere with the cyclization reaction, but can surprisingly improve the cyclization reaction efficiency of the linear RNA precursor, significantly increase the proportion of the desired complete cyclization product in the reaction products, and significantly reduce or even eliminate undesirable by-products such as nicked RNA, thereby significantly improving the purity of the crude product of the cyclization reaction, reducing or even eliminating the dependence of the circular RNA purification method on the RNase R enzyme treatment step, and ultimately greatly simplifying the subsequent purification process while not affecting or even significantly improving the quality of the cyclization product.
  • Example 3 Linear RNA precursor with purification tags in both side arms and IVT yield and circularization efficiency test
  • RNA precursor having a nucleotide sequence as shown in SEQ ID NO:50 was prepared as Comparative Example 2. Except for the 5' external homology arm and the 3' external homology arm shown in Table 3, the remaining element sequences in the linear RNA precursor were the same as those in Table 1.
  • linear RNA precursors having nucleotide sequences as shown in SEQ ID NO: 51 to 55 were prepared as Preparation Examples 25 to 29, respectively.
  • the linear RNA precursors of Preparation Examples 25 to 29 respectively contained poly (A) with a length of 50 nt as a purification tag at different positions of SEQ ID NO: 50.
  • RNA precursor having the element structure shown in Figures 2 to 4 is prepared.
  • the external homology arms, intron fragments, coding region fragments, translation initiation elements, exon fragments, internal homology arms, spacer sequences, and insertion sequences shown in the figures can all use elements known in the art to have corresponding functions, or elements that can be reasonably inferred to have corresponding functions, including but not limited to various functional elements described in CN112399860A and CN115404240A.
  • the above-mentioned application or patent text is hereby incorporated herein in its entirety.
  • the final reaction solution containing 5 ⁇ g of RNA was taken from the cyclization reaction products of Preparation Example 24 and Comparative Example 1 for purification. Without adding RNase R, the final solution was incubated with 200 ⁇ l of Oligo d(T)25 magnetic beads for 20 minutes to allow the magnetic beads to fully contact and adsorb the RNA with the poly(A) label. After 20 minutes, the sample was placed on a magnetic stand and allowed to stand for 10 seconds to separate the magnetic beads from the supernatant, the magnetic beads were removed, and the supernatant was recovered using the RNA Cleanup Kit according to the manufacturer's instructions. The purity of the cyclization reaction product before RNase R treatment (-) and the cyclization reaction product after RNase R treatment (+) was detected using an Agilent 5200 fragment analyzer under default settings. The results are shown in Figures 8A to 8D.
  • FIG8 shows that before incubation with Oligo d(T)25 magnetic beads, the purity of circular RNA in the samples of Comparative Example 1 and Preparation Example 24 was 81.3% and 92.7%, respectively. After incubation with magnetic beads, the purity of circular RNA in Comparative Example 1 was determined to be 78.1%, and the purity of Preparation Example 24 was increased to 100%.
  • the above results show that after inserting poly(A) into the two end arms of the linear RNA precursor, various impurities coexisting with the target circular RNA in the cyclization reaction system can be removed by affinity chromatography media (such as Oligo d(T)25 magnetic beads), and high-purity circular RNA can be obtained.
  • sequence table The sequences used in the above examples of the present application are shown in the sequence table. It should be understood that these sequences are only exemplary sequences of the present application's embodiments, rather than any restriction to the present application's scheme.
  • the nucleotide sequence in the present application's sequence table can represent both a DNA sequence and an RNA sequence, and when it represents an RNA sequence, "T" therein represents uridine.

Landscapes

  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Chemical & Material Sciences (AREA)
  • Genetics & Genomics (AREA)
  • Organic Chemistry (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • General Health & Medical Sciences (AREA)
  • Biomedical Technology (AREA)
  • Zoology (AREA)
  • Biotechnology (AREA)
  • Wood Science & Technology (AREA)
  • Medicinal Chemistry (AREA)
  • General Engineering & Computer Science (AREA)
  • Veterinary Medicine (AREA)
  • Animal Behavior & Ethology (AREA)
  • Public Health (AREA)
  • Pharmacology & Pharmacy (AREA)
  • Molecular Biology (AREA)
  • Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
  • General Chemical & Material Sciences (AREA)
  • Chemical Kinetics & Catalysis (AREA)
  • Biochemistry (AREA)
  • Biophysics (AREA)
  • Microbiology (AREA)
  • Diabetes (AREA)
  • Physics & Mathematics (AREA)
  • Plant Pathology (AREA)
  • Hematology (AREA)
  • Obesity (AREA)
  • Gastroenterology & Hepatology (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Communicable Diseases (AREA)
  • Oncology (AREA)
  • Neurology (AREA)
  • Neurosurgery (AREA)
  • Pulmonology (AREA)
  • Cardiology (AREA)
  • Heart & Thoracic Surgery (AREA)
  • Toxicology (AREA)

Abstract

Provided is a method for preparing a circular RNA, which method comprises inserting purification tags, such as poly(A) tags or functional variants thereof into two terminal arms of a linear RNA precursor, and separating the circular RNA, which is obtained by means of the cyclization reaction of the linear RNA precursor, from other byproducts of the cyclization reaction as impurities using an affinity chromatographic medium capable of simultaneously binding to the tags. Further provided are a linear RNA precursor with a purification tag inserted in a specific region and an encoding DNA thereof. The insertion of the purification tag in the specific region does not affect the yield of linear RNA precursors produced by means of the in-vitro transcription (IVT) of the encoding DNA.

Description

制备环状RNA方法及用于所述方法的核酸序列Method for preparing circular RNA and nucleic acid sequence used in the method 技术领域Technical Field

本发明涉及用于制备和纯化环状RNA的方法和核酸序列。The present invention relates to methods and nucleic acid sequences for preparing and purifying circular RNA.

背景技术Background Art

近年来,mRNA疫苗或mRNA药物的研发和应用为控制新冠疫情起到了至关重要的作用,同时也大幅加快了mRNA药物在其他疾病治疗领域的研发。人们注意到,线状mRNA类药物在体内面临着稳定性低、半衰期短等重大挑战,而环状RNA由于不含游离末端,在细胞内能够有效避免被外切RNA酶降解,从而具有明显高于线状mRNA分子的稳定性(参见非专利文献1),为解决上述挑战提供了一种有效候选方案。In recent years, the development and application of mRNA vaccines or mRNA drugs have played a vital role in controlling the COVID-19 pandemic, and have also greatly accelerated the development of mRNA drugs in other disease treatment fields. It has been noted that linear mRNA drugs face major challenges in vivo, such as low stability and short half-life. Since circular RNA does not contain free ends, it can effectively avoid degradation by exonuclease in cells, thus having significantly higher stability than linear mRNA molecules (see non-patent literature 1), providing an effective candidate solution to the above challenges.

环状RNA(circRNA)是一类共价闭合环状的RNA分子,自1976年首次发现植物类病毒(plant viroids)的基因组是环状RNA分子以来,已经陆续在病毒(如丁型肝炎病毒HDV)和真核细胞(如哺乳动物细胞)中鉴定出成百上千种天然的环状RNA,并发现这些分子在体内执行多种不同的生物学功能,包括但不限于编码蛋白质、调节基因转录、作为microRNA海绵(microRNA sponge)、蛋白支架等(参见非专利文献2)。相关的分子生物学和功能的研究分析推动了环状RNA的体外合成技术,已报道过化学合成法、连接酶法(T4 DNA连接酶、T4 RNA连接酶1、T4 RNA连接酶2)、核酶法(包括I型内含子核酶和II型内含子核酶)(参见非专利文献1)。Circular RNA (circRNA) is a type of covalently closed circular RNA molecule. Since the genome of plant viroids was first discovered as a circular RNA molecule in 1976, hundreds of natural circular RNAs have been identified in viruses (such as hepatitis D virus HDV) and eukaryotic cells (such as mammalian cells), and it has been found that these molecules perform a variety of different biological functions in vivo, including but not limited to encoding proteins, regulating gene transcription, acting as microRNA sponges, protein scaffolds, etc. (see non-patent literature 2). Related molecular biology and functional research and analysis have promoted the in vitro synthesis technology of circular RNA, and chemical synthesis, ligase method (T4 DNA ligase, T4 RNA ligase 1, T4 RNA ligase 2), and ribozyme method (including type I intron ribozyme and type II intron ribozyme) have been reported (see non-patent literature 1).

其中,核酶法是当前核酸药物领域内合成环状RNA的主要途径之一,其利用通过改造I型或II型自剪接内含子而建立的PIE系统(Permuted Intron-Exon System)来制备环状RNA。据报道,I型内含子的剪接不依赖于任何蛋白质,在存在Mg2+和鸟苷的条件下,通过两步转酯反应即可实现剪接:第一步,鸟苷攻击5’剪接点(5’SS),鸟苷的3’羟基在该位点进行酯交换反应,使5’外显子和5’内含子之间发生剪切;第二步,第一步反应后形成的中间体的3’-OH攻击3’剪接点(3’SS),于该位点进行第二次酯交换反应,导致位于内含子两侧的外显子直接相连,而内含子本身的5’端与3’端发生环化后被剪切掉。根据上述作用机制,将天然I型自剪接内含子一分为二,并将“外显子1-内含子片段1-内含子片段2-外显子2”的排列顺序人为置换为“内含子片段2-外显子2-外显子1-内含子片段1”的排列,就可以在Mg2+和鸟苷的存在条件下,制备由“外显子2-外显子1”片段环化而得的环状RNA。例如,非专利文献3和4分别报道了通过上述置换方式改造Anabaena(鱼腥藻)pre-Trna或T4噬菌体Td基因的I型自剪接内含子而建立的PIE系统,上述系统能够成功环化~100nt或~550nt长度不等的RNA。基于这2个I型自剪接内含子核酶的PIE系统至今依然被广泛用于环状RNA合成。非专利文献5还报道了对基于Anabaena pre-tRNA I型内含子核酶的PIE系统的优化,通过添加外部同源臂(external homology arm)、内部同源臂(internal homology arm)等可以显著提高体外环化效率,将可环化的RNA片段长 度提高至~5kb,为环状RNA药物的合成打下了良好的基础。Among them, the ribozyme method is one of the main ways to synthesize circular RNA in the current field of nucleic acid drugs. It uses the PIE system (Permuted Intron-Exon System) established by modifying type I or type II self-splicing introns to prepare circular RNA. It is reported that the splicing of type I introns does not depend on any protein. In the presence of Mg 2+ and guanosine, splicing can be achieved through a two-step transesterification reaction: in the first step, guanosine attacks the 5' splicing site (5'SS), and the 3' hydroxyl group of guanosine undergoes an ester exchange reaction at this site, causing splicing between the 5' exon and the 5'intron; in the second step, the 3'-OH of the intermediate formed after the first reaction attacks the 3' splicing site (3'SS), and a second ester exchange reaction is carried out at this site, resulting in the exons on both sides of the intron being directly connected, and the 5' end of the intron itself is cyclized with the 3' end and then sheared off. According to the above mechanism of action, the natural type I self-splicing intron is divided into two, and the arrangement order of "exon 1-intron fragment 1-intron fragment 2-exon 2" is artificially replaced with the arrangement of "intron fragment 2-exon 2-exon 1-intron fragment 1", and the circular RNA obtained by cyclization of the "exon 2-exon 1" fragment can be prepared in the presence of Mg 2+ and guanosine. For example, non-patent documents 3 and 4 respectively reported the PIE system established by transforming the type I self-splicing intron of Anabaena (fish algae) pre-Trna or T4 phage Td gene through the above replacement method. The above system can successfully cyclize RNAs of varying lengths of ~100nt or ~550nt. The PIE system based on these two type I self-splicing intron ribozymes is still widely used in circular RNA synthesis. Non-patent document 5 also reports the optimization of the PIE system based on Anabaena pre-tRNA type I intron ribozyme, which can significantly improve the in vitro cyclization efficiency by adding external homology arms, internal homology arms, etc., and shorten the length of the cyclizable RNA fragment to 1. The length of circular RNA was increased to ~5kb, laying a good foundation for the synthesis of circular RNA drugs.

上述环状RNA的合成过程均会产生多种RNA副产物,包括但不限于:成环反应后被剪切掉的两侧内含子、尚未成环的线状RNA前体、以及由多个线状前体聚合产生的线状及环状高分子量聚合RNA。现有技术已报道过这些副产物的存在对最终的环状RNA生物制品具有种种不利的影响,不仅会降低了制品的纯度和效价,而且会显著增加体内免疫原性(参见非专利文献2)。因此迫切需要能够高效制备高纯度环状RNA分子的方法,尤其是能够满足产业化的产量水平的制备方法。The synthesis process of the above circular RNA will produce a variety of RNA byproducts, including but not limited to: introns on both sides that are cut off after the circularization reaction, linear RNA precursors that have not yet been circularized, and linear and circular high molecular weight polymerized RNA produced by polymerization of multiple linear precursors. The prior art has reported that the presence of these byproducts has various adverse effects on the final circular RNA biological products, which not only reduces the purity and titer of the products, but also significantly increases the immunogenicity in vivo (see non-patent document 2). Therefore, there is an urgent need for a method that can efficiently prepare high-purity circular RNA molecules, especially a preparation method that can meet the production level of industrialization.

现有制备方法中对副产物与环状RNA的分离主要利用的是二者在分子量的差异,或者线状与环状结构的物理性质区别,例如体积排阻层析(size exclusion chromatography,SEC)或高效液相色谱(HPLC)(参见非专利文献2~6,专利文献1)。但是,上述纯化方式难以区分目标环状RNA和与之分子量大小非常接近的线状RNA前体、以及分子量完全相同的nicked RNA,因而业内常规做法是在进行SEC或HPLC之前进行RNA酶R消化,从而去除上述杂质。同时,SEC和HPLC本身就具有载量低、成本高、工艺复杂、分辨率低等共性问题,在叠加额外的酶处理步骤后,更加难以同时实现产量、效率和纯度的提高,勿论工业级别的环状RNA生产。The existing preparation methods for separating byproducts from circular RNA mainly utilize the difference in molecular weight between the two, or the difference in physical properties between linear and circular structures, such as size exclusion chromatography (SEC) or high performance liquid chromatography (HPLC) (see non-patent literature 2 to 6, patent literature 1). However, the above purification methods make it difficult to distinguish between the target circular RNA and linear RNA precursors with very similar molecular weights, as well as nicked RNA with exactly the same molecular weight. Therefore, the conventional practice in the industry is to perform RNase R digestion before SEC or HPLC to remove the above impurities. At the same time, SEC and HPLC themselves have common problems such as low loading capacity, high cost, complex process, and low resolution. After adding additional enzyme treatment steps, it is even more difficult to simultaneously achieve improvements in yield, efficiency, and purity, let alone industrial-level circular RNA production.

专利文献2报道了使用以Oligo dT为亲和配基的柱层析来纯化具有Poly(A)结构的线状mRNA的方法。虽然专利文献2也提及可以向环状RNA分子中人为添加Poly(A)序列,但该专利方法将Poly(A)添加在RNA分子的成环区域中,因而理论上无法在一个柱层析步骤中同时分离全部自剪接副产物与目标环状RNA分子,尤其无法分离同样带有Poly(A)序列的尚未成环的线状RNA前体、nicked RNA和目标环状RNA分子。Patent document 2 reports a method for purifying linear mRNA with a poly(A) structure using column chromatography using oligo dT as an affinity ligand. Although patent document 2 also mentions that poly(A) sequences can be artificially added to circular RNA molecules, the patent method adds poly(A) to the circular region of the RNA molecule, so theoretically it is impossible to simultaneously separate all self-splicing byproducts and target circular RNA molecules in one column chromatography step, especially it is impossible to separate linear RNA precursors, nicked RNA and target circular RNA molecules that also have poly(A) sequences and have not yet formed a circle.

专利文献3报道了另一种纯化环状RNA的方法,该方法包括给与环状RNA混合存在的线状RNA加poly(A)尾的步骤,以及随后使用寡(dT)25偶联的磁珠来去除含poly(A)尾的RNA的步骤。然而该方法的主要目的是分离天然环状RNA及其对应的线状转录本片段,而并非针对环状RNA合成过程中的各种副产物。Patent document 3 reports another method for purifying circular RNA, which includes the steps of adding a poly(A) tail to linear RNA mixed with circular RNA, and then removing the RNA containing the poly(A) tail using oligo(dT)25-coupled magnetic beads. However, the main purpose of this method is to separate natural circular RNA and its corresponding linear transcript fragments, rather than various byproducts in the synthesis process of circular RNA.

专利文献4报道了在线性的前体RNA的3’或5’末端掺入至少一个纯化标签,从而可以使用反义Oligo亲和纯化环状RNA的方法,但专利文献4已经整体上明确排除了使用poly(A)及其变体作为纯化标签。具体而言,poly(A)是构成专利文献4的环状RNA的必要元件,若还将其作为标签插入线状RNA前体的3’或5’末端,显然根据亲和纯化的设计原理将无法区分期望分离的环状RNA和未环化的杂质,专利文献4也明确记载,“作为自剪接的结果,相应的环化RNA不再包含3’和/或5’末端纯化标签”。因此,专利文献4实际上给出了与本发明完全相反的技术教导。Patent document 4 reports a method of incorporating at least one purification tag into the 3' or 5' end of a linear precursor RNA, thereby enabling the use of antisense oligo affinity purification of circular RNA, but Patent document 4 has clearly excluded the use of poly(A) and its variants as purification tags as a whole. Specifically, poly(A) is an essential element constituting the circular RNA of Patent document 4. If it is also inserted as a tag into the 3' or 5' end of the linear RNA precursor, it is obvious that according to the design principle of affinity purification, it will not be possible to distinguish between the desired isolated circular RNA and the uncircularized impurities. Patent document 4 also clearly states that "as a result of self-splicing, the corresponding circularized RNA no longer contains 3' and/or 5' end purification tags". Therefore, Patent document 4 actually provides technical teachings that are completely opposite to the present invention.

此外,专利文献4还记载了使用SEQ ID NO:208(tctttaccctcgtcttgacg)和209(tatgctgttatccgtcgatt)作为Oligo来亲和纯化环状RNA。然而,该文献的图6显示,环状RNA的 富集效果发生在将目标环状RNA与未完全环化的杂质分离的步骤之后。换言之,专利文献4中,3’或5’末端添加纯化标签并不会直接改善环化反应效率和环状RNA纯度。In addition, Patent Document 4 also describes the use of SEQ ID NO: 208 (tctttaccctcgtcttgacg) and 209 (tatgctgttatccgtcgatt) as oligos for affinity purification of circular RNA. However, FIG. 6 of the document shows that the circular RNA The enrichment effect occurs after the step of separating the target circular RNA from the impurities that are not completely circularized. In other words, in Patent Document 4, adding a purification tag to the 3' or 5' end does not directly improve the circularization reaction efficiency and circular RNA purity.

参考文献:References:

非专利文献1:Chen等人,Frontiers in Bioengineering and Biotechnology(2021).9:787881;Non-patent literature 1: Chen et al., Frontiers in Bioengineering and Biotechnology (2021). 9: 787881;

非专利文献2:Liu等人,Molecular Cell(2022),82:1-15;Non-patent literature 2: Liu et al., Molecular Cell (2022), 82: 1-15;

非专利文献3:Been等人,Nucleic Acids Research(1992).20:5357-5364;Non-patent literature 3: Been et al., Nucleic Acids Research (1992). 20: 5357-5364;

非专利文献4:Ford等人,Proceedings of National Academy of Science(1994).91:3117-3121;Non-patent literature 4: Ford et al., Proceedings of National Academy of Science (1994).91:3117-3121;

非专利文献5:Wesselhoeft等人,Nature Communications(2018).9:2629;Non-patent literature 5: Wesselhoeft et al., Nature Communications (2018). 9: 2629;

非专利文献6:Wesselhoeft等人,Molecular Cell(2019).74:508-520;Non-patent literature 6: Wesselhoeft et al., Molecular Cell (2019). 74: 508-520;

专利文献1:WO2019/236673A1;Patent document 1: WO2019/236673A1;

专利文献2:CN114381454A;Patent document 2: CN114381454A;

专利文献3:CN110283895A;Patent document 3: CN110283895A;

专利文献:4:WO2023/073228A1。Patent document: 4: WO2023/073228A1.

发明内容Summary of the invention

本发明人通过深入研究发现,通过在用于制备环状RNA的线状RNA前体的两侧端臂中插入标签序列,就能够利用标签序列与相应亲和配基的特异性结合,使用亲和层析分离所制备的目的环状RNA分子与制备过程中产生的各种RNA副产物。与传统的环状RNA制备方法相比,本发明方法能够使用易于操作和产业化的亲和层析工艺,在降低生产成本的同时能够实现产业化生产。本发明人还令人惊异地发现,在线状RNA前体的两端插入多聚腺苷酸或其功能性变体后,能够显著减少、甚至基本消除nicked RNA这类副产物的产生,因而极大地降低了在环状RNA纯化过程中使用RNA酶R处理时的酶用量,甚至可以省略该步酶处理步骤,不仅简化了工艺流程、降低成本,而且显著提高了环化RNA的生产效率、产物纯度和产量。此外,本发明方法无需在目的环状RNA中插入与其功能无关的标签,提高了环状RNA的安全性,同时所有杂质中均存在标签序列,极大地简化了纯度检测和质量控制。使用本发明方法制备得到的环状RNA基本不含线状RNA副产物为研发环状RNA药物提供了优良的药物活性成分。本发明还筛选了线状RNA前体及其编码DNA中适宜插入纯化标签的位置,结果发现,在将纯化标签插入至线状RNA前体的编码DNA的特定区域之后,不会降低所述编码DNA体外转录(IVT)产生线状RNA前体的产量。The inventors have found through in-depth research that by inserting a tag sequence into the two end arms of the linear RNA precursor used to prepare circular RNA, the specific binding of the tag sequence to the corresponding affinity ligand can be utilized to separate the prepared target circular RNA molecule from various RNA byproducts generated during the preparation process using affinity chromatography. Compared with the traditional circular RNA preparation method, the method of the present invention can use an affinity chromatography process that is easy to operate and industrialized, and can achieve industrial production while reducing production costs. The inventors also surprisingly found that after inserting polyadenylic acid or its functional variants at both ends of the linear RNA precursor, the production of byproducts such as nicked RNA can be significantly reduced or even basically eliminated, thereby greatly reducing the amount of enzyme used when using RNase R treatment during the circular RNA purification process, and even omitting this enzyme treatment step, which not only simplifies the process flow and reduces costs, but also significantly improves the production efficiency, product purity and yield of circularized RNA. In addition, the method of the present invention does not need to insert a tag that is irrelevant to its function into the target circular RNA, which improves the safety of the circular RNA. At the same time, the tag sequence exists in all impurities, which greatly simplifies purity detection and quality control. The circular RNA prepared by the method of the present invention substantially does not contain linear RNA byproducts, providing excellent active pharmaceutical ingredients for the development of circular RNA drugs. The present invention also screened the linear RNA precursor and its encoding DNA for suitable insertion of purification tags. The results showed that after the purification tag was inserted into a specific region of the encoding DNA of the linear RNA precursor, the yield of the linear RNA precursor produced by in vitro transcription (IVT) of the encoding DNA would not be reduced.

因此,一方面,本发明提供了一种制备环状RNA的方法,所述方法包括:Therefore, in one aspect, the present invention provides a method for preparing circular RNA, the method comprising:

步骤A:产生线状RNA前体,所述线状前体按5’至3’方向依次以彼此可操作地连接的方式包含5’端臂、3’自剪接位点、环化区域、5’自剪接位点和3’端臂,其中,所述5’端臂中插入了第一标签,所述3’端臂中插入了第二标签; Step A: generating a linear RNA precursor, wherein the linear precursor comprises a 5' end arm, a 3' self-splicing site, a circularization region, a 5' self-splicing site and a 3' end arm in a manner operably connected to each other in sequence from 5' to 3' direction, wherein a first tag is inserted into the 5' end arm and a second tag is inserted into the 3' end arm;

步骤B:使所述线状RNA前体处于适合所述3’自剪接位点和所述5’自剪接位点发生自剪接的条件下,得到包含带有所述第一标签和/或所述第二标签的线状RNA片段以及由所述环化区域环化而得的环状RNA的混合物;Step B: placing the linear RNA precursor under conditions suitable for self-splicing of the 3' self-splicing site and the 5' self-splicing site, to obtain a mixture comprising a linear RNA fragment with the first tag and/or the second tag and a circular RNA obtained by cyclization of the cyclization region;

步骤C:使所述混合物与能够同时结合所述第一标签和所述第二标签的亲和层析介质接触一定时间,至足以使所述亲和层析介质结合包含至少一种所述标签的线状RNA片段;和Step C: contacting the mixture with an affinity chromatography medium capable of simultaneously binding the first tag and the second tag for a period of time sufficient to allow the affinity chromatography medium to bind to the linear RNA fragment containing at least one of the tags; and

步骤D:将所述亲和层析介质和与之接触后的混合物分离,收集上清液,得到环状RNA,Step D: separating the affinity chromatography medium and the mixture after contacting the affinity chromatography medium, collecting the supernatant, and obtaining the circular RNA.

其中,所述第一标签和/或所述第二标签独立地选自由20~100个、优选30~90个,更优选40~80个、进一步优选45~70个,最优选50~65个连续的腺嘌呤核苷酸组成的poly(A)标签或其功能性变体,并且所述第一标签和所述第二标签在所述环化区域和所述环状RNA中均不存在。Wherein, the first tag and/or the second tag are independently selected from a poly(A) tag consisting of 20 to 100, preferably 30 to 90, more preferably 40 to 80, further preferably 45 to 70, and most preferably 50 to 65 consecutive adenine nucleotides or a functional variant thereof, and the first tag and the second tag are not present in the cyclization region and the circular RNA.

在一些实施方案中,本发明的方法中使用的poly(A)标签的功能性变体为在poly(A)标签中插入1个或多个非A碱基,优选插入1~20个非A碱基,更优选插入1~10个非A碱基。In some embodiments, the functional variant of the poly(A) tag used in the method of the present invention is to insert one or more non-A bases into the poly(A) tag, preferably insert 1 to 20 non-A bases, and more preferably insert 1 to 10 non-A bases.

在一些实施方案中,本发明的方法中使用的poly(A)标签的功能性变体包含如下情况:In some embodiments, functional variants of the poly(A) tag used in the methods of the present invention include the following:

(1)唯一的元件a、至少一个元件b,以及至少一个元件c,(1) a single element a, at least one element b, and at least one element c,

(2)唯一的元件a,至少一个元件b,以及至少一个元件d;或者(2) only one element a, at least one element b, and at least one element d; or

(3)唯一的元件a,至少一个元件b,以及至少一个元件c和至少一个元件d,(3) a single element a, at least one element b, at least one element c and at least one element d,

其中,所述元件a是由20个以上连续的腺嘌呤核苷酸组成,所述元件b是由3个以上且少于20个连续的腺嘌呤核苷酸组成,所述元件c是由一个选自尿嘧啶核苷酸、胞嘧啶核苷酸、鸟嘌呤核苷酸的核苷酸组成,所述元件d是由2个以上至20个以下的核苷酸组成,所述核苷酸任意地选自腺嘌呤核苷酸、尿嘧啶核苷酸、胞嘧啶核苷酸、鸟嘌呤核苷酸,并且所述元件d不包含3个以上连续腺嘌呤核苷酸,且其5’及3’最末端的核苷酸不是腺嘌呤核苷酸,wherein the element a is composed of more than 20 consecutive adenine nucleotides, the element b is composed of more than 3 and less than 20 consecutive adenine nucleotides, the element c is composed of a nucleotide selected from uracil nucleotides, cytosine nucleotides, and guanine nucleotides, the element d is composed of more than 2 and less than 20 nucleotides, the nucleotides are arbitrarily selected from adenine nucleotides, uracil nucleotides, cytosine nucleotides, and guanine nucleotides, and the element d does not contain more than 3 consecutive adenine nucleotides, and the 5' and 3' terminal nucleotides are not adenine nucleotides,

其中,当所述Poly(A)标签中同时包含两个以上的所述元件b、所述元件c或所述元件d时,每两个元件b的序列可以相同或不同、每两个元件c的序列可以相同或不同,以及每两个元件d的序列可以相同或不同,Wherein, when the Poly(A) tag contains two or more of the element b, the element c or the element d at the same time, the sequences of every two elements b may be the same or different, the sequences of every two elements c may be the same or different, and the sequences of every two elements d may be the same or different.

并且,所述元件a和所述元件b之间、所述元件c和所述元件d之间、所述元件b彼此之间、所述元件c彼此之间、所述元件d彼此之间均不相邻。Furthermore, the element a and the element b, the element c and the element d, the elements b, the elements c, and the elements d are not adjacent to each other.

在一些实施方案中,本发明的方法中使用的元件a由20个以上、80个以下连续的腺嘌呤核苷酸组成,优选由30至70个、35至65个、40至60个、或45至55个连续的腺嘌呤核苷酸组成,更优选由60个连续的腺嘌呤核苷酸组成。In some embodiments, element a used in the method of the present invention consists of more than 20 and less than 80 consecutive adenine nucleotides, preferably consists of 30 to 70, 35 to 65, 40 to 60, or 45 to 55 consecutive adenine nucleotides, and more preferably consists of 60 consecutive adenine nucleotides.

在一些实施方案中,本发明的方法中使用的元件b由3至10个、10至19个、12至15个、14nt至17个、或者16至19个,优选由19个连续的腺嘌呤核苷酸组成。在一些实施方案中,本发明的方法中使用的元件b的个数为2~10个,优选2~5个,进一步优选3个。 In some embodiments, the element b used in the method of the present invention consists of 3 to 10, 10 to 19, 12 to 15, 14 nt to 17, or 16 to 19, preferably 19 consecutive adenine nucleotides. In some embodiments, the number of element b used in the method of the present invention is 2 to 10, preferably 2 to 5, and more preferably 3.

在一些实施方案中,本发明的方法中使用的元件c为鸟嘌呤核苷酸。在一些实施方案中,本发明的方法中使用的元件c的个数为2至10个、3个至8个、4至6个、或者2至5个,优选为2个。In some embodiments, the element c used in the method of the present invention is a guanine nucleotide. In some embodiments, the number of the element c used in the method of the present invention is 2 to 10, 3 to 8, 4 to 6, or 2 to 5, preferably 2.

在一些实施方案中,本发明的方法中使用的元件d由3至18个、5至16个、4至10个、或者6至12个核苷酸组成,优选由6个核苷酸组成。在优选的实施方案中,本发明的方法中使用的元件d可独立地选自GAUAUC、GUAUAC、GAAUCU、GCAUAUGACU或GAUAUCGUAUAC中的任一者。在一些实施方案中,本发明的方法中使用的元件d的个数为0~5个,优选1~3个,更优选1个。In some embodiments, the element d used in the method of the present invention consists of 3 to 18, 5 to 16, 4 to 10, or 6 to 12 nucleotides, preferably 6 nucleotides. In a preferred embodiment, the element d used in the method of the present invention can be independently selected from any one of GAUAUC, GUAUAC, GAAUCU, GCAUAUGACU or GAUAUCGUAUAC. In some embodiments, the number of elements d used in the method of the present invention is 0 to 5, preferably 1 to 3, and more preferably 1.

在一些实施方案中,当元件c和元件d同时存在时,本发明的方法中使用的元件c和元件d的个数总和为2~15个,优选3~5个,更优选3个。In some embodiments, when element c and element d are present at the same time, the total number of element c and element d used in the method of the present invention is 2 to 15, preferably 3 to 5, and more preferably 3.

在优选的实施方案中,本发明方法中使用的poly(A)标签的功能性变体具有选自如下所示结构中的任一种结构:元件a-元件c-元件b-元件c-元件b-元件c-元件b-元件c-元件b,元件b-元件c-元件b-元件c-元件a-元件d-元件b-元件c-元件b-元件c-元件b,元件b-元件c-元件b-元件c-元件b-元件d-元件a-元件c,元件a-元件d-元件b-元件c-元件b-元件c-元件b,或,元件b-元件c-元件b-元件c-元件b-元件d-元件a。In a preferred embodiment, the functional variant of the poly(A) tag used in the method of the present invention has any one structure selected from the following structures: element a-element c-element b-element c-element b-element c-element b-element c-element b, element b-element c-element b-element c-element a-element d-element b-element c-element b-element c-element b, element b-element c-element b-element c-element b-element d-element a-element c, element a-element d-element b-element c-element b-element c-element b-element b, or, element b-element c-element b-element c-element b-element d-element a.

在任选的实施方案中,本发明的方法中使用的poly(A)标签的功能性变体还可进一步包含一个元件e,所述元件e是由一个或两个连续的腺嘌呤核苷酸组成,其中,所述元件e位于所述Poly(A)标签序列的3’末端,且与所述元件d或所述元件c相邻。In an optional embodiment, the functional variant of the poly (A) tag used in the method of the present invention may further comprise an element e, wherein the element e is composed of one or two consecutive adenine nucleotides, wherein the element e is located at the 3' end of the Poly (A) tag sequence and is adjacent to the element d or the element c.

在一些实施方案中,本发明的方法中的线状RNA前体在5’端臂按5’至3’方向包含5’外部同源臂和3’内含子片段,所述第一标签插入在所述5’外部同源臂中,或者插入在所述3’内含子片段中靠近所述5’外部同源臂的5’末端区域中,所述5’末端区域优选为所述3’内含子片段的5’末端的20个核苷酸,更优选15个核苷酸,进一步优选10个核苷酸,或者插入在5'外部同源臂的5’端上游,并且,本发明的方法中的线状RNA前体还在3’端臂按5’至3’方向包含5’内含子片段和3’外部同源臂,所述第二标签插入在所述3’外部同源臂中,或者插入在所述5’内含子片段中靠近所述3’外部同源臂的3’末端区域中,所述3’末端区域优选为所述5’内含子片段的3’末端的20个核苷酸,更优选15个核苷酸,进一步优选10个核苷酸,或者插入在3'外部同源臂的3’端下游。In some embodiments, the linear RNA precursor in the method of the present invention comprises a 5' external homology arm and a 3' intron fragment in the 5' end arm in the 5' to 3' direction, and the first tag is inserted in the 5' external homology arm, or inserted in the 5' terminal region of the 3' intron fragment near the 5' external homology arm, and the 5' terminal region is preferably 20 nucleotides at the 5' end of the 3' intron fragment, more preferably 15 nucleotides, and further preferably 10 nucleotides, or inserted at the 5' end of the 5' external homology arm. Upstream, and the linear RNA precursor in the method of the present invention also comprises a 5' intron fragment and a 3' external homologous arm in the 3' end arm in the 5' to 3' direction, and the second tag is inserted in the 3' external homologous arm, or inserted in the 3' terminal region of the 5' intron fragment close to the 3' external homologous arm, the 3' terminal region is preferably 20 nucleotides at the 3' end of the 5' intron fragment, more preferably 15 nucleotides, further preferably 10 nucleotides, or inserted downstream of the 3' end of the 3' external homologous arm.

在优选的实施方案中,第一标签插入在5’外部同源臂中的任意位置,且所述第二标签插入在3'外部同源臂的3’端下游(例如紧邻3'外部同源臂的3’端最后一个核苷酸残基的位置)。在另一些优选的实施方案中,第一标签插入在5’外部同源臂中的任意位置,且所述第二标签插入在3'外部同源臂中的任意位置。在另一些优选的实施方案中,第一标签插入在5’外部同源臂中的任意位置,且所述第二标签插入在5'内含子片段的3’末端区域中的任意位置。在更优选的实施方式中,第一标签插入在5’外部同源臂中除5’末端第1个核苷酸残基以外的任意位 置,且所述第二标签插入在3'外部同源臂的3’端下游(例如紧邻3'外部同源臂的3’端最后一个核苷酸残基的位置)。在另一些更优选的实施方案中,第一标签插入在5’外部同源臂中除5’末端第1个核苷酸残基以外的任意位置,且第二标签插入在3'外部同源臂中的任意位置。在另一些更优选的实施方案中,第一标签插入在5’外部同源臂中除5’末端第1个核苷酸残基以外的任意位置,且第二标签插入在5’内含子片段的3’末端区域中的任意位置。In a preferred embodiment, the first tag is inserted at any position in the 5' external homology arm, and the second tag is inserted downstream of the 3' end of the 3' external homology arm (for example, the position immediately adjacent to the last nucleotide residue at the 3' end of the 3' external homology arm). In other preferred embodiments, the first tag is inserted at any position in the 5' external homology arm, and the second tag is inserted at any position in the 3' external homology arm. In other preferred embodiments, the first tag is inserted at any position in the 5' external homology arm, and the second tag is inserted at any position in the 3' terminal region of the 5' intron fragment. In a more preferred embodiment, the first tag is inserted at any position in the 5' external homology arm except the first nucleotide residue at the 5' end. In some preferred embodiments, the first tag is inserted at any position in the 5' external homology arm except the first nucleotide residue at the 5' end, and the second tag is inserted at any position in the 3' external homology arm. In some preferred embodiments, the first tag is inserted at any position in the 5' external homology arm except the first nucleotide residue at the 5' end, and the second tag is inserted at any position in the 3' external homology arm. In some preferred embodiments, the first tag is inserted at any position in the 5' external homology arm except the first nucleotide residue at the 5' end, and the second tag is inserted at any position in the 3' terminal region of the 5' intron fragment.

在一些实施方式中,第一标签插入在5’外部同源臂中自5’端首位核苷酸残基起、按5’至3’方向的第1、2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19或20位残基位置处。在优选的实施方式中,第一标签插入在5’外部同源臂中自5’端首位核苷酸残基起、第2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19或20位残基位置处。在更优选的实施方式中,第一标签插入在5’外部同源臂中自5’端首位核苷酸残基起、第2、3、4、5、6、7、8、9、10或11位残基位置处。In some embodiments, the first tag is inserted at the 5' outer homology arm from the first nucleotide residue at the 5' end, in the 5' to 3' direction, at the 1st, 2nd, 3rd, 4th, 5th, 6th, 7th, 8th, 9th, 10th, 11th, 12th, 13th, 14th, 15th, 16th, 17th, 18th, 19th, or 20th residue position. In a preferred embodiment, the first tag is inserted at the 5' outer homology arm from the first nucleotide residue at the 5' end, at the 2nd, 3rd, 4th, 5th, 6th, 7th, 8th, 9th, 10th, 11th, 12th, 13th, 14th, 15th, 16th, 17th, 18th, 19th, or 20th residue position. In a more preferred embodiment, the first tag is inserted at the 5' outer homology arm from the first nucleotide residue at the 5' end, at the 2nd, 3rd, 4th, 5th, 6th, 7th, 8th, 9th, 10th, or 11th residue position.

示例性地,术语“插入至第n位”或“插入第n位残基位置处”,表示所描述的插入序列的5’最末端和3’最末端的核苷酸残基分别与插入前位于第n位和第n+1位残基形成共价连接。例如“插入第1位”表示插入的序列位于原第1位和原第2位残基之间。Illustratively, the term "inserted at position n" or "inserted at residue position n" means that the 5'-most and 3'-most nucleotide residues of the described insertion sequence are covalently linked to the residues at positions n and n+1, respectively, before insertion. For example, "inserted at position 1" means that the inserted sequence is located between the original residues at positions 1 and 2.

在一些实施方式中,第二标签插入在3'外部同源臂中自3’端最后一位核苷酸残基起、按3’至5’方向的第1、2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19或20位残基位置处。在另一些更优选的实施方案中,第二标签插入在3'外部同源臂中自3’端最后一位核苷酸残基起、按3’至5’方向的第1、2、3、4、5、6、7、8、9、10或11位残基位置处。在一些实施方式中,第二标签插入在5'内含子片段中自3’端最后一位核苷酸残基起、按3’至5’方向的第1、2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19或20位残基位置处。在另一些更优选的实施方案中,第二标签插入在5'内含子片段中自3’端最后一位核苷酸残基起、按3’至5’方向的第1、2、3、4、5、6、7、8、9、10或11位残基位置处。In some embodiments, the second tag is inserted in the 3' outer homology arm from the last nucleotide residue at the 3' end, in the 3' to 5' direction at the 1st, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19 or 20th residue position. In other more preferred embodiments, the second tag is inserted in the 3' outer homology arm from the last nucleotide residue at the 3' end, in the 3' to 5' direction at the 1st, 2, 3, 4, 5, 6, 7, 8, 9, 10 or 11th residue position. In some embodiments, the second tag is inserted in the 5' intron fragment from the last nucleotide residue at the 3' end, in the 3' to 5' direction at the 1st, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19 or 20th residue position. In other more preferred embodiments, the second tag is inserted into the 5' intron fragment at the 1st, 2nd, 3rd, 4th, 5th, 6th, 7th, 8th, 9th, 10th or 11th residue position in the 3' to 5' direction starting from the last nucleotide residue at the 3' end.

在一些实施方案中,环化区域按5’至3’方向依次以彼此可操作地连接的方式包含3’编码区片段、翻译起始元件、5’编码区片段。在另一些实施方案中,环化区域按5’至3’方向依次以彼此可操作地连接的方式包含3’外显子片段、5’内部同源臂、插入片段、3’内部同源臂和5’外显子片段。在任选的实施方案中,环化区域在所述插入片段与所述5’内部同源臂之间包含第一间隔区,在所述插入片段与所述3’内部同源臂之间包含第二间隔区。在另一些实施方案中,插入片段包含翻译起始元件,或者包含翻译起始元件和编码区。在一些实施方案中,翻译起始元件为IRES序列。在优选的实施方案中,IRES序列可以选自但不限于以下的IRES序列:Taura综合征病毒、吸血猎蝽病毒、泰累尔氏脑脊髓炎病毒、猿猴病毒40、红火蚁病毒1、禾谷缢管蚜病毒、网状内皮增生症病毒、福曼脊髓灰质炎病毒1、大豆尺蠖病毒、克什米尔蜂病毒、人鼻病毒2、琉璃叶蝉病毒-1、人免疫缺陷病毒1型、琉璃叶蝉病毒-1、虱P病毒、丙型肝炎病毒、甲型肝炎病毒、GB型肝炎病毒、口蹄疫病毒、人肠道病毒71、马鼻病毒、茶尺蠖 样病毒、脑心肌炎病毒(EMCV)、果蝇C病毒、十字花科烟草病毒、蟋蟀麻痹病毒、牛病毒性腹泻病毒1、黑皇后细胞病毒、蚜虫致死麻痹病毒、禽脑脊髓炎病毒、急性蜂麻痹病毒、芙蓉枯黄环斑病毒、猪瘟病毒、人类FGF2、人类SFTPA1、人类AMLl/RUNXl、果蝇触角、人类AQP4、人类AT1R、人类BAG-1、人类BCL2、人类BiP、人类c-IAPl、人类c-myc、人类eIF4G、小鼠NDST4L、人类LEF1、小鼠HIF1α、人类n.myc、小鼠Gtx、人类p27kipl、人类PDGF2/c-sis、人类p53、人类Pim-1、小鼠Rbm3、果蝇reaper、犬Scamper、果蝇Ubx、人类UNR、小鼠UtrA、人类VEGF-A、人类XIAP、果蝇hairless、酿酒酵母TFIID、酿酒酵母YAP1、人类c-src、人类FGF-1、猿猴小核糖核酸病毒、芜菁绉缩病病毒、eIF4G的适体、柯萨奇病毒B3(CVB3)或柯萨奇病毒A(CVB1/2)。在一些实施方案中,IRES序列为野生型IRES序列或经过修饰的IRES序列。在某些实施方式中,IRES序列的长度为约50个核苷酸。In some embodiments, the circularization region comprises a 3' coding region fragment, a translation initiation element, and a 5' coding region fragment in a manner that is operably connected to each other in the 5' to 3' direction. In other embodiments, the circularization region comprises a 3' exon fragment, a 5' internal homology arm, an insert fragment, a 3' internal homology arm, and a 5' exon fragment in a manner that is operably connected to each other in the 5' to 3' direction. In optional embodiments, the circularization region comprises a first spacer between the insert fragment and the 5' internal homology arm, and a second spacer between the insert fragment and the 3' internal homology arm. In other embodiments, the insert fragment comprises a translation initiation element, or comprises a translation initiation element and a coding region. In some embodiments, the translation initiation element is an IRES sequence. In a preferred embodiment, the IRES sequence can be selected from, but not limited to, the following IRES sequences: Taura syndrome virus, blood-sucking assassin bug virus, Theile's encephalomyelitis virus, simian virus 40, fire ant virus 1, cereal aphid virus, reticuloendotheliosis virus, Forman polio virus 1, soybean looper virus, Kashmir bee virus, human rhinovirus 2, glass leafhopper virus-1, human immunodeficiency virus type 1, glass leafhopper virus-1, lice P virus, hepatitis C virus, hepatitis A virus, GB hepatitis virus, foot-and-mouth disease virus, human enterovirus 71, equine rhinovirus, tea looper virus Like virus, encephalomyocarditis virus (EMCV), Drosophila C virus, crucifer tobacco virus, cricket paralysis virus, bovine viral diarrhea virus 1, black queen cell virus, aphid lethal paralysis virus, avian encephalomyelitis virus, acute bee paralysis virus, hibiscus yellow ringspot virus, swine fever virus, human FGF2, human SFTPA1, human AML1/RUNX1, Drosophila antennae, human AQP4, human AT1R, human BAG-1, human BCL2, human BiP, human c-IAPl, human c-myc, human eIF4G, mouse NDST4L, human LEF1, mouse HIF1α, Human n.myc, mouse Gtx, human p27kipl, human PDGF2/c-sis, human p53, human Pim-1, mouse Rbm3, Drosophila reaper, canine Scamper, Drosophila Ubx, human UNR, mouse UtrA, human VEGF-A, human XIAP, Drosophila hairless, Saccharomyces cerevisiae TFIID, Saccharomyces cerevisiae YAP1, human c-src, human FGF-1, simian picornavirus, turnip shrunken virus, aptamer of eIF4G, coxsackievirus B3 (CVB3) or coxsackievirus A (CVB1/2). In some embodiments, the IRES sequence is a wild-type IRES sequence or a modified IRES sequence. In certain embodiments, the IRES sequence is about 50 nucleotides in length.

在一些实施方案中,插入片段包含结构基因或其功能性片段的编码序列或者非编码RNA的序列或其互补序列,所述结构基因选自多肽、蛋白质亚单位、蛋白质活性中心、蛋白质或非天然催化基团的蛋白质杂合体、经重组的蛋白质活性亚单位或活性中心/、经重组的人工酶或其他主要由氨基酸构成的生物学效应单元,所述非编码RNA选自微小RNA(miRNA)、小干扰RNA(siRNA)、PIWI蛋白相互作用RNA(piRNA)、转运RNA衍生小RNA(tsRNA)、核小RNA(snRNA)、核仁小RNA(snoRNA)、长链非编码RNA(lncRNA)、假基因、ceRNA(competing endogenous RNAs)、microRNA海绵或其他种类非mRNA的RNA。In some embodiments, the inserted fragment comprises the coding sequence of a structural gene or its functional fragment or the sequence of a non-coding RNA or its complementary sequence, wherein the structural gene is selected from a polypeptide, a protein subunit, a protein active center, a protein or a protein hybrid of a non-natural catalytic group, a recombinant protein active subunit or active center/, a recombinant artificial enzyme or other biological effector units mainly composed of amino acids, and the non-coding RNA is selected from microRNA (miRNA), small interfering RNA (siRNA), PIWI protein-interacting RNA (piRNA), transfer RNA-derived small RNA (tsRNA), small nuclear RNA (snRNA), small nucleolar RNA (snoRNA), long non-coding RNA (lncRNA), pseudogene, ceRNA (competing endogenous RNAs), microRNA sponge or other types of non-mRNA RNA.

在任选的实施方案中,5’和3’外部同源臂的长度各自独立地为大于5nt、大于10nt、大于15nt、大于20nt、大于25nt、大于30nt、大于40nt、大于50nt、大于60nt、小于5nt、小于10nt、小于15nt、小于20nt、小于25nt、小于30nt、小于40nt、小于50nt、小于60nt、5~60nt、10~55nt、15~50nt、20~45nt、25~40nt、30~35nt、10nt、15nt、20nt、25nt、30nt、35nt、40nt、45nt、或50nt。在任选的实施方案中,5’和3’内含子片段的长度各自独立地为大于5nt、大于10nt、大于15nt、大于20nt、大于25nt、大于30nt、大于40nt、大于50nt、大于60nt、小于5nt、小于10nt、小于15nt、小于20nt、小于25nt、小于30nt、小于40nt、小于50nt、小于60nt、小于70nt、小于80nt、小于90nt、小于100nt、小于150nt、小于200nt、5~200nt、10~150nt、50~200nt、50~150nt、5~60nt、10~55nt、15~50nt、20~45nt、25~40nt、30~35nt、10nt、15nt、20nt、25nt、30nt、35nt、40nt、45nt、或50nt。在任选的实施方案中,5’和3’外显子片段的长度各自独立地为大于5nt、大于10nt、大于15nt、大于20nt、大于25nt、大于30nt、大于40nt、大于50nt、大于60nt、小于5nt、小于10nt、小于15nt、小于20nt、小于25nt、小于30nt、小于40nt、小于50nt、小于60nt、5~60nt、10~55nt、15~50nt、20~45nt、25~40nt、30~35nt、10nt、15nt、20nt、25nt、30nt、35nt、40nt、45nt、或50nt。在任选的实施方案中,5’和3’内部同源臂的长度各自独立地为大于5nt、大于10nt、大于15nt、大于20nt、大于25nt、大于30nt、大于40nt、大于50nt、大于60nt、小于5nt、小于10nt、小于15nt、小于20nt、小于25nt、小于30nt、小于40nt、 小于50nt、小于60nt、5~60nt、10~55nt、15~50nt、20~45nt、25~40nt、30~35nt、10nt、15nt、20nt、25nt、30nt、35nt、40nt、45nt、或50nt。在任选的实施方案中,插入片段的长度为大于50nt、大于100nt、大于150nt、大于200nt、大于250nt、大于300nt、大于400nt、大于500nt、大于600nt、大于1k nt、大于1.5k nt、大于2k nt、大于3k nt、小于50nt、小于100nt、小于150nt、小于200nt、小于250nt、小于300nt、小于400nt、小于500nt、小于600nt、小于600nt、小于1k nt、小于1.5k nt、小于2k nt、小于3k nt、50~5k nt、50~5k nt、50~4k nt、50~3k nt、50~2k nt、50~1.5k nt、50~1k nt、50~600nt、100~550nt、150~500nt、200~450nt、250~400nt、300~350nt。In an optional embodiment, the length of the 5' and 3' outer homology arms is each independently greater than 5 nt, greater than 10 nt, greater than 15 nt, greater than 20 nt, greater than 25 nt, greater than 30 nt, greater than 40 nt, greater than 50 nt, greater than 60 nt, less than 5 nt, less than 10 nt, less than 15 nt, less than 20 nt, less than 25 nt, less than 30 nt, less than 40 nt, less than 50 nt, less than 60 nt, 5-60 nt, 10-55 nt, 15-50 nt, 20-45 nt, 25-40 nt, 30-35 nt, 10 nt, 15 nt, 20 nt, 25 nt, 30 nt, 35 nt, 40 nt, 45 nt, or 50 nt. In an optional embodiment, the length of the 5' and 3' intron fragments is each independently greater than 5 nt, greater than 10 nt, greater than 15 nt, greater than 20 nt, greater than 25 nt, greater than 30 nt, greater than 40 nt, greater than 50 nt, greater than 60 nt, less than 5 nt, less than 10 nt, less than 15 nt, less than 20 nt, less than 25 nt, less than 30 nt, less than 40 nt, less than 50 nt, less than 60 nt, less than 70 nt, less than 80 nt, less than 90 nt, less than 100 nt, less than 150 nt, less than 20 nt, less than 25 nt, less than 30 nt, less than 40 nt, less than 50 nt, less than 60 nt, 0nt, less than 90nt, less than 100nt, less than 150nt, less than 200nt, 5-200nt, 10-150nt, 50-200nt, 50-150nt, 5-60nt, 10-55nt, 15-50nt, 20-45nt, 25-40nt, 30-35nt, 10nt, 15nt, 20nt, 25nt, 30nt, 35nt, 40nt, 45nt, or 50nt. In an optional embodiment, the length of the 5' and 3' exonic fragments is each independently greater than 5 nt, greater than 10 nt, greater than 15 nt, greater than 20 nt, greater than 25 nt, greater than 30 nt, greater than 40 nt, greater than 50 nt, greater than 60 nt, less than 5 nt, less than 10 nt, less than 15 nt, less than 20 nt, less than 25 nt, less than 30 nt, less than 40 nt, less than 50 nt, less than 60 nt, 5-60 nt, 10-55 nt, 15-50 nt, 20-45 nt, 25-40 nt, 30-35 nt, 10 nt, 15 nt, 20 nt, 25 nt, 30 nt, 35 nt, 40 nt, 45 nt, or 50 nt. In optional embodiments, the lengths of the 5' and 3' internal homology arms are each independently greater than 5 nt, greater than 10 nt, greater than 15 nt, greater than 20 nt, greater than 25 nt, greater than 30 nt, greater than 40 nt, greater than 50 nt, greater than 60 nt, less than 5 nt, less than 10 nt, less than 15 nt, less than 20 nt, less than 25 nt, less than 30 nt, less than 40 nt, Less than 50nt, less than 60nt, 5-60nt, 10-55nt, 15-50nt, 20-45nt, 25-40nt, 30-35nt, 10nt, 15nt, 20nt, 25nt, 30nt, 35nt, 40nt, 45nt, or 50nt. In an optional embodiment, the length of the insert is greater than 50 nt, greater than 100 nt, greater than 150 nt, greater than 200 nt, greater than 250 nt, greater than 300 nt, greater than 400 nt, greater than 500 nt, greater than 600 nt, greater than 1k nt, greater than 1.5k nt, greater than 2k nt, greater than 3k nt, less than 50 nt, less than 100 nt, less than 150 nt, less than 200 nt, less than 250 nt, less than 300 nt, less than 400 nt, less than 500 nt, less than 600 nt, less than 600 nt, less than 1k nt, less than 1.5k nt, less than 2k nt, less than 3k nt, 50-5k nt, 50-5k nt, 50-4k nt, 50-3k nt, 50-2k nt, 50-1.5k nt, 50-1k nt, 50~600nt, 100~550nt, 150~500nt, 200~450nt, 250~400nt, 300~350nt.

在优选的实施方案中,5’外部同源臂具有如SEQ ID NO:1或56所示的序列,3’外部同源臂具有如SEQ ID NO:2或57所示的序列。在优选的实施方案中,3’内含子片段和5’内含子片段均来自I型内含子,优选来自蓝藻鱼腥藻属pre-tRNA基因或T4噬菌体Td基因。在更优选的实施方案中,3’内含子片段具有如SEQ ID NO:3所示的序列,5’内含子片段具有如SEQ ID NO:4所示的序列。在优选的实施方案中,3’内含子片段和所述5’内含子片段均来自II型内含子,优选来自梭菌属(Clostridium)如破伤风梭菌(Clostridium tetani)的II型内含子,或者芽孢杆菌属(Bacillus)如苏云金芽孢杆菌(Bacillus thuringiensis)的II型内含子。在更优选的实施方案中,3’内含子片段和所述5’内含子片段来自SEQ ID NO:5或6所示核苷酸序列中包含的II型内含子。在优选的实施方案中,3’外显子片段和所述5’外显子片段分别来自天然外显子的3’末端区域和5末端区域,优选来自蓝藻鱼腥藻属pre-tRNA基因或T4噬菌体Td基因。在更优选的实施方案中,3’外显子片段具有如SEQ ID NO:7所示的序列,5’外显子片段具有如SEQ ID NO:8所示的序列。在优选的实施方案中,5’内部同源臂具有如SEQ ID NO:9所示的序列,3’内部同源臂具有如SEQ ID NO:10所示的序列。在一些实施方案中,所述第一间隔区与第二间隔区相同或不同。In a preferred embodiment, the 5' external homology arm has a sequence as shown in SEQ ID NO: 1 or 56, and the 3' external homology arm has a sequence as shown in SEQ ID NO: 2 or 57. In a preferred embodiment, the 3' intron fragment and the 5' intron fragment are both from type I introns, preferably from the cyanobacterium Anabaena pre-tRNA gene or the T4 phage Td gene. In a more preferred embodiment, the 3' intron fragment has a sequence as shown in SEQ ID NO: 3, and the 5' intron fragment has a sequence as shown in SEQ ID NO: 4. In a preferred embodiment, the 3' intron fragment and the 5' intron fragment are both from type II introns, preferably from type II introns of Clostridium such as Clostridium tetani, or type II introns of Bacillus such as Bacillus thuringiensis. In a more preferred embodiment, the 3' intron fragment and the 5' intron fragment are from the type II intron contained in the nucleotide sequence shown in SEQ ID NO:5 or 6. In a preferred embodiment, the 3' exon fragment and the 5' exon fragment are from the 3' terminal region and the 5' terminal region of the natural exon, respectively, preferably from the cyanobacterium Anabaena pre-tRNA gene or the T4 phage Td gene. In a more preferred embodiment, the 3' exon fragment has a sequence as shown in SEQ ID NO:7, and the 5' exon fragment has a sequence as shown in SEQ ID NO:8. In a preferred embodiment, the 5' internal homology arm has a sequence as shown in SEQ ID NO:9, and the 3' internal homology arm has a sequence as shown in SEQ ID NO:10. In some embodiments, the first spacer is the same as or different from the second spacer.

在一些实施方案中,本发明方法中使用的亲和层析介质上可操作地连接有能够特异性结合所述第一标签和/或第二标签的亲和配基。在优选的实施方案中,亲和配基选自由多聚X1、多聚X1-X2、多聚X1-X2-X3、多聚X1-X2-X3-X4组成的群组,其中X1、X2、X3、X4分别独立地为A、G、C、T、U中的任一者。在更优选的实施方案中,亲和配基选自由Oligo dT、Oligo dC、Oligo dG、Oligo dU组成的群组中的任一者。In some embodiments, the affinity chromatography medium used in the method of the present invention is operably linked to an affinity ligand capable of specifically binding to the first tag and/or the second tag. In a preferred embodiment, the affinity ligand is selected from the group consisting of poly X1, poly X1 - X2 , poly X1 - X2 - X3 , poly X1- X2 - X3 - X4 , wherein X1 , X2 , X3 , X4 are independently any one of A, G, C, T, U. In a more preferred embodiment, the affinity ligand is selected from any one of the group consisting of Oligo dT, Oligo dC, Oligo dG, Oligo dU.

在一些实施方案中,本发明方法中使用的亲和层析介质选自由磁珠、葡聚糖分子、聚丙烯氨酰胺大分子、大分子的纤维素分子、壳聚糖材料、改性的聚乳酸材料、PET材料、无机硅酸盐材料或其他高分子聚合物组成的群组中的任一者。In some embodiments, the affinity chromatography medium used in the method of the present invention is selected from any one of the group consisting of magnetic beads, dextran molecules, polyacrylamide macromolecules, macromolecular cellulose molecules, chitosan materials, modified polylactic acid materials, PET materials, inorganic silicate materials or other high molecular polymers.

在一些实施方案中,本发明的方法不包括向包含环状RNA的混合物中添加RNA酶R以去除线状RNA的步骤。In some embodiments, the methods of the present invention do not include the step of adding RNase R to the mixture comprising circular RNA to remove linear RNA.

在另一些实施方案中,当本发明的方法包括向包含环状RNA的混合物中添加RNA酶R 以去除线状RNA的步骤时,所添加的RNA酶R的量降低至用于纯化未插入所述poly(A)标签或其功能性变体的情况下所用酶量的30~50%,优选40%或50%。In other embodiments, when the method of the present invention comprises adding RNase R to a mixture comprising circular RNA In the step of removing linear RNA, the amount of RNase R added is reduced to 30-50%, preferably 40% or 50%, of the amount used for purification without the poly(A) tag or its functional variant.

在另一个方面,本发明提供了使用本发明的制备方法制备得到的环状RNA,该环状RNA基本不含非环状的RNA分子。在优选的实施方案中,使用本发明的制备方法制备得到的环状RNA具有大于90%、大于91%、大于92%、大于93%、大于94%、大于95%、大于96%、大于97%、大于98%、大于99%、大于99.5%、大于99.9%或更高的纯度。在另一些优选的实施方案中,使用本发明方法制备得到的环状RNA具有90%以上、91%以上、92%以上、93%以上、94%以上、95%以上、96%以上、97%以上、98%以上、99%以上、99.5%以上、99.9%以上或更高的纯度。在另一些优选的实施方案中,使用本发明方法制备得到的环状RNA中的非理想的核酸(例如,没有完全环化仍保留核酶等元件的各种中间复合体等)的含量不多于RNA总量的10%、9%、8%、7%、6%、5%、4%、3%、2%、1%、0.5%、0.1%或更低。In another aspect, the present invention provides a circular RNA prepared using the preparation method of the present invention, which substantially does not contain non-circular RNA molecules. In a preferred embodiment, the circular RNA prepared using the preparation method of the present invention has a purity of greater than 90%, greater than 91%, greater than 92%, greater than 93%, greater than 94%, greater than 95%, greater than 96%, greater than 97%, greater than 98%, greater than 99%, greater than 99.5%, greater than 99.9% or higher. In other preferred embodiments, the circular RNA prepared using the method of the present invention has a purity of more than 90%, more than 91%, more than 92%, more than 93%, more than 94%, more than 95%, more than 96%, more than 97%, more than 98%, more than 99%, more than 99.5%, more than 99.9% or more. In other preferred embodiments, the content of non-ideal nucleic acids (e.g., various intermediate complexes that are not completely cyclized but still retain elements such as ribozymes, etc.) in the circular RNA prepared using the method of the present invention is no more than 10%, 9%, 8%, 7%, 6%, 5%, 4%, 3%, 2%, 1%, 0.5%, 0.1% or less of the total RNA.

在另一个方面,本发明提供了用于本发明方法中的线状RNA前体。In another aspect, the present invention provides a linear RNA precursor for use in the methods of the present invention.

在另一个方面,本发明提供了核酸序列,其能够在适合转录的条件下,转录产生本发明的线状RNA前体。在优选的实施方案中,本发明的核酸序列还以可操作地连接的方式包含转录产生所述线状RNA前体所必需的调控序列,包括但不限于启动子、终止子、转录因子结合位点、非编译区(UTR)、增强子、回文序列、顺式作用元件、反式作用元件、TATA框、CAAT框、操纵子、转座子。In another aspect, the present invention provides a nucleic acid sequence that can be transcribed to produce the linear RNA precursor of the present invention under conditions suitable for transcription. In a preferred embodiment, the nucleic acid sequence of the present invention also comprises a regulatory sequence necessary for transcription to produce the linear RNA precursor in an operably linked manner, including but not limited to a promoter, a terminator, a transcription factor binding site, an untranslated region (UTR), an enhancer, a palindrome, a cis-acting element, a trans-acting element, a TATA box, a CAAT box, an operator, and a transposon.

在另一个方面,本发明提供了包含本发明的核酸序列的载体。在一些实施方案中,本发明的载体为线状DNA、质粒、病毒核酸片段或细胞基因组DNA片段。In another aspect, the present invention provides a vector comprising the nucleic acid sequence of the present invention. In some embodiments, the vector of the present invention is a linear DNA, a plasmid, a viral nucleic acid fragment or a cell genomic DNA fragment.

在另一个方面,本发明提供了工程化细胞,其包含本发明的线状RNA前体、环状RNA、核酸序列、或者载体。In another aspect, the present invention provides an engineered cell comprising the linear RNA precursor, circular RNA, nucleic acid sequence, or vector of the present invention.

在另一个方面,本发明提供了组合物,其包含本发明的线状RNA前体、环状RNA、核酸序列、载体、或者工程化细胞。In another aspect, the present invention provides a composition comprising the linear RNA precursor, circular RNA, nucleic acid sequence, vector, or engineered cell of the present invention.

在另一个方面,本发明还提供了使用本发明的线状RNA前体、核酸序列、载体、或者工程化细胞来制备环状RNA的用途。In another aspect, the present invention also provides the use of the linear RNA precursor, nucleic acid sequence, vector, or engineered cell of the present invention to prepare circular RNA.

在另一个方面,本发明还提供了本发明的线状RNA前体、环状RNA、核酸序列、载体、或者工程化细胞用于制备药物、细胞毒性剂或免疫调节制剂中的用途。在一些实施方案中,所述治疗性药物、细胞毒性剂或免疫调节制剂选自病毒、多能或专能干细胞、iPS、工程化免疫细胞、抗体或抗体片段、偶联药物的抗体或抗体片段、化疗剂、免疫抑制或调节剂、抗感染药物、抗癌剂、降糖药物、心脑血管系统疾病治疗药物、退行性神经疾病药物、肥胖症治疗药物、血液学疾病治疗药物、呼吸系统疾病治疗医药、或逆转录病毒疾病治疗药物。In another aspect, the present invention also provides the use of the linear RNA precursor, circular RNA, nucleic acid sequence, vector, or engineered cell of the present invention for preparing a drug, a cytotoxic agent, or an immunomodulatory preparation. In some embodiments, the therapeutic drug, cytotoxic agent, or immunomodulatory preparation is selected from viruses, pluripotent or multipotent stem cells, iPS, engineered immune cells, antibodies or antibody fragments, antibodies or antibody fragments conjugated to drugs, chemotherapeutic agents, immunosuppressive or modulatory agents, anti-infective drugs, anticancer agents, hypoglycemic drugs, cardiovascular and cerebrovascular disease therapeutic drugs, degenerative neurological disease drugs, obesity therapeutic drugs, hematological disease therapeutic drugs, respiratory disease therapeutic drugs, or retroviral disease therapeutic drugs.

在另一个方面,本发明还提供了施用环状RNA的方法,其包括向有需要的生物体施用有效量的、本发明的环状RNA,或者施用有效量的、使用本发明的核酸序列、载体、工程化细 胞或组合物制备得到的环状RNA。In another aspect, the present invention also provides a method for administering circular RNA, which comprises administering an effective amount of the circular RNA of the present invention to an organism in need thereof, or administering an effective amount of the circular RNA of the present invention using the nucleic acid sequence, vector, engineered cell or the like. The circular RNA is prepared from cells or compositions.

附图说明BRIEF DESCRIPTION OF THE DRAWINGS

图1示意性地示出了根据本申请的一种用于制备环状RNA的线状RNA前体的核酸序列,其中,在5’至3’方向上顺序地包含以下元件:5’外部同源臂、3’内含子片段、3’外显子片段、5’内部同源臂、第一间隔区、miRNA结合区(micro RNA binding sites)、第二间隔区、3’内部同源臂、5’外显子片段、5’内含子片段、3’外部同源臂,其中以垂直箭头示意性地示出了两个自剪接位点的所在位置;FIG1 schematically shows a nucleic acid sequence of a linear RNA precursor for preparing circular RNA according to the present application, wherein the following elements are sequentially included in the 5′ to 3′ direction: a 5′ external homology arm, a 3′ intron fragment, a 3′ exon fragment, a 5′ internal homology arm, a first spacer, a miRNA binding region (micro RNA binding sites), a second spacer, a 3′ internal homology arm, a 5′ exon fragment, a 5′ intron fragment, and a 3′ external homology arm, wherein the locations of two self-splicing sites are schematically shown by vertical arrows;

图2示出了根据本申请的一种用于制备环状RNA的线状RNA前体的示意图;FIG2 shows a schematic diagram of a linear RNA precursor for preparing circular RNA according to the present application;

图3示出了根据本申请的另一种用于制备环状RNA的线状RNA前体的示意图;FIG3 shows a schematic diagram of another linear RNA precursor for preparing circular RNA according to the present application;

图4示出了根据本申请的另一种用于制备环状RNA的线状RNA前体的示意图;FIG4 shows a schematic diagram of another linear RNA precursor for preparing circular RNA according to the present application;

图5A和5B示出了部分根据本申请的线状RNA前体的环化反应产物及其经过RNase R酶切处理后的产物的凝胶电泳图。图中,M表示Riboruler低分子量RNA参照物;CK表示未插入poly(A)的对比例1;Figures 5A and 5B show gel electrophoresis images of the cyclization reaction products of the linear RNA precursor according to some embodiments of the present invention and the products after RNase R digestion. In the figure, M represents the Riboruler low molecular weight RNA reference; CK represents the comparative example 1 without poly (A) insertion;

图6示出了根据本申请中制备例6~24的线状RNA前体的环化反应产物的相对IVT产量。图中,各个制备例的IVT产量用相对于对照IVT产量的倍数表示并进行One-way ANOVA统计学分析。****表示P<0.0001;FIG6 shows the relative IVT yields of the cyclization reaction products of the linear RNA precursors according to Preparation Examples 6 to 24 of the present application. In the figure, the IVT yield of each preparation example is expressed as a multiple of the control IVT yield and subjected to One-way ANOVA statistical analysis. **** indicates P<0.0001;

图7示出了根据本申请中制备例25~29的线状RNA前体的环化反应产物的相对IVT产量。图中,各个制备例的IVT产量用相对于对照IVT产量的倍数表示并进行One-way ANOVA统计学分析。***表示P<0.001,****表示P<0.0001;FIG7 shows the relative IVT yields of the cyclization reaction products of the linear RNA precursors according to Preparation Examples 25 to 29 of the present application. In the figure, the IVT yield of each preparation example is expressed as a multiple of the control IVT yield and subjected to One-way ANOVA statistical analysis. *** indicates P < 0.001, **** indicates P < 0.0001;

图8A~8D示出了根据本申请的线状RNA前体的环化反应产物在经过RNase R酶切处理之前和之后的HPLC图。Figures 8A to 8D show HPLC graphs of the cyclization reaction products of the linear RNA precursor according to the present application before and after RNase R enzymatic cleavage treatment.

具体实施方式DETAILED DESCRIPTION

定义definition

除非本文另外定义,否则与本公开关联使用的科学和技术术语将具有由本领域普通技术人员通常所理解的含义。术语的含义和范围应是明晰的,然而,在存在任何潜在歧义的情况下,本文提供的定义优先于任何词典或外部定义。Unless otherwise defined herein, scientific and technical terms used in connection with the present disclosure will have the meanings commonly understood by those of ordinary skill in the art. The meaning and scope of the terms should be clear, however, in the event of any potential ambiguity, the definitions provided herein take precedence over any dictionary or external definitions.

如本文所用,术语“包括”或“包含”意指序列、组合物及方法包含所记载的组件或步骤,但不排除其它组件或步骤。“主要由…组成”,当用在定义序列、组合物及方法时,应意味排除任何对其应当实现的技术效果而言明显为重要的其它组件或其他步骤。“由…所组成”应意味排除未提及的其它组件及步骤。As used herein, the term "comprising" or "including" means that the sequence, composition and method include the described components or steps, but do not exclude other components or steps. "Mainly composed of...", when used to define sequences, compositions and methods, should mean excluding any other components or other steps that are obviously important for the technical effect it should achieve. "Composed of..." should mean excluding other components and steps not mentioned.

如本文所用,术语“线状RNA前体”或“线状前体”可互换使用,是指本身并未共价闭环,但可以在环化过程中产生环状RNA的RNA前体,其一般是由模板DNA转录形 成的,但不限于此。在本文中,线状RNA前体包含尚未成环的完整环状RNA序列,以及使所述RNA序列环化所需的自剪接序列(例如自剪接位点、内含子核酶片段和同源臂等)。这些自剪接序列在环化过程中从线状RNA前体中除去,产生环状RNA和作为副产物的线状RNA片段。在一些实施方案中,本发明的线状RNA前体可用于通过在镁离子和鸟苷核苷酸或核苷存在下、在RNA环化发生的温度(例如介于20℃与60℃之间)下孵育来产生环状RNA。As used herein, the term "linear RNA precursor" or "linear precursor" is used interchangeably and refers to an RNA precursor that is not covalently closed in a circle but can generate circular RNA during the circularization process, which is generally transcribed from a template DNA. In this article, the linear RNA precursor comprises a complete circular RNA sequence that has not yet been circularized, and a self-splicing sequence (e.g., a self-splicing site, an intron ribozyme fragment, and a homology arm, etc.) required for the circularization of the RNA sequence. These self-splicing sequences are removed from the linear RNA precursor during the circularization process to produce circular RNA and linear RNA fragments as by-products. In some embodiments, the linear RNA precursor of the present invention can be used to produce circular RNA by incubation in the presence of magnesium ions and guanosine nucleotides or nucleosides at a temperature (e.g., between 20° C. and 60° C.) at which RNA circularization occurs.

在优选的实施方案中,线状RNA前体中包括如下元件:5’外部同源臂、第一标签、3’内含子片段、3’自剪接位点、3’外显子片段、5’内部同源臂、插入片段、3’内部同源臂、5’外显子片段、5’自剪接位点、5’内含子片段、第二标签和3’外部同源臂。在一些实施方案中,位于插入片段的5’上游的3’内含子片段、3’自剪接位点、3’外显子片段,和位于插入片段的3’下游的5’外显子片段、5’自剪接位点、5’内含子片段在由线状RNA前体剪接形成环状RNA的过程中发挥关键作用。例如,利用3’内含子片段的核酶特性,使连接3’内含子片段与3’外显子片段的3’自剪接位点在GTP引发下断裂,断裂位点又可以进一步引发连接5’内含子片段与5’外显子片段的5’自剪接位点断裂,使5’外显子片段与3’外显子片段在自剪接位点处连接形成环状RNA。而外部同源臂和内部同源臂的成对存在能够提高上述环化反应中的自剪接效率和环化效率。In a preferred embodiment, the linear RNA precursor includes the following elements: 5' external homology arm, first tag, 3' intron fragment, 3' self-splicing site, 3' exon fragment, 5' internal homology arm, inserted fragment, 3' internal homology arm, 5' exon fragment, 5' self-splicing site, 5' intron fragment, second tag and 3' external homology arm. In some embodiments, the 3' intron fragment, 3' self-splicing site, 3' exon fragment located 5' upstream of the inserted fragment, and the 5' exon fragment, 5' self-splicing site, 5' intron fragment located 3' downstream of the inserted fragment play a key role in the process of forming circular RNA by splicing of linear RNA precursor. For example, by utilizing the ribozyme properties of the 3' intron fragment, the 3' self-splicing site connecting the 3' intron fragment and the 3' exon fragment is broken under GTP initiation, and the break site can further trigger the break of the 5' self-splicing site connecting the 5' intron fragment and the 5' exon fragment, so that the 5' exon fragment and the 3' exon fragment are connected at the self-splicing site to form a circular RNA. The presence of the external homology arm and the internal homology arm in pairs can improve the self-splicing efficiency and the cyclization efficiency in the above-mentioned cyclization reaction.

本发明的线状RNA前体中的核苷酸可以是未修饰的天然核苷酸,也可以是部分修饰或完全修饰的非天然核苷酸。在一些实施方案中,本发明的线状RNA前体仅包含天然存在的核苷酸。在另一些实施方案中,本发明的线状RNA前体包含一种或多种能够增加稳定性的修饰,例如2'-O-甲基、氟或O-甲氧基乙基的缀合物、硫代磷酸酯骨架或2',4'-环状2'-O-乙基修饰(Holdt等人,Front Physiol.,9:1262(2018);Krutzfeldt等人,Nature,438(7068):685-9(2005);Crooke等人,Cell Metab 27(4):714-739(2018)),和/或一种或多种能够降低环状RNA分子在宿主中的先天免疫原性的修饰,例如至少一种N6-甲基腺苷(m6A)The nucleotides in the linear RNA precursor of the present invention may be unmodified natural nucleotides or partially or completely modified non-natural nucleotides. In some embodiments, the linear RNA precursor of the present invention comprises only naturally occurring nucleotides. In other embodiments, the linear RNA precursor of the present invention comprises one or more modifications that can increase stability, such as 2'-O-methyl, fluorine or O-methoxyethyl conjugates, thiophosphate backbones or 2',4'-cyclic 2'-O-ethyl modifications (Holdt et al., Front Physiol., 9: 1262 (2018); Krutzfeldt et al., Nature, 438 (7068): 685-9 (2005); Crooke et al., Cell Metab 27 (4): 714-739 (2018)), and/or one or more modifications that can reduce the innate immunogenicity of the circular RNA molecule in the host, such as at least one N6-methyladenosine (m 6 A)

如本文所用,术语“线状RNA片段”是指线状RNA前体经过环化反应后,除了由其所产生的目的环状RNA分子以外的、所有源自该线状RNA前体的RNA产物,包括但不限于:成环反应后被剪切掉的两侧自剪接序列,和成环反应过程中产生的各种剪接中间体序列。由于线状RNA前体分子可能并未均发生成环反应,因此,“线状RNA片段”也可以包括残留在成环反应产物中的线状RNA前体序列。此外,“线状RNA片段”的长度并不必然小于其所来源的线状RNA前体,因为该术语还涵盖了由多个相同或不同的上述序列聚合产生的线状及环状高分子量聚合RNA。在本文中,术语“线状RNA片段”与成环反应所产生的“副产物”或“杂质”具有实质相同的含义。 As used herein, the term "linear RNA fragment" refers to all RNA products derived from the linear RNA precursor after the linear RNA precursor undergoes a cyclization reaction, except for the target circular RNA molecule produced thereby, including but not limited to: the self-splicing sequences on both sides that are sheared off after the cyclization reaction, and the various splicing intermediate sequences produced during the cyclization reaction. Since the linear RNA precursor molecules may not all undergo a cyclization reaction, the "linear RNA fragment" may also include the linear RNA precursor sequence remaining in the cyclization reaction product. In addition, the length of the "linear RNA fragment" is not necessarily less than the linear RNA precursor from which it is derived, because the term also covers linear and circular high molecular weight polymerized RNA produced by polymerization of multiple identical or different sequences mentioned above. In this article, the term "linear RNA fragment" has substantially the same meaning as the "byproduct" or "impurity" produced by the cyclization reaction.

如本文所用,术语“环状RNA”或“circRNA”可互换使用,指通过共价键形成闭合环状结构的多聚核糖核苷酸。已报道环状RNA是3-5'共价闭合的RNA环,不具有5’端Cap和3’端poly(A)尾巴,因缺乏核酸外切酶介导的降解所必需的游离末端,使其具有对抗RNA酶降解的性质,与普通的线状RNA产物(例如,成熟mRNA)相比具有更长的寿命或半衰期。环状RNA可以通过剪接过程产生,环化主要在注释的外显子边界处使用常规剪接位点发生(Starke等人,2015;Szabo等人,2015)。在优选的实施方案中,这样的环状RNA是单链RNA分子。除非另外明确指出,本文中的环状RNA分子可以具有本领域已知的任何适合环状RNA的结构,但不含与存在于环状RNA前体中的第一标签和第二标签相同的序列。As used herein, the terms "circular RNA" or "circRNA" are used interchangeably and refer to polyribonucleotides that form a closed circular structure through covalent bonds. It has been reported that circular RNA is a 3-5' covalently closed RNA ring without a 5' end cap and a 3' end poly (A) tail. Due to the lack of free ends required for exonuclease-mediated degradation, it has the property of resisting RNA enzyme degradation and has a longer life or half-life than ordinary linear RNA products (e.g., mature mRNA). Circular RNA can be produced by a splicing process, and cyclization mainly occurs at the annotated exon boundaries using conventional splicing sites (Starke et al., 2015; Szabo et al., 2015). In a preferred embodiment, such a circular RNA is a single-stranded RNA molecule. Unless otherwise explicitly stated, the circular RNA molecule herein can have any structure suitable for circular RNA known in the art, but does not contain the same sequence as the first tag and the second tag present in the circular RNA precursor.

RNA nicking是指RNA链发生单点断裂的现象。在环状RNA制备过程中,反应体系中的金属离子(尤其是Mg2+)的存在导致环状RNA发生随机断裂,从而产生nicked RNA,其分子量与环状RNA相同且易被RNA酶消化。RNA nicking refers to the phenomenon of single-point breakage of the RNA chain. During the preparation of circular RNA, the presence of metal ions (especially Mg2+) in the reaction system causes random breakage of circular RNA, thereby generating nicked RNA, which has the same molecular weight as circular RNA and is easily digested by RNase.

如本文所用,术语“内含子片段”是指例如与天然I型或II型内含子核酶(Ribozyme或intronribozyme)或其主要活性片段具有75%或更高相似性的序列,或者小核酶(例如,卫星RNA,主要包括锤头状(hammerhead)核酶、发夹状(hairpin)核酶、D型肝炎病毒(HDV)RNA、Varkud卫星(VS)核酶和GlmS核开关)。例如,示例性的I型内含子核酶可以是鱼腥藻的天然内含子自剪切核酶序列(其在GenBank数据库收录登录号为(GenBank:AY768517),序列总长度313个碱基),或者该序列的主要活性片段(例如,如CN115786374A中SEQ ID NO.2所述的序列,序列总长度为246个碱基),或者对以上核酶进行各种碱基替换、截断的等同物。示例性的II型核酶可以来源于酵母线粒体DNA(如在以下参考文献中所述:Zimmerly等,Mob DNA,2015年)、乳酸乳球菌(Lactococcus lactis)的Ll.LtrB内含子、嗜热聚球藻Thermosynechococcus elongatus的TeI3c/4c二型内含子(如在以下参考文献中所述:Monat等,PloS One,2020年;Costa等,Sscience,2016年)、梭菌属(Clostridium)如破伤风梭菌(Clostridium tetani)的II型内含子、芽孢杆菌属(Bacillus)如苏云金芽孢杆菌(Bacillus thuringiensis)的II型内含子、或商业化的核酶工具Targetron。如本文所用,术语“3’内含子片段”表示与鱼腥藻核酶的3’端区域具有75%或更高相似性的序列,术语“5’内含子片段”表示与鱼腥藻核酶的5’端区域具有75%或更高相似性的序列,只要两个内含子片段可以在RNA的空间结构上彼此靠近、形成具有完整核酶活性的复合体即可(如在以下参考文献中所述:Wesselhoeft等,Nature Communication,2018年)。本领域技术人可以确定天然内含子核酶中能够作为“3’内含子片段”和“5’内含子片段”的序列。As used herein, the term "intron fragment" refers to, for example, a sequence having 75% or higher similarity to a natural type I or type II intron ribozyme (Ribozyme or intronribozyme) or its main active fragment, or a small ribozyme (e.g., satellite RNA, mainly including hammerhead ribozymes, hairpin ribozymes, hepatitis D virus (HDV) RNA, Varkud satellite (VS) ribozyme, and GlmS riboswitch). For example, an exemplary type I intron ribozyme can be a natural intron self-cleaving ribozyme sequence of Anabaena (whose accession number is included in the GenBank database (GenBank: AY768517), with a total sequence length of 313 bases), or a main active fragment of the sequence (e.g., a sequence as described in SEQ ID NO.2 in CN115786374A, with a total sequence length of 246 bases), or an equivalent of the above ribozyme with various base substitutions or truncations. Exemplary type II ribozymes can be derived from yeast mitochondrial DNA (as described in the following references: Zimmerly et al., Mob DNA, 2015), the Ll.LtrB intron of Lactococcus lactis, the TeI3c/4c type II intron of Thermosynechococcus elongatus (as described in the following references: Monat et al., PloS One, 2020; Costa et al., Sscience, 2016), type II introns of Clostridium such as Clostridium tetani, type II introns of Bacillus such as Bacillus thuringiensis, or the commercial ribozyme tool Targetron. As used herein, the term "3' intron fragment" refers to a sequence having 75% or higher similarity to the 3' terminal region of the Anabaena ribozyme, and the term "5' intron fragment" refers to a sequence having 75% or higher similarity to the 5' terminal region of the Anabaena ribozyme, as long as the two intron fragments can be close to each other in the spatial structure of the RNA to form a complex with complete ribozyme activity (as described in the following reference: Wesselhoeft et al., Nature Communication, 2018). A person skilled in the art can determine the sequences that can serve as the "3' intron fragment" and the "5' intron fragment" in the natural intron ribozyme.

如本文所用,术语“外显子片段”可以指示核酶的内含子-外显子(PIE)系统中,被内含子核酶识别并剪接的外显子或内含子核酶识别的信号序列的片段,例如,“3'外显子片段”可 以表示与鱼腥藻核酶所识别并被剪切的外显子3’端区域具有75%或更高相似性的序列,“5’外显子片段”可以表示与鱼腥藻核酶所识别并被剪切的外显子5’端区域具有75%或更高相似性的序列,只要两个外显子片段构成方向逆转、可被核酶识别并发生自剪接即可(如在以下参考文献中所述:Wesselhoeft等,Nature Communication,2018年)。本领域技术人可以确定天然基因外显子中能够作为“3’外显子片段”和“5’外显子片段”的序列。As used herein, the term "exon fragment" may refer to an exon recognized and spliced by an intron ribozyme in the intron-exon (PIE) system of a ribozyme, or a fragment of a signal sequence recognized by an intron ribozyme. For example, a "3' exon fragment" may refer to a fragment of a signal sequence recognized by an intron ribozyme. To represent a sequence having 75% or higher similarity with the 3' end region of the exon recognized and cleaved by the Anabaena ribozyme, a "5' exon fragment" can represent a sequence having 75% or higher similarity with the 5' end region of the exon recognized and cleaved by the Anabaena ribozyme, as long as the two exon fragments are reversed in direction, can be recognized by the ribozyme and undergo self-splicing (as described in the following reference: Wesselhoeft et al., Nature Communication, 2018). A person skilled in the art can determine the sequences in the exons of natural genes that can serve as "3' exon fragments" and "5' exon fragments".

如本文所用,术语“自剪接位点”或“剪接位点”是指在RNA成环反应期间,发生裂解并产生供成环的游离末端的二核苷酸之间所在的位置。As used herein, the term "self-splicing site" or "splice site" refers to the position between dinucleotides where cleavage occurs during the RNA circularization reaction and generates free ends for circularization.

如本文所用,术语“环化区域”是指在通过夹板介导法、内含子-外显子置换法(permuted intron-exon method)、RNA连接酶介导法或其他方法,将处于线性的RNA在体外制备成环化RNA或者在体内形成环状RNA之后,被包含在形成的环状RNA而不包含在剪切下来的RNA的区域,与RNA环化方式、环化中间体的结构和功能无关。As used herein, the term "circularization region" refers to a region that is included in the formed circular RNA but not in the sheared RNA after linear RNA is prepared into circular RNA in vitro or circular RNA is formed in vivo by splint-mediated method, permuted intron-exon method, RNA ligase-mediated method or other methods, and is irrelevant to the RNA circularization method, the structure and function of the circularization intermediate.

如本文所用,术语“标签”是指可以用于与亲和层析基质上互补的寡核苷酸(可以称为“亲和配基”)进行结合的序列,从而使含有标签的分子特异性吸附于亲和层析基质,与不含所述标签的分子分离。组成标签的核苷酸可以是天然存在的核苷酸,也可以是经过人为修饰的非天然核苷酸。As used herein, the term "tag" refers to a sequence that can be used to bind to a complementary oligonucleotide (which can be referred to as an "affinity ligand") on an affinity chromatography matrix, so that molecules containing the tag are specifically adsorbed to the affinity chromatography matrix and separated from molecules that do not contain the tag. The nucleotides constituting the tag can be naturally occurring nucleotides or artificially modified non-natural nucleotides.

需要说明的是,此处及在本文文本中所用的表述“第一”、“第二”或其类似表述,仅旨在将两个所述元素在特定范畴上进行区分,并不指示所述元素的重要性、顺序等;“第一”、“第二”的元素可以是相同或不同的所指物/概念。例如,在某些情况下,“第一标签”和“第二标签”可以表示相同的标签(例如,polyA)或不同的标签(例如,polyA或其功能性变体)。It should be noted that the expressions "first", "second" or similar expressions used here and in the text of this article are only intended to distinguish the two elements in a specific category, and do not indicate the importance, order, etc. of the elements; the "first" and "second" elements can be the same or different referents/concepts. For example, in some cases, the "first tag" and the "second tag" can represent the same tag (e.g., polyA) or different tags (e.g., polyA or its functional variants).

如本文所用,术语“poly(A)标签”具有本领域普通技术人员识别和理解的含义,例如指由多聚腺嘌呤核苷酸组成的序列,其通常位于线性信使核糖核酸分子中的3’末端区域附近或3’末端(通常也称为poly(A)尾部)。一般认为,3’末端的poly(A)序列在线性mRNA分子中通常具有保护mRNA免受3'末端降解,并在帽依赖性蛋白质翻译中发挥重要作用。专利文献4还报道在环状RNA中的poly(A)序列具有降低其免疫原性的作用。本发明的poly(A)标签通常由约20至多达约100个腺嘌呤核苷酸组成,优选30~90个,更优选40~80个、进一步优选45~70个,最优选50~65个连续的腺嘌呤核苷酸组成。在一些实施方案中,本发明的poly(A)标签由20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40、41、42、43、44、45、46、47、48、49、50、51、52、53、54、55、56、57、58、59、60、61、62、63、64、65、66、67、68、69、70、71、72、73、74、75、76、77、78、79、80、81、82、83、84、85、86、87、88、89、90、91、92、93、94、95、96、97、98、99、100个连续的腺嘌呤核苷酸组成。As used herein, the term "poly (A) tag" has a meaning recognized and understood by a person of ordinary skill in the art, for example, it refers to a sequence composed of polyadenine nucleotides, which is generally located near the 3' terminal region or the 3' end (also commonly referred to as the poly (A) tail) in a linear messenger RNA molecule. It is generally believed that the poly (A) sequence at the 3' end generally protects the mRNA from 3' end degradation in a linear mRNA molecule and plays an important role in cap-dependent protein translation. Patent document 4 also reports that the poly (A) sequence in circular RNA has the effect of reducing its immunogenicity. The poly (A) tag of the present invention is generally composed of about 20 to up to about 100 adenine nucleotides, preferably 30 to 90, more preferably 40 to 80, further preferably 45 to 70, and most preferably 50 to 65 consecutive adenine nucleotides. In some embodiments, the poly (A) tag of the invention is composed of 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58 , 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100 consecutive adenine nucleotides.

如本文所用,术语“Poly(A)标签”还可以是一类包含至少一个非A碱基的poly(A),包括但不限于PCT申请PCT/CN2023/079037、中国专利申请CN112805386A、美国专利 US10717982B2中所描述的各种poly(A)序列。上述申请或专利文本在此全部并入本文。这样的poly(A)序列在本文中也被称为“poly(A)的功能性变体”,意指上述由连续的腺嘌呤核苷酸组成的序列虽然被个别非A核苷酸中断,但仍然具有与连续腺嘌呤核苷酸组成的序列基本相同的功能活性(如对3'末端稳定性和/或翻译活性的影响)的序列。As used herein, the term "Poly(A) tag" may also refer to a class of poly(A) tags comprising at least one non-A base, including but not limited to PCT application PCT/CN2023/079037, Chinese patent application CN112805386A, U.S. patent application Various poly (A) sequences described in US10717982B2. The above application or patent text is hereby incorporated in its entirety. Such poly (A) sequences are also referred to herein as "functional variants of poly (A)", meaning that the above sequence consisting of continuous adenine nucleotides is interrupted by individual non-A nucleotides, but still has substantially the same functional activity (such as the effect on 3' terminal stability and/or translation activity) as the sequence consisting of continuous adenine nucleotides.

具体而言,本发明中的“poly(A)的功能性变体”包括如下不同的情况:Specifically, the "functional variant of poly(A)" in the present invention includes the following different situations:

(1)包含唯一的元件a、至少一个元件b,以及至少一个元件c的功能性变体,(1) a functional variant comprising only one element a, at least one element b, and at least one element c,

(2)包含唯一的元件a,至少一个元件b,以及至少一个元件d的功能性变体;或者(2) a functional variant comprising only element a, at least one element b, and at least one element d; or

(3)包含唯一的元件a,至少一个元件b,以及至少一个元件c和至少一个元件d的功能性变体,(3) a functional variant comprising only one element a, at least one element b, and at least one element c and at least one element d,

其中,元件a由多个连续的腺嘌呤(A)核苷酸组成,长度范围为≥20nt;元件b由多个连续的A核苷酸组成,在一些实施方案中所述元件b的长度范围为3nt≤b<20nt;元件c由一个非A的核苷酸组成,所述核苷酸选自胸腺嘧啶(T)、胞嘧啶(C)、鸟嘌呤(G)核苷酸;元件d由任意两个或更多个连续的核苷酸组成,所述核苷酸选自A、T、C、G核苷酸,其中元件d的5’及3’末端的核苷酸不为A核苷酸,并且元件d不包含3个以上连续的A核苷酸,所述元件d的长度范围为2nt≤d≤20nt;元件e由一个或两个连续的A组成,且当其存在时,其位于且仅可位于Poly(A)编码序列的3’末端,且与元件d或元件c相邻。Among them, element a is composed of multiple consecutive adenine (A) nucleotides, and the length range is ≥20nt; element b is composed of multiple consecutive A nucleotides, and in some embodiments, the length range of element b is 3nt≤b<20nt; element c is composed of a non-A nucleotide, and the nucleotide is selected from thymine (T), cytosine (C), and guanine (G) nucleotides; element d is composed of any two or more consecutive nucleotides, and the nucleotides are selected from A, T, C, and G nucleotides, wherein the nucleotides at the 5' and 3' ends of element d are not A nucleotides, and element d does not contain more than 3 consecutive A nucleotides, and the length range of element d is 2nt≤d≤20nt; element e is composed of one or two consecutive A, and when it exists, it is located and can only be located at the 3' end of the Poly (A) coding sequence, and is adjacent to element d or element c.

当一条Poly(A)中同时包含两个或多个“元件b”、“元件c”、及“元件d”时,每两个元件b的序列可以相同或不同、每两个元件c的序列可以相同或不同,以及每两个元件d的序列可以相同或不同,只要他们各自均符合前述对元件a、b、c和d的定义即可。When a Poly(A) contains two or more "element b", "element c", and "element d" at the same time, the sequences of every two elements b can be the same or different, the sequences of every two elements c can be the same or different, and the sequences of every two elements d can be the same or different, as long as they each meet the above definitions of elements a, b, c and d.

在一些实施方案中,poly(A)标签中的元件a和元件b不相邻、元件c和元件d不相邻,元件b之间彼此不相邻、元件c之间彼此不相邻、且元件d之间彼此不相邻。In some embodiments, element a and element b in the poly(A) tag are not adjacent, element c and element d are not adjacent, elements b are not adjacent to each other, elements c are not adjacent to each other, and elements d are not adjacent to each other.

在一些实施方案中,Poly(A)标签还进一步包含唯一一个元件e,所述元件e由一个或两个连续的A组成,且其位于所述Poly(A)标签的3’末端,且与元件d或元件c相邻。In some embodiments, the Poly(A) tag further comprises a unique element e, which consists of one or two consecutive A's and is located at the 3' end of the Poly(A) tag and adjacent to element d or element c.

在一些实施方案中,Poly(A)及其功能性变体可以是一段RNA,也可以是DNA与RNA的杂合分子。In some embodiments, Poly(A) and its functional variants can be a segment of RNA or a hybrid molecule of DNA and RNA.

在一些实施方案中,Poly(A)标签的序列结构选自:In some embodiments, the sequence structure of the Poly(A) tag is selected from:

元件a-元件c-元件b-元件c-元件b-元件c-元件b-元件c-元件b;Element a-element c-element b-element c-element b-element c-element b-element c-element b;

元件b-元件c-元件b-元件c-元件a-元件d-元件b-元件c-元件b-元件c-元件b;Element b-element c-element b-element c-element a-element d-element b-element c-element b-element c-element b;

元件b-元件c-元件b-元件c-元件b-元件d-元件a-元件c;Element b-element c-element b-element c-element b-element d-element a-element c;

元件a-元件d-元件b-元件c-元件b-元件c-元件b;或element a-element d-element b-element c-element b-element c-element b; or

元件b-元件c-元件b-元件c-元件b-元件d-元件a。Element b-element c-element b-element c-element b-element d-element a.

如本文所用,术语“结合”是指核苷酸分子碱基之间通过分子间相互作用力(例如,氢键)而可逆地互补配对,或者通过化学键等互补配对,从而在特定条件或反应下使得核酸分子之 间、核酸与带有核酸的基质材料(例如,有机高分子材料或磁珠)之间、或者带有核酸的基质材料之间达到一定亲和力。例如,一定长度(例如,50nt)polyA的寡聚单链RNA或包含该寡聚polyA的长单链RNA可以通过碱基互补配对的方式在一定条件“结合”带有oligo dT的葡聚糖基质,并且可选地可以通过改变环境条件(如变温、强离子条件或强酸碱条件)使polyA与oligo dT之间的氢键打开,从而使该“结合”可逆、进一步使该寡聚单链RNA或包含该寡聚polyA的长单链RNA从葡聚糖基质上脱离下来;此外,可选地,可以通过一定条件,使得polyA与oligo dT之间形成不可逆的化学键,从而使该RNA与葡聚糖基质不可逆地结合。As used herein, the term "binding" refers to the reversible complementary pairing between nucleotide bases through intermolecular interaction forces (e.g., hydrogen bonds), or complementary pairing through chemical bonds, so that nucleic acid molecules can be reversibly paired under specific conditions or reactions. A certain affinity can be achieved between nucleic acids and matrix materials (e.g., organic polymer materials or magnetic beads) carrying nucleic acids, or between matrix materials carrying nucleic acids. For example, a certain length (e.g., 50 nt) of polyA oligosingle-stranded RNA or a long single-stranded RNA containing the oligosingle-stranded polyA can "bind" to a dextran matrix carrying oligo dT under certain conditions by means of complementary base pairing, and optionally, the hydrogen bond between polyA and oligo dT can be opened by changing environmental conditions (e.g., temperature change, strong ion conditions, or strong acid-base conditions), thereby making the "binding" reversible, and further making the oligosingle-stranded RNA or the long single-stranded RNA containing the oligosingle-stranded polyA detached from the dextran matrix; in addition, optionally, an irreversible chemical bond can be formed between polyA and oligo dT under certain conditions, thereby making the RNA irreversibly bound to the dextran matrix.

如本文所用,术语“亲和层析介质”是指带有与纯化标签互补的寡聚核苷酸、亲和层析基质的材料。可以根据特定的需要,设计理想的寡聚核苷酸,例如,在标签为poly(A)的情况下,介质上的理想寡聚核苷酸可以为oligo dT。亲和层析基质可以选用与核苷酸没有反应或与核苷酸分子惰性的稳定材料,其分子表面与核苷酸分子之间没有明显的相互排斥或相互吸引的分子间作用力,例如,基质可以是葡聚糖分子、聚丙烯氨酰胺大分子、大分子的纤维素分子、壳聚糖材料、改性的聚乳酸材料、PET材料、无机硅酸盐材料、具有涂覆层的金属材料等。通常地,寡聚核苷酸和亲和层析基质之间是通过共价键交联的。此外,介质上的寡聚核苷酸也可以替换成其他分子,只要这种分子与核苷酸具有倾向性的分子间作用力。例如,在标签为U、C和A中的至少一者组成的寡聚核苷酸的情况下,可以将次黄嘌呤交联到基质上形成合意的亲和层析介质。As used herein, the term "affinity chromatography medium" refers to materials with oligonucleotides complementary to purification tags and affinity chromatography matrices. Ideal oligonucleotides can be designed according to specific needs. For example, in the case where the tag is poly(A), the ideal oligonucleotide on the medium can be oligo dT. Affinity chromatography matrices can be selected from stable materials that are unreactive with nucleotides or inert with nucleotide molecules, and there is no obvious mutual repulsion or mutual attraction between the molecular surface and the nucleotide molecules. For example, the matrix can be dextran molecules, polyacrylamide macromolecules, macromolecular cellulose molecules, chitosan materials, modified polylactic acid materials, PET materials, inorganic silicate materials, metal materials with coating layers, etc. Generally, the oligonucleotides and affinity chromatography matrices are cross-linked by covalent bonds. In addition, the oligonucleotides on the medium can also be replaced by other molecules, as long as such molecules have a preferred intermolecular force with the nucleotides. For example, in the case where the tag is an oligonucleotide composed of at least one of U, C and A, hypoxanthine can be cross-linked to the matrix to form a desired affinity chromatography medium.

如本文所用,术语“分离”是指将环状RNA产物与线状RNA前体、线状RNA片段、各种自剪接过程形成的中间复合体等区分开,从而简便地得到纯化的效果。用于纯化目标多核苷酸和多肽的技术在本领域中是熟知的,并且包括例如离子交换色谱法、亲和色谱法以及根据密度进行沉降。通常,当物质相对于样品的其他组分以大于其天然存在的量存在于样品中时,则所述物质是纯化的。在本文优选的实施方式中,通过亲和层析或亲和吸附分离的环状RNA具有大于90%、大于91%、大于92%、大于93%、大于94%、大于95%、大于96%、大于97%、大于98%、大于99%、大于99.5%、大于99.9%或更高的纯度,或者具有90%以上、91%以上、92%以上、93%以上、94%以上、95%以上、96%以上、97%以上、98%以上、99%以上、99.5%以上、99.9%以上或更高的纯度,其中非理想的核酸(例如,没有完全环化仍保留核酶等元件的各种中间复合体等)在其中的含量不多于10%、9%、8%、7%、6%、5%、4%、3%、2%、1%、0.5%、0.1%或更低的量。As used herein, the term "separation" refers to the separation of circular RNA products from linear RNA precursors, linear RNA fragments, various intermediate complexes formed by self-splicing processes, etc., thereby simply obtaining the effect of purification. Techniques for purifying target polynucleotides and polypeptides are well known in the art and include, for example, ion exchange chromatography, affinity chromatography, and sedimentation according to density. Generally, a substance is purified when it is present in a sample in an amount greater than its naturally occurring amount relative to other components of the sample. In a preferred embodiment herein, the circular RNA separated by affinity chromatography or affinity adsorption has a purity of greater than 90%, greater than 91%, greater than 92%, greater than 93%, greater than 94%, greater than 95%, greater than 96%, greater than 97%, greater than 98%, greater than 99%, greater than 99.5%, greater than 99.9% or higher, or has a purity of 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, 99.5%, 99.9% or higher, wherein the content of non-ideal nucleic acids (e.g., various intermediate complexes that are not completely circularized and still retain elements such as ribozymes, etc.) therein is no more than 10%, 9%, 8%, 7%, 6%, 5%, 4%, 3%, 2%, 1%, 0.5%, 0.1% or less.

线状核酸分子被称为具有“5'-末端”(5'端)和“3'-末端”(3'端),因为核酸磷酸二酯键存在于取代单核苷酸的糖部分的5'碳和3'碳处。在一些例子中,核酸分子的5’末端核苷酸是能以其核糖5'碳形成新的磷酸二酯键的核苷酸,相应地,3’末端核苷酸是能以其核糖3'碳形成新的磷酸二酯键的核苷酸。Linear nucleic acid molecules are said to have a "5'-terminus" (5' end) and a "3'-terminus" (3' end) because nucleic acid phosphodiester bonds are present at the 5' carbon and 3' carbon of the sugar portion of the substituted mononucleotide. In some examples, the 5' terminal nucleotide of the nucleic acid molecule is a nucleotide that can form a new phosphodiester bond with its ribose 5' carbon, and correspondingly, the 3' terminal nucleotide is a nucleotide that can form a new phosphodiester bond with its ribose 3' carbon.

如本文所用,术语“5’端上游”是指特定核酸片段在5’至3’方向上之前的、不在该片段内 部的位置,术语“3’端下游”是指特定核酸片段在5’至3’方向上之后的、不在该片段内部的位置,例如,在一段1kb的核酸序列中,位于第100bp至199bp的一段特定核酸片段的“5’端上游”可以包括第0bp至第99bp的任意位置,其“3’端下游”可以包括第200bp至第1000bp的任意位置。基于类似的理念,术语“5’末端区域”是指特定核酸片段内部自其5’末端核苷酸起、在5’至3’方向上的若干个连续核苷酸残基,术语“3’末端区域”是指特定核酸片段内部自其3’末端核苷酸起、在3’至5’方向上的若干个连续核苷酸残基。本领域技术人员可以比照完整核酸分子容易地确定该特定片段的5’和3’末端核苷酸。例如,在一段1kb的核酸序列中,位于第100bp至199bp的一段特定核酸片段的“5’末端区域”可以表示第100bp至第119bp、第100bp至第114bp、第100bp至第109bp,其“3’末端区域”可以表示第170bp至第199bp、第175bp至第199bp、第180bp至第199bp。需要特别说明的是,此处的解释中列举特定的核酸长度和位置是示例性的,在不违背本公开的概念和精神的前提下,这些术语可以做各种补充、扩展、扩大、替换、同义、等同的解释。As used herein, the term "5'upstream" refers to the portion of a nucleic acid fragment that precedes the fragment in the 5' to 3' direction and is not within the fragment. The term "3' end downstream" refers to the position of a specific nucleic acid fragment in the 5' to 3' direction, but not in the fragment. For example, in a 1kb nucleic acid sequence, the "5' end upstream" of a specific nucleic acid fragment located at the 100bp to 199bp may include any position from the 0bp to the 99bp, and its "3' end downstream" may include any position from the 200bp to the 1000bp. Based on a similar concept, the term "5' terminal region" refers to a number of consecutive nucleotide residues in the 5' to 3' direction from the 5' terminal nucleotide of a specific nucleic acid fragment, and the term "3' terminal region" refers to a number of consecutive nucleotide residues in the 3' to 5' direction from the 3' terminal nucleotide of a specific nucleic acid fragment. Those skilled in the art can easily determine the 5' and 3' terminal nucleotides of the specific fragment by comparing with the complete nucleic acid molecule. For example, in a 1kb nucleic acid sequence, the "5' terminal region" of a specific nucleic acid fragment located at 100bp to 199bp can represent 100bp to 119bp, 100bp to 114bp, 100bp to 109bp, and its "3' terminal region" can represent 170bp to 199bp, 175bp to 199bp, 180bp to 199bp. It should be noted that the specific nucleic acid lengths and positions listed in the explanations here are exemplary, and these terms can be supplemented, extended, expanded, replaced, synonymous, and equivalent without violating the concept and spirit of the present disclosure.

如本文所用,术语“结构基因”是指可以编码各种多肽或蛋白质的基因,“结构基因的功能性片段”可以表示断裂的结构基因的全部或部分地可以最终被表达成多肽的片段,作为结构基因的功能性片段的非限制性示例,可举出例如蛋白质亚单位、蛋白质活性中心、蛋白质或非天然催化基团的蛋白质杂合体、经重组的蛋白质活性亚单位或活性中心、经重组的人工酶或其他主要由氨基酸构成的生物学效应单元等;“非编码RNA”可以指不编码多肽或蛋白质产物的RNA,包括rRNA、tRNA、snRNA、snoRNA、microRNA、micronRNA海绵(microRNA sponge)、miRNA、lncRNA、circRNA、piRNA等多种已知功能的RNA,还包括未知功能的RNA。As used herein, the term "structural gene" refers to a gene that can encode various polypeptides or proteins, and "functional fragments of structural genes" may refer to fragments of broken structural genes that can ultimately be expressed as polypeptides in whole or in part. Non-limiting examples of functional fragments of structural genes include protein subunits, protein active centers, protein hybrids of proteins or non-natural catalytic groups, recombinant protein active subunits or active centers, recombinant artificial enzymes or other biological effect units mainly composed of amino acids, etc.; "non-coding RNA" may refer to RNA that does not encode polypeptide or protein products, including rRNA, tRNA, snRNA, snoRNA, microRNA, micronRNA sponge, miRNA, lncRNA, circRNA, piRNA and other RNAs with known functions, and also include RNAs with unknown functions.

如本文所用,术语“间隔区”是指一段功能序列(例如,启动子或核糖体进入序列)与另一段功能序列(例如,结构基因或其片段,或者非编码RNA或转录出非编码RNA的DNA)之间的一段核酸序列,用于将两段功能序列的结合因子在空间上形成一定距离,或者用于避免两段功能序列相互影响其空间结构,或者用于将一段功能序列的结合因子与另一段功能序列本身的空间结构具有一定的空间距离,其不编码具有生物学效应的多肽、蛋白质或非编码RNA,或者转录产物或翻译产物至少不劣化多肽、蛋白质或非编码RNA的生物学活性。例如,在一些实施方式中,间隔区可以是特定长度(例如,9nt)的绕乱(scramble)的、单核苷酸重复的、多核苷酸重复的序列。As used herein, the term "spacer" refers to a nucleic acid sequence between a functional sequence (e.g., a promoter or a ribosome entry sequence) and another functional sequence (e.g., a structural gene or a fragment thereof, or a non-coding RNA or a DNA transcribed from a non-coding RNA), which is used to space the binding factors of the two functional sequences at a certain distance, or to prevent the two functional sequences from affecting each other's spatial structure, or to space the binding factors of a functional sequence at a certain distance from the spatial structure of the other functional sequence itself, which does not encode a polypeptide, protein or non-coding RNA with a biological effect, or the transcription product or translation product at least does not degrade the biological activity of the polypeptide, protein or non-coding RNA. For example, in some embodiments, the spacer can be a scrambled, mononucleotide repeated, or polynucleotide repeated sequence of a specific length (e.g., 9nt).

如本文所用,术语“RNA酶R”(Ribonuclease R,RNase R)是一种核糖核酸外切酶,来源于大肠杆菌RNR超家族,可以从3’至5’方向切割降解线状RNA分子,但基本不消化呈环形的RNA、套索结构或3’端突出末端缺少7nt的双联RNA分子,RNA酶R的具体示例可以参考以下参考文献:Cheng ZF,Biol Chem,2002年。本领域技术人员可以理解的是,术语“RNA酶R”可以包含基于RNA酶R人工改造、重组产生的具有同种酶活性的多肽、杂合酶、多肽类似物等。 As used herein, the term "RNase R" (Ribonuclease R, RNase R) is a ribonuclease exonuclease derived from the Escherichia coli RNR superfamily, which can cut and degrade linear RNA molecules from the 3' to 5' direction, but basically does not digest circular RNA, lasso structure or double-linked RNA molecules with 7nt missing at the 3' protruding end. For specific examples of RNase R, please refer to the following reference: Cheng ZF, Biol Chem, 2002. It can be understood by those skilled in the art that the term "RNase R" can include polypeptides, hybrid enzymes, polypeptide analogs, etc. with the same enzyme activity that are artificially modified or recombinantly produced based on RNase R.

如本文所用,术语“免疫原性”是指诱导对物质的免疫应答的潜力。当生物体的免疫系统或某种类型的免疫细胞暴露于免疫原性物质时,可诱导免疫应答。As used herein, the term "immunogenicity" refers to the potential to induce an immune response to a substance. When an organism's immune system or a certain type of immune cell is exposed to an immunogenic substance, an immune response may be induced.

如本文所用,术语“环化效率”是指所得环状多聚核糖核苷酸与其线状起始材料相比的测量值。As used herein, the term "cyclization efficiency" refers to a measure of the resulting circular polyribonucleotide compared to its linear starting material.

如本文所用,术语“翻译效率”是指从核糖核苷酸转录物产生蛋白质或肽的速率或量。在一些实施方案中,翻译效率可表示为每给定量的编码蛋白质或肽的转录物所产生的蛋白质或肽的量。As used herein, the term "translation efficiency" refers to the rate or amount of protein or peptide produced from ribonucleotide transcripts. In some embodiments, translation efficiency can be expressed as the amount of protein or peptide produced per a given amount of transcript encoding a protein or peptide.

术语“核苷酸”是指核糖核苷酸、脱氧核糖核苷酸、其修饰形式或其类似物。核苷酸包括物质,所述物质包括嘌呤(例如,腺嘌呤、次黄嘌呤、鸟嘌呤及其衍生物和类似物)以及嘧啶(例如,胞嘧啶、尿嘧啶、胸腺嘧啶及其衍生物和类似物)。核苷酸类似物包括在碱基、糖和/或磷酸的化学结构中具有经修饰的核苷酸,包括但不限于,5'-位置嘧啶修饰、8'-位置嘌呤修饰、胞嘧啶环外胺处的修饰和5-溴-尿嘧啶的取代;和2'-位置糖修饰,包括但不限于糖修饰的核糖核苷酸,其中2'-OH被诸如H、OR、R、卤代基、SH、SR、NH2、NHR、NR2或CN的基团取代,其中R是如本文所定义的烷基部分。核苷酸类似物还意在包括具有碱基,例如肌苷、辫苷、黄嘌呤;糖类,如2'-甲基核糖;非天然磷酸二酯键联,如甲基膦酸酯、硫代磷酸酯和肽键联的核苷酸。核苷酸类似物包括5-甲氧基尿苷、1-甲基假尿苷和6-甲基腺苷。The term "nucleotide" refers to a ribonucleotide, a deoxyribonucleotide, a modified form thereof, or an analog thereof. Nucleotides include substances including purines (e.g., adenine, hypoxanthine, guanine, and derivatives and analogs thereof) and pyrimidines (e.g., cytosine, uracil, thymine, and derivatives and analogs thereof). Nucleotide analogs include nucleotides having modifications in the chemical structure of bases, sugars, and/or phosphates, including but not limited to, 5'-position pyrimidine modifications, 8'-position purine modifications, modifications at the cytosine exocyclic amine, and substitutions of 5-bromo-uracil; and 2'-position sugar modifications, including but not limited to sugar-modified ribonucleotides, wherein 2'-OH is substituted by a group such as H, OR, R, halo, SH, SR, NH2, NHR, NR2, or CN, wherein R is an alkyl moiety as defined herein. Nucleotide analogs are also intended to include nucleotides having bases such as inosine, quercetin, xanthine; sugars such as 2'-methylribose; non-natural phosphodiester linkages such as methylphosphonate, phosphorothioate, and peptide linkages. Nucleotide analogs include 5-methoxyuridine, 1-methylpseudouridine, and 6-methyladenosine.

术语“核酸”和“多核苷酸”在本文中可互换使用以描述任何长度(例如大于约2个碱基、大于约10个碱基、大于约100个碱基、大于约500个碱基、大于1000个碱基或多达约10,000个或更多个碱基)、由核苷酸(例如脱氧核糖核苷酸或核糖核苷酸)组成的聚合物,并且可酶促或合成产生(例如,如美国专利第5,948,902号和其中引用的参考文献中所述),其可以类似于两种天然存在的核酸的序列特异性方式与天然存在的核酸杂交,例如可参与沃森-克里克(Watson-Crick)碱基配对相互作用。天然存在的核酸由核苷酸组成,所述核苷酸包括鸟嘌呤、胞嘧啶、腺嘌呤、胸腺嘧啶和尿嘧啶(分别为G、C、A、T和U)。The terms "nucleic acid" and "polynucleotide" are used interchangeably herein to describe a polymer of any length (e.g., greater than about 2 bases, greater than about 10 bases, greater than about 100 bases, greater than about 500 bases, greater than 1000 bases, or up to about 10,000 or more bases), composed of nucleotides (e.g., deoxyribonucleotides or ribonucleotides), and can be produced enzymatically or synthetically (e.g., as described in U.S. Pat. No. 5,948,902 and references cited therein), which can hybridize with naturally occurring nucleic acids in a sequence-specific manner similar to two naturally occurring nucleic acids, for example, can participate in Watson-Crick base pairing interactions. Naturally occurring nucleic acids are composed of nucleotides, including guanine, cytosine, adenine, thymine, and uracil (G, C, A, T, and U, respectively).

如本文所用术语“核糖核酸”和“RNA”意指由核糖核苷酸组成的聚合物。As used herein, the terms "ribonucleic acid" and "RNA" refer to a polymer composed of ribonucleotides.

如本文所用的术语“脱氧核糖核酸”和“DNA”意指由脱氧核糖核苷酸组成的聚合物。As used herein, the terms "deoxyribonucleic acid" and "DNA" refer to a polymer composed of deoxyribonucleotides.

如本文所用,当两个“同源臂”或“同源区”与彼此的反向互补序列具有足够水平的序列同一性以充当杂交反应的底物时,所述两个区域彼此互补或是互补的。如本文所用,当多核苷酸序列与反向互补序列或“互补”序列同一或共享序列同一性时,所述多核苷酸序列具有“同源性”。同源区域和对应同源区域的反向互补序列之间的序列同一性百分比可以是允许杂交发生的任何序列同一性百分比。在一些实施方案中,本发明多核苷酸的内部双链体形成区能够与另一个内部双链体形成区形成双链体并且不与外部双链体形成区形成双链体。As used herein, when two "homology arms" or "homologous regions" have a sufficient level of sequence identity with each other's reverse complementary sequences to serve as substrates for hybridization reactions, the two regions are complementary to each other or complementary. As used herein, a polynucleotide sequence has "homology" when it is identical or shares sequence identity with a reverse complementary sequence or "complementary" sequence. The percentage of sequence identity between a homologous region and the reverse complementary sequence of the corresponding homologous region can be any percentage of sequence identity that allows hybridization to occur. In some embodiments, the internal duplex forming region of the polynucleotide of the present invention is capable of forming a duplex with another internal duplex forming region and does not form a duplex with an external duplex forming region.

“转录”是指使用DNA分子作为模板,通过RNA聚合酶形成或合成RNA分子。本发明关于用于转录的RNA聚合酶没有限制。例如,在一些实施方案中,可使用T7型RNA聚合 酶。“翻译”是指基于RNA模板由核糖体形成多肽分子。"Transcription" refers to the formation or synthesis of RNA molecules by RNA polymerase using a DNA molecule as a template. The present invention is not limited to the RNA polymerase used for transcription. For example, in some embodiments, a T7-type RNA polymerase may be used. Enzymes. "Translation" refers to the formation of polypeptide molecules by ribosomes based on RNA templates.

应当理解,本文所用的术语仅出于描述特定实施方案的目的,而并非旨在进行限制。如在本说明书和所附权利要求中所使用,除非内容另外明确指明,否则单数形式“一个/种(a/an)”和“所述(the)”包括复数指代物。因此,例如,提及“一个细胞”包括两个或更多个细胞的组合,或细胞的整个培养物;提及“多核苷酸”实际上包括所述多核苷酸的许多拷贝。除非明确规定或从上下文显而易见,否则如本文所使用,术语“或”被理解为包括在内。除非本文中或和说明书其余部分下文中另有定义,否则本文使用的所有技术性和科学性术语具有与本发明所属领域的技术人员通常所理解的相同的含义。It should be understood that the terms used herein are for the purpose of describing specific embodiments only and are not intended to be limiting. As used in this specification and the appended claims, unless the content clearly indicates otherwise, the singular forms "a/an" and "the" include plural referents. Thus, for example, reference to "a cell" includes a combination of two or more cells, or the entire culture of cells; reference to "polynucleotides" actually includes many copies of the polynucleotides. Unless expressly provided or obvious from the context, as used herein, the term "or" is understood to be inclusive. Unless otherwise defined herein or and in the remainder of the specification below, all technical and scientific terms used herein have the same meaning as those commonly understood by those skilled in the art to which the invention belongs.

除非明确规定或从上下文显而易见,否则如本文所用,术语“约”应理解为在本领域的正常公差范围内,例如在平均值的2个标准偏差以内。“约”可理解为在所述值的10%、9%、8%、7%、6%、5%、4%、3%、2%、1%、0.9%、0.8%、0.7%、0.6%、0.5%、0.4%、0.3%、0.2%、0.1%、0.09%、0.08%、0.07%、0.06%、0.05%、0.04%、0.03%、0.02%或0.01%内。除非另外从上下文显而易见,否则本文提供的所有数值都由术语“约”修饰。Unless expressly provided or obvious from the context, as used herein, the term "about" should be understood to be within the normal tolerance range of the field, such as within 2 standard deviations of the mean value. "About" can be understood to be within 10%, 9%, 8%, 7%, 6%, 5%, 4%, 3%, 2%, 1%, 0.9%, 0.8%, 0.7%, 0.6%, 0.5%, 0.4%, 0.3%, 0.2%, 0.1%, 0.09%, 0.08%, 0.07%, 0.06%, 0.05%, 0.04%, 0.03%, 0.02% or 0.01% of the value. Unless otherwise obvious from the context, all numerical values provided herein are modified by the term "about".

如本文所用,术语“编码”泛指任何过程,其中聚合大分子中的信息用于指导与第一个分子不同的第二个分子的产生。第二分子可具有与第一分子的化学性质不同的化学结构。As used herein, the term "encoding" refers broadly to any process in which information in a polymeric macromolecule is used to direct the production of a second molecule that is different from the first molecule. The second molecule may have a chemical structure that is different in chemical nature from the first molecule.

“联用”或“共同施用”是指将本文提供的治疗剂与一种或多种额外治疗剂时间上足够接近地联合施用,使得本文提供的治疗剂可增强一种或多种额外治疗剂的效果,反之亦然。"Combination" or "co-administration" refers to the co-administration of a therapeutic agent provided herein and one or more additional therapeutic agents sufficiently close in time that the therapeutic agent provided herein can enhance the effect of the one or more additional therapeutic agents, or vice versa.

如本文所用,术语“治疗”和“预防”以及源自其的词不一定意味着100%或完全治疗或预防。而是,存在不同程度的治疗或预防,本领域普通技术人员认为所述治疗或预防具有潜在益处或治疗效果。由本文公开的方法提供的治疗或预防可包括治疗或预防疾病的一种或多种疾患或症状。此外,出于本文的目的,“预防”可包括延迟疾病或其症状或疾患的发作。As used herein, the terms "treat" and "prevent" and words derived therefrom do not necessarily mean 100% or complete treatment or prevention. Rather, there are varying degrees of treatment or prevention that one of ordinary skill in the art recognizes as having potential benefits or therapeutic effects. The treatment or prevention provided by the methods disclosed herein may include treating or preventing one or more disorders or symptoms of a disease. In addition, for purposes herein, "prevention" may include delaying the onset of a disease or a symptom or disorder thereof.

如本文所用,术语“表达序列”可指编码产物例如肽或多肽、调控性核酸或非编码核酸的核酸序列。编码肽或多肽的示例性表达序列可包含多个核苷酸三联体,所述核苷酸三联体中的每一者可编码氨基酸并被称为“密码子”。As used herein, the term "expressed sequence" may refer to a nucleic acid sequence encoding a product such as a peptide or polypeptide, a regulatory nucleic acid, or a non-coding nucleic acid. An exemplary expressed sequence encoding a peptide or polypeptide may comprise a plurality of nucleotide triplets, each of which may encode an amino acid and is referred to as a "codon".

术语“抗体”(Ab)包括但不限于特异性地结合至抗原的糖蛋白免疫球蛋白。一般而言,抗体可包含通过二硫键互连的至少两条重(H)链和两条轻(L)链,或其抗原结合分子。每条H链包含重链可变区(本文缩写为VH)和重链恒定区。重链恒定区包含三个恒定结构域,CH1、CH2和CH3。每条轻链包含轻链可变区(本文缩写为VL)和轻链恒定区。轻链恒定区包含一个恒定结构域,CL。VH和VL区可进一步细分为高变区,称为互补决定区(CDR),其间散布着较保守的区域,称为框架区(FR)。每个VH和VL包含三个CDR和四个FR,以下列顺序从氨基末端至羧基末端排列:FR1、CDR1、FR2、CDR2、FR3、CDR3和FR4。重链和轻链的可变区含有与抗原相互作用的结合结构域。抗体的恒定区可介导免疫球蛋白与宿主组织或因子的结合,包括免疫系统的不同细胞(例如,效应细胞)以及经典补体系统的第一组分。抗体可包括例如单 克隆抗体、重组产生的抗体、单特异性抗体、多特异性抗体(包括双特异性抗体)、人抗体、工程化抗体、人源化抗体、嵌合抗体、免疫球蛋白、合成抗体、包含两个重链和两个轻链分子的四聚抗体、抗体轻链单体、抗体重链单体、抗体轻链二聚体、抗体重链二聚体、抗体轻链-抗体重链对、胞内抗体、抗体融合物(本文有时称为“抗体缀合物”)、异源缀合物抗体、单结构域抗体、单价抗体、单链抗体或单链Fv(scFv)、骆驼化抗体、亲和体(affybody)、Fab片段、F(ab')2片段、二硫化物连接的Fv(sdFv)、抗独特型(anti-id)抗体(包括例如抗-抗Id抗体)、微型抗体、结构域抗体、合成抗体(有时在本文中称为“抗体模拟物”)和上述任一者的抗原结合片段。在一些实施方案中,本文所述的抗体是指多克隆抗体群。The term "antibody" (Ab) includes, but is not limited to, glycoprotein immunoglobulins that specifically bind to an antigen. In general, an antibody may comprise at least two heavy (H) chains and two light (L) chains interconnected by disulfide bonds, or an antigen-binding molecule thereof. Each H chain comprises a heavy chain variable region (abbreviated herein as VH) and a heavy chain constant region. The heavy chain constant region comprises three constant domains, CH1, CH2, and CH3. Each light chain comprises a light chain variable region (abbreviated herein as VL) and a light chain constant region. The light chain constant region comprises one constant domain, CL. The VH and VL regions can be further subdivided into hypervariable regions, called complementary determining regions (CDRs), interspersed with more conservative regions, called framework regions (FRs). Each VH and VL comprises three CDRs and four FRs, arranged in the following order from the amino terminus to the carboxyl terminus: FR1, CDR1, FR2, CDR2, FR3, CDR3, and FR4. The variable regions of the heavy and light chains contain binding domains that interact with the antigen. The constant regions of the antibodies may mediate the binding of the immunoglobulin to host tissues or factors, including various cells of the immune system (e.g., effector cells) and the first component of the classical complement system. Antibodies may include, for example, monoclonal antibodies. Clonal antibodies, recombinantly produced antibodies, monospecific antibodies, multispecific antibodies (including bispecific antibodies), human antibodies, engineered antibodies, humanized antibodies, chimeric antibodies, immunoglobulins, synthetic antibodies, tetrameric antibodies comprising two heavy chains and two light chain molecules, antibody light chain monomers, antibody heavy chain monomers, antibody light chain dimers, antibody heavy chain dimers, antibody light chain-antibody heavy chain pairs, intrabodies, antibody fusions (sometimes referred to herein as "antibody conjugates"), heteroconjugate antibodies, single domain antibodies, monovalent antibodies, single chain antibodies or single chain Fv (scFv), camelized antibodies, affibodies, Fab fragments, F(ab')2 fragments, disulfide-linked Fv (sdFv), anti-idiotype antibodies (including, for example, anti-anti-Id antibodies), miniantibodies, domain antibodies, synthetic antibodies (sometimes referred to herein as "antibody mimetics"), and antigen-binding fragments of any of the above. In some embodiments, the antibodies described herein refer to polyclonal antibody populations.

免疫球蛋白可源自任何公知的同种型,包括但不限于IgA、分泌型IgA、IgG和IgM。IgG亚类也是本领域技术人员众所周知的,包括但不限于人IgG1、IgG2、IgG3和IgG4。“同种型”是指由重链恒定区基因编码的Ab类或亚类(例如IgM或IgG1)。举例来说,术语“抗体”包括天然存在的和非天然存在的抗体;单克隆和多克隆抗体;嵌合和人源化抗体;人或非人抗体;全合成抗体;和单链抗体。非人抗体可通过重组方法进行人源化以降低其在人中的免疫原性。在没有明确说明的情况下并且除非上下文另有说明,否则术语“抗体”还包括任何上述免疫球蛋白的抗原结合片段或抗原结合部分,并且包括单价和二价片段或部分,以及单链抗体。Immunoglobulins can be derived from any known isotype, including but not limited to IgA, secretory IgA, IgG and IgM. IgG subclasses are also well known to those skilled in the art, including but not limited to human IgG1, IgG2, IgG3 and IgG4. "Isotype" refers to the Ab class or subclass (e.g., IgM or IgG1) encoded by the heavy chain constant region gene. For example, the term "antibody" includes naturally occurring and non-naturally occurring antibodies; monoclonal and polyclonal antibodies; chimeric and humanized antibodies; human or non-human antibodies; fully synthetic antibodies; and single-chain antibodies. Non-human antibodies can be humanized by recombinant methods to reduce their immunogenicity in humans. In the absence of explicit instructions and unless otherwise indicated in the context, the term "antibody" also includes an antigen-binding fragment or antigen-binding portion of any of the above-mentioned immunoglobulins, and includes monovalent and divalent fragments or portions, as well as single-chain antibodies.

“抗原结合分子”、“抗原结合部分”或“抗体片段”是指包含所述分子所来源的抗体的抗原结合部分(例如,CDR)的任何分子。抗原结合分子可包括抗原互补决定区(CDR)。抗体片段的实例包括但不限于Fab、Fab'、F(ab')2和Fv片段、dAb、线状抗体、scFv抗体和由抗原结合分子形成的多特异性抗体。肽体(即包含肽结合结构域的Fc融合分子)是合适的抗原结合分子的另一个实例。在一些实施方案中,抗原结合分子结合至肿瘤细胞上的抗原。在一些实施方案中,抗原结合分子结合至过度增生性疾病中所涉及的细胞上的抗原或结合至病毒或细菌抗原。在一些实施方案中,抗原结合分子结合至BCMA。在其他实施方案中,抗原结合分子是特异性地结合至抗原的抗体片段,包括其一个或多个互补决定区(CDR)。在其他实施方案中,抗原结合分子是单链可变片段(scFv)。"Antigen binding molecule", "antigen binding portion" or "antibody fragment" refers to any molecule that comprises an antigen binding portion (e.g., CDR) of an antibody from which the molecule is derived. Antigen binding molecules may include antigen complementary determining regions (CDRs). Examples of antibody fragments include, but are not limited to, Fab, Fab', F(ab')2 and Fv fragments, dAbs, linear antibodies, scFv antibodies, and multispecific antibodies formed by antigen binding molecules. Peptibodies (i.e., Fc fusion molecules comprising peptide binding domains) are another example of suitable antigen binding molecules. In some embodiments, the antigen binding molecules bind to antigens on tumor cells. In some embodiments, the antigen binding molecules bind to antigens on cells involved in hyperproliferative diseases or to viral or bacterial antigens. In some embodiments, the antigen binding molecules bind to BCMA. In other embodiments, the antigen binding molecules are antibody fragments that specifically bind to antigens, including one or more complementary determining regions (CDRs) thereof. In other embodiments, the antigen binding molecules are single-chain variable fragments (scFv).

“癌”或“癌症”是指广泛的各种疾病,其特征在于体内异常细胞的不受控制的生长。不受调控的细胞分裂和生长导致恶性肿瘤的形成,所述恶性肿瘤侵入邻近组织并且还可通过淋巴系统或血流转移至身体的远端部分。“癌症”或“癌组织”可包括肿瘤。可通过本文公开的方法治疗的癌症的实例包括但不限于免疫系统的癌症,包括淋巴瘤、白血病、骨髓瘤和其他白细胞恶性肿瘤。在一些实施方案中,本文公开的方法可用于减小源自例如以下的肿瘤的肿瘤大小:骨癌、胰腺癌、皮肤癌、头颈癌、皮肤或眼内恶性黑素瘤、子宫癌、卵巢癌、直肠癌、肛门癌、胃癌、睾丸癌、子宫癌、多发性骨髓瘤、霍奇金病、非霍奇金淋巴瘤(NHL)、原发性纵隔大B细胞淋巴瘤(PMBC)、弥漫性大B细胞淋巴瘤(DLBCL)、滤泡性淋巴瘤(FL)、转化滤泡性淋巴瘤、脾边缘区淋巴瘤(SMZL)、食道癌、小肠癌、内分泌系统癌症、甲状腺癌、甲状旁 腺癌、肾上腺癌、尿道癌、阴茎癌、慢性或急性白血病、急性骨髓性白血病、慢性骨髓性白血病、急性成淋巴细胞性白血病(ALL)(包括非T细胞ALL)、慢性淋巴细胞性白血病(CLL)、儿童实体瘤、淋巴细胞性淋巴瘤、膀胱癌、肾癌或输尿管癌、中枢神经系统(CNS)肿瘤、原发性CNS淋巴瘤、肿瘤血管生成、脊髓轴肿瘤、脑干胶质瘤、垂体腺瘤、表皮样癌、鳞状细胞癌、T细胞淋巴瘤、环境诱导的癌症包括石棉诱导的那些癌症、其他B细胞恶性肿瘤和所述癌症的组合。在一些实施方案中,本文公开的方法可用于减小源自例如以下的肿瘤的肿瘤大小:肉瘤和癌、纤维肉瘤、粘液肉瘤、脂肪肉瘤、软骨肉瘤、成骨肉瘤、卡波济氏肉瘤、软组织肉瘤和其他肉瘤、滑膜瘤、间皮瘤、尤文氏瘤、平滑肌肉瘤、横纹肌肉瘤、结肠癌、胰腺癌、乳腺癌、卵巢癌、前列腺癌、肝细胞癌、肺癌、结肠直肠癌、鳞状细胞癌、基底细胞癌、腺癌(例如胰腺、结肠、卵巢、肺、乳腺、胃、前列腺、子宫颈或食道的腺癌)、汗腺癌、皮脂腺癌、乳头状癌、乳头状腺癌、髓样癌、支气管癌、肾细胞癌、肝癌、胆管癌、绒毛膜癌、韦尔姆斯氏瘤、宫颈癌、睾丸肿瘤、膀胱癌、输卵管癌、子宫内膜癌、子宫颈癌、阴道癌、外阴癌、肾盂癌、CNS肿瘤(如胶质瘤、星形细胞瘤、髓母细胞瘤、颅咽管瘤、室管膜瘤、松果体瘤、血管母细胞瘤、听神经瘤、少突神经胶质瘤、脑膜瘤、黑素瘤、神经母细胞瘤和视网膜母细胞瘤)。特定癌症可对化学疗法或放射疗法有反应,或者癌症可能是难治的。难治性癌症是指不适合手术干预的癌症,并且所述癌症最初对化学疗法或放射疗法没有反应,或者随时间推移癌症变得没有反应。"Cancer" or "cancer" refers to a broad variety of diseases characterized by the uncontrolled growth of abnormal cells in the body. Unregulated cell division and growth lead to the formation of malignant tumors that invade adjacent tissues and may also metastasize to distant parts of the body via the lymphatic system or bloodstream. "Cancer" or "cancerous tissue" may include tumors. Examples of cancers that may be treated by the methods disclosed herein include, but are not limited to, cancers of the immune system, including lymphomas, leukemias, myelomas, and other white blood cell malignancies. In some embodiments, the methods disclosed herein can be used to reduce the size of tumors originating from, for example, bone cancer, pancreatic cancer, skin cancer, head and neck cancer, cutaneous or intraocular malignant melanoma, uterine cancer, ovarian cancer, rectal cancer, anal cancer, stomach cancer, testicular cancer, uterine cancer, multiple myeloma, Hodgkin's disease, non-Hodgkin's lymphoma (NHL), primary mediastinal large B-cell lymphoma (PMBC), diffuse large B-cell lymphoma (DLBCL), follicular lymphoma (FL), transformed follicular lymphoma, splenic marginal zone lymphoma (SMZL), esophageal cancer, small intestine cancer, endocrine system cancer, thyroid cancer, parathyroid cancer adenocarcinoma, adrenal cancer, urethral cancer, penile cancer, chronic or acute leukemia, acute myeloid leukemia, chronic myeloid leukemia, acute lymphoblastic leukemia (ALL) (including non-T cell ALL), chronic lymphocytic leukemia (CLL), solid tumors in children, lymphocytic lymphoma, bladder cancer, kidney cancer or ureteral cancer, central nervous system (CNS) tumors, primary CNS lymphoma, tumor angiogenesis, spinal axis tumors, brain stem glioma, pituitary adenoma, epidermoid carcinoma, squamous cell carcinoma, T cell lymphoma, environmentally induced cancers including those induced by asbestos, other B cell malignancies, and combinations of the described cancers. In some embodiments, the methods disclosed herein can be used to reduce the size of tumors derived from, for example, sarcomas and carcinomas, fibrosarcomas, myxosarcoma, liposarcoma, chondrosarcoma, osteogenic sarcoma, Kaposi's sarcoma, soft tissue sarcomas and other sarcomas, synovioma, mesothelioma, Ewing's tumor, leiomyosarcoma, rhabdomyosarcoma, colon cancer, pancreatic cancer, breast cancer, ovarian cancer, prostate cancer, hepatocellular carcinoma, lung cancer, colorectal cancer, squamous cell carcinoma, basal cell carcinoma, adenocarcinoma (e.g., pancreatic, colon, ovarian, lung, breast, stomach, prostate, cervical cancer, or esophageal adenocarcinoma), sweat gland cancer, sebaceous gland cancer, papillary carcinoma, papillary adenocarcinoma, medullary carcinoma, bronchogenic carcinoma, renal cell carcinoma, hepatocarcinoma, bile duct cancer, choriocarcinoma, Wilms' tumor, cervical cancer, testicular tumors, bladder cancer, fallopian tube cancer, endometrial cancer, cervical cancer, vaginal cancer, vulvar cancer, renal pelvis cancer, CNS tumors (such as glioma, astrocytoma, medulloblastoma, craniopharyngioma, ependymoma, pinealoma, hemangioblastoma, acoustic neuroma, oligodendroglioma, meningioma, melanoma, neuroblastoma and retinoblastoma). Specific cancers may respond to chemotherapy or radiation therapy, or the cancer may be refractory. Refractory cancer refers to cancer that is not suitable for surgical intervention and the cancer initially does not respond to chemotherapy or radiation therapy, or the cancer becomes unresponsive over time.

如本文所用的术语“免疫细胞”或“淋巴细胞”包括自然杀伤(NK)细胞、T细胞或B细胞。NK细胞是一种类型的细胞毒性(细胞毒性)淋巴细胞,其代表固有免疫系统的主要组分。NK细胞排斥肿瘤和被病毒感染的细胞。它通过细胞凋亡或程序性细胞死亡的过程起作用。它们被称为“自然杀手”,因为它们不需要激活即可杀死细胞。T细胞在细胞介导的免疫(无抗体参与)中起主要作用。As used herein, the term "immune cell" or "lymphocyte" includes natural killer (NK) cells, T cells or B cells. NK cells are a type of cytotoxic (cytotoxic) lymphocytes that represent the main components of the innate immune system. NK cells repel tumors and cells infected by viruses. It works through the process of apoptosis or programmed cell death. They are called "natural killers" because they do not need to be activated to kill cells. T cells play a major role in cell-mediated immunity (without antibody involvement).

术语“遗传工程化”或“工程化”是指修饰细胞的基因组的方法,包括但不限于缺失编码或非编码区或其一部分或插入编码区或其一部分。在一些实施方案中,经修饰的细胞是淋巴细胞,例如T细胞,其可从患者或供体获得。可修饰细胞以表达掺入细胞基因组中的外源构建体,例如嵌合抗原受体(CAR)或T细胞受体(TCR)。The term "genetic engineering" or "engineering" refers to a method of modifying the genome of a cell, including but not limited to deleting a coding or non-coding region or a portion thereof or inserting a coding region or a portion thereof. In some embodiments, the modified cell is a lymphocyte, such as a T cell, which can be obtained from a patient or a donor. The cell can be modified to express an exogenous construct incorporated into the cell genome, such as a chimeric antigen receptor (CAR) or a T cell receptor (TCR).

如本文所用,表述“序列同一性”或例如包含“与……50%同一性的序列”是指在比较窗口上序列在逐个核苷酸或逐个氨基酸的基础上同一的程度。因此,“序列同一性百分比”可通过以下方式进行计算:在比较窗口上比较两个最佳比对的序列、确定两个序列中出现同一核酸碱基((例如,A、T、C、G、I)或或同一氨基酸残基(例如,Ala、Pro、Ser、Thr、Gly、Val、Leu、Ile、Phe、Tyr、Trp、Lys、Arg、His、Asp、Glu、Asn、Gln、Cys和Met)的位置数以得到匹配位置数,用匹配位置数除以比较窗口中的位置总数(即窗口大小),并将结果乘以100而得到序列同一性百分比。包括与本文所述的任何参考序列具有至少约50%、55%、60%、65%、70%、 75%、80%、85%、90%、95%、96%、97%、98%、99%或100%序列同一性的核苷酸和多肽,通常其中多肽变体保持参考多肽的至少一种生物活性。As used herein, the expression "sequence identity" or, for example, "a sequence that is 50% identical to ... " refers to the degree to which the sequences are identical on a nucleotide-by-nucleotide or amino acid-by-amino acid basis over the comparison window. Therefore, the "percentage of sequence identity" can be calculated by comparing two optimally aligned sequences over the comparison window, determining the number of positions where the same nucleic acid base (e.g., A, T, C, G, I) or the same amino acid residue (e.g., Ala, Pro, Ser, Thr, Gly, Val, Leu, Ile, Phe, Tyr, Trp, Lys, Arg, His, Asp, Glu, Asn, Gln, Cys and Met) appears in the two sequences to obtain the number of matching positions, dividing the number of matching positions by the total number of positions in the comparison window (i.e., the window size), and multiplying the result by 100 to obtain the percentage of sequence identity. Including sequences that have at least about 50%, 55%, 60%, 65%, 70%, 80%, 90%, 100%, 110%, 120%, 130%, 140%, 150%, 160%, 170%, 180%, 190%, 200%, 210%, 220%, 230%, 240%, 250%, 260%, 270%, 280%, 290%, 300%, 310%, 320%, 330%, 340%, 350%, 360%, 370%, 380%, 390%, 400%, 410%, 420%, 430%, 440%, 450%, 460%, 470%, 480%, 490%, 500%, 510%, 520%, 530%, 540%, 550%, 5 Nucleotides and polypeptides with 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or 100% sequence identity, typically wherein the polypeptide variant retains at least one biological activity of the reference polypeptide.

如本文所用,术语“治疗”是指治疗性治疗和预防性或防范性或阻止性措施,其中目的是预防或减缓(减轻)不期望的病理变化或病症。对于本发明的目的,有益或期望的临床结果包括但不限于:症状的减轻,疾病程度的降低,延缓或减慢疾病进展,改善或缓和疾病状态,以及缓解(无论是部分还是全部),无论是可检测的还是不可检测的。As used herein, the term "treatment" refers to both therapeutic treatment and prophylactic or preventive or preventive measures, wherein the purpose is to prevent or slow down (mitigate) an undesirable pathological change or condition. For purposes of the present invention, beneficial or desired clinical results include, but are not limited to, alleviation of symptoms, reduction in disease severity, delay or slowing of disease progression, improvement or alleviation of the disease state, and remission (whether partial or complete), whether detectable or undetectable.

如本文所用,术语“治疗有效量”是指本发明的化合物的量可以:(i)治疗或预防本文描述的疾病或病症,(ii)改善或消除本文描述的一种或多种疾病或病症,或者(iii)预防或延迟本文描述的疾病或病症的一种或多种症状的发作。As used herein, the term "therapeutically effective amount" refers to an amount of a compound of the invention that can: (i) treat or prevent a disease or condition described herein, (ii) ameliorate or eliminate one or more diseases or conditions described herein, or (iii) prevent or delay the onset of one or more symptoms of a disease or condition described herein.

如本文所用,术语“非编码RNA”或“ncRNA”的种类主要包括微小RNA(microRNA,miRNA)、小干扰RNA(small interfering RNA,siRNA)、PIWI-interacting RNA(piRNA)、转运RNA衍生小RNA(tRNA-derived small RNA,tsRNA)、核小RNA(small nuclear RNA,snRNA)、核仁小RNA(small nucleolar RNA,snoRNA)、长链非编码RNA(long non-coding RNA,lncRNA)、环状RNA(circular RNA,circRNA)、假基因等。miRNA是短ncRNA,约为22~23个核苷酸,其编码基因由RNA多聚酶Ⅱ转录,通过与mRNA 3′端非翻译区(3′untranslated region,3′UTR)之间的结合调控mRNA的表达,超过60%的编码基因是miRNA的潜在调控靶点。lncRNA是长度超过200个核苷酸的非编码RNA,其生物发生过程与mRNA类似。lncRNA在细胞周期调节、染色质修饰、mRNA翻译等多种生物过程中发挥重要作用。circRNA属于lncRNA,主要由外显子或者内含子序列产生,是一种单链环状RNA分子,可以作为竞争性内源性RNA与miRNA结合,调节转录或影响亲本基因表达。piRNA是一种长度约为21~35个核苷酸的小RNA,由长单链转录本加工而成,这些转录本的基因组位点聚集在整个基因组中,并由RNA多聚酶Ⅱ转录。人类基因组中大约有20 000个piRNA,主要表达于性腺细胞。tsRNA来源于核酸酶对tRNA的酶切,通常长度为18~40个核苷酸,在调节翻译、维持mRNA的稳定性、基因沉默、逆转录等过程中发挥重要作用。As used herein, the term "non-coding RNA" or "ncRNA" mainly includes microRNA (miRNA), small interfering RNA (siRNA), PIWI-interacting RNA (piRNA), tRNA-derived small RNA (tsRNA), small nuclear RNA (snRNA), small nucleolar RNA (snoRNA), long non-coding RNA (lncRNA), circular RNA (circRNA), pseudogenes, etc. miRNA is a short ncRNA of about 22 to 23 nucleotides. Its coding gene is transcribed by RNA polymerase II and regulates the expression of mRNA by binding to the 3′ untranslated region (3′ UTR) of mRNA. More than 60% of coding genes are potential regulatory targets of miRNA. lncRNA is a non-coding RNA with a length of more than 200 nucleotides. Its biogenesis process is similar to that of mRNA. lncRNA plays an important role in a variety of biological processes such as cell cycle regulation, chromatin modification, and mRNA translation. CircRNA belongs to lncRNA, which is mainly produced by exon or intron sequences. It is a single-stranded circular RNA molecule that can bind to miRNA as a competitive endogenous RNA to regulate transcription or affect parental gene expression. piRNA is a small RNA with a length of about 21 to 35 nucleotides, which is processed from long single-stranded transcripts. The genomic sites of these transcripts are aggregated throughout the genome and transcribed by RNA polymerase II. There are about 20,000 piRNAs in the human genome, which are mainly expressed in gonadal cells. tsRNA is derived from the cleavage of tRNA by nucleases. It is usually 18 to 40 nucleotides in length and plays an important role in regulating translation, maintaining the stability of mRNA, gene silencing, reverse transcription, etc.

如文本所用,术语“基因组”或“基因组DNA”指宿主生物的可遗传信息。所述基因组DNA包括细胞或生物的全部遗传物质,包括胞核DNA(染色体DNA)、染色体外DNA和细胞器(例如线粒体)DNA。优选地,术语“基因组”或基因组“DNA”指胞核的染色体DNA。As used herein, the term "genome" or "genomic DNA" refers to the heritable information of a host organism. The genomic DNA includes all genetic material of a cell or organism, including nuclear DNA (chromosomal DNA), extrachromosomal DNA, and organelle (e.g., mitochondrial) DNA. Preferably, the term "genome" or genomic "DNA" refers to the chromosomal DNA of the nucleus.

如本文所用,在术语“重组”用来描述生物或细胞(例如微生物)的情况下,它用来表述该生物或细胞包含通常稍后描述的至少一个“转基因”、“转基因的”或“重组”多核苷酸。As used herein, where the term "recombinant" is used to describe an organism or cell (eg, a microorganism), it is used to convey that the organism or cell contains at least one "transgene," "transgenic," or "recombinant" polynucleotide, typically described later.

如本文所用,相对于个体生物为“外源”的多核苷酸是通过有性杂交之外的任何手段引入该生物中的多核苷酸。As used herein, a polynucleotide that is "exogenous" with respect to an individual organism is a polynucleotide that has been introduced into that organism by any means other than sexual hybridization.

如本文所用,术语“启动子”或“RNA酶结合位点”是起始编码序列转录的多核苷酸区域。启动子位于基因的转录起始位点附近,位于DNA的同一链和上游(朝向有义链的5'区域)。一些 启动子是组成型的,因为它们在细胞中的所有情况下都是活性的,而其他启动子被调节成响应于特定刺激而变成活性的(例如,诱导型启动子)。如本文所用的术语“启动子活性”及其语法等同项是指与正在测量其活性的启动子可操作地连接的核苷酸序列的表达程度。启动子活性可以通过确定产生的RNA转录物的量来直接测量,例如通过Northern印迹分析,或者通过确定由连接的核酸序列,如与启动子连接的报道核酸序列编码的产物的量间接测量。As used herein, the term "promoter" or "RNase binding site" is a polynucleotide region that initiates transcription of a coding sequence. The promoter is located near the transcription start site of a gene, on the same strand of DNA and upstream (towards the 5' region of the sense strand). Some Promoters are constitutive in that they are active under all circumstances in the cell, whereas other promoters are regulated to become active in response to a specific stimulus (e.g., inducible promoters). As used herein, the term "promoter activity" and its grammatical equivalents refer to the extent of expression of a nucleotide sequence operably linked to the promoter whose activity is being measured. Promoter activity can be measured directly by determining the amount of RNA transcript produced, such as by Northern blot analysis, or indirectly by determining the amount of product encoded by a linked nucleic acid sequence, such as a reporter nucleic acid sequence linked to the promoter.

如本文所用,术语“质粒”是指染色体外元件,其常携带不作为细胞核心代谢机制部分的基因,且通常是环状双链DNA分子的形式。这些元件可以是任何来源的自主复制序列、基因组整合序列、噬菌体或核苷酸序列,线状、环状或超螺旋的单链或双链DNA或RNA。通常,质粒含有在宿主细胞(例如大肠杆菌)中有功能的复制起点,以及用于检测包含该质粒的宿主细胞的选择标记。As used herein, the term "plasmid" refers to an extrachromosomal element that often carries genes that are not part of the core metabolic machinery of the cell, and is usually in the form of a circular double-stranded DNA molecule. These elements can be autonomous replication sequences, genome integration sequences, phage or nucleotide sequences of any origin, linear, circular or supercoiled single-stranded or double-stranded DNA or RNA. Typically, a plasmid contains a replication origin that is functional in a host cell (e.g., E. coli), and a selectable marker for detecting host cells containing the plasmid.

如本文所用,术语“转化”是指通过使用物理或化学方法将一种或多种外源多核苷酸引入到宿主细胞中。As used herein, the term "transformation" refers to the introduction of one or more exogenous polynucleotides into a host cell by using physical or chemical methods.

如本文所用,术语“表达”是揭示基因内编码的信息的过程。如果基因编码蛋白质,则表达包括将DNA转录为mRNA、将mRNA(如有必要)加工为成熟的mRNA产物以及将成熟的mRNA翻译为蛋白质。As used herein, the term "expression" is the process of revealing the information encoded within a gene. If the gene encodes a protein, expression includes transcribing the DNA into mRNA, processing the mRNA (if necessary) into a mature mRNA product, and translating the mature mRNA into a protein.

以下通过具体实施例的方式对本发明做进一步的说明,但这并非是对本发明的限制。本领域技术人员根据本发明的教导可以做出各种修改或调整,其并不背离本发明的精神和范围。The present invention is further described below by way of specific embodiments, but this is not intended to limit the present invention. Those skilled in the art may make various modifications or adjustments based on the teachings of the present invention without departing from the spirit and scope of the present invention.

实施例Example

下面结合具体实施方式对本发明进行进一步的详细描述,给出的实施例仅为了阐明本发明,而不是为了限制本发明的范围。以下提供的实施例可作为本技术领域普通技术人员进行进一步改进的指南,并不以任何方式构成对本发明的限制。The present invention is further described in detail below in conjunction with specific embodiments, and the examples provided are only for illustrating the present invention, rather than for limiting the scope of the present invention. The examples provided below can be used as a guide for further improvements by those of ordinary skill in the art, and do not constitute a limitation of the present invention in any way.

下述实施例中的实验方法,如无特殊说明,均为常规方法,按照本领域内的文献所描述的技术或条件或者按照产品说明书进行。下述实施例中所用的材料、试剂、仪器等,如无特殊说明,均可从商业途径得到。以下实施例中的定量试验,如无特殊说明,均为三次重复实验的平均值。下述实施例中,如无特殊说明,序列表中各核苷酸序列均从左至右按5’至3’末端的的顺序书写,氨基酸序列均从左至右按氨基端至羧基端的顺序书写。The experimental methods in the following examples, unless otherwise specified, are all conventional methods, and are carried out according to the techniques or conditions described in the literature in the art or according to the product instructions. The materials, reagents, instruments, etc. used in the following examples, unless otherwise specified, can all be obtained from commercial sources. The quantitative tests in the following examples, unless otherwise specified, are the average values of three repeated experiments. In the following examples, unless otherwise specified, the nucleotide sequences in the sequence table are written from left to right in the order of 5' to 3' end, and the amino acid sequences are written from left to right in the order of amino terminal to carboxyl terminal.

使用的试剂、仪器和细胞系Reagents, instruments, and cell lines used

如无特别指明,本发明使用的试剂、仪器和细胞系均为可商购获得。Unless otherwise specified, the reagents, instruments and cell lines used in the present invention are commercially available.

重要试剂列举如下:Important reagents are listed below:

BspQ I(货号ON-123,上海兆维科技发展有限公司);BspQ I (Cat. No. ON-123, Shanghai Zhaowei Technology Development Co., Ltd.);

High Yield T7 RNA Synthesis Kit(货号ON-040,上海兆维科技发展有限公司);High Yield T7 RNA Synthesis Kit (Cat. No. ON-040, Shanghai Zhaowei Technology Development Co., Ltd.);

Monarch RNA Cleanup Kit(货号T2040L,NEB); Monarch RNA Cleanup Kit (Cat. No. T2040L, NEB);

RNase R(货号E224,近岸蛋白);RNase R (Cat. No. E224, nearshore protein);

Oligo d(T)25磁珠(货号S1419S,NEB)。Oligo d(T)25 magnetic beads (Cat. No. S1419S, NEB).

重要仪器列举如下:The important instruments are listed below:

安捷伦5200片段分析仪系统(Agilent,U.S.),Agilent 5200 Fragment Analyzer System (Agilent, U.S.),

琼脂糖凝胶电泳系统(型号DYY-6C,北京六一生物科技有限公司),Agarose gel electrophoresis system (model DYY-6C, Beijing Liuyi Biotechnology Co., Ltd.),

Nanodrop分光光度计(型号Nanodrop ONEc,Thermo Fisher Scientific),Nanodrop spectrophotometer (model Nanodrop ONE c , Thermo Fisher Scientific),

离心机(型号Sorvall Legend Micro 17R,Thermo Fisher Scientific)),Centrifuge (model Sorvall Legend Micro 17R, Thermo Fisher Scientific),

金属浴(型号HWS-12,上海一恒科学仪器有限公司)。Metal bath (Model HWS-12, Shanghai Yiheng Scientific Instrument Co., Ltd.).

实施例1:制备线状RNA前体Example 1: Preparation of linear RNA precursor

委托苏州金唯智生物科技有限公司合成用于制备线状RNA前体的DNA模板双链。将该模板双链通过基因合成的方式掺入到pUC57载体(公司名、货号)中,使其置于T7启动子序列(TAATACGACTCACTATA)的3’端下游,获得用于合成线状RNA前体的模板质粒pUC-PIE-miR21Suzhou Jinweizhi Biotechnology Co., Ltd. was commissioned to synthesize a DNA template double strand for preparing a linear RNA precursor. The template double strand was incorporated into the pUC57 vector (company name, product number) by gene synthesis, and placed downstream of the 3' end of the T7 promoter sequence (TAATACGACTCACTATA) to obtain the template plasmid pUC-PIE-miR21 for synthesizing a linear RNA precursor

使用20μl体系(1X reaction buffer,ATP、GTP、CTP和UTP各7.5mM,T7 RNA polymerase mix 1μl,1μg线性模板质粒pUC-PIE-miR21),37℃孵育4h,进行体外转录,制备获得SEQ ID NO:11所示的线状RNA前体。该前体按5’至3’方向包含表1所列出的各个元件。
A 20 μl system (1X reaction buffer, 7.5 mM each of ATP, GTP, CTP and UTP, 1 μl of T7 RNA polymerase mix, 1 μg of linear template plasmid pUC-PIE-miR21) was used for in vitro transcription at 37°C for 4 h to prepare the linear RNA precursor shown in SEQ ID NO: 11. The precursor contains the elements listed in Table 1 in the 5' to 3' direction.

SEQ ID NO:11中的下划线分别示出了靠近5’末端的5’外部同源臂和靠近3’末端的3’外部 同源臂。为便于描述,将5’外部同源臂中的第一个胞嘧啶(C)编号为第1位,与它相邻的鸟嘌呤(G)为T7启动子(TAATACGACTCACTATA)的转录起点,称为第0位。SEQ ID NO:11中按5’至3’方向的最后一个核苷酸被称为第807位,与它相邻的3’端核苷酸相应地可被称为第808位。The underlines in SEQ ID NO: 11 indicate the 5' outer homology arm near the 5' end and the 3' outer homology arm near the 3' end, respectively. Homology arm. For ease of description, the first cytosine (C) in the 5' outer homology arm is numbered as position 1, and the guanine (G) adjacent to it is the transcription start point of the T7 promoter (TAATACGACTCACTATA), which is called position 0. The last nucleotide in SEQ ID NO:11 in the 5' to 3' direction is called position 807, and the 3' end nucleotide adjacent to it can be called position 808 accordingly.

SEQ ID NO:11中还以符号“#”和“*”分别示出了线状前体在成环时的两步转酯反应所针对的化学键位置,黑体显示的“UG”和“AA”分别是3’内含子片段和5’内含子片段中确定自剪接位点的二核苷酸。针对该具体序列而言,位于“#”5’上游的第1至151位核苷酸残基构成其5’端臂,位于“*”3’下游的第673至807位核苷酸残基构成其3’端臂;其所形成的环状RNA将由位于“#”和“*”之间的第152位至第672位核苷酸所示序列环化而成,即,所述环化区域具体包含:3’外显子片段(第152~202位)、5’内部同源臂(第203~227位)、插入片段(第228~635位)、3’内部同源臂(第636~657位)、5’外显子片段(第658~672位)。In SEQ ID NO:11, the symbols "#" and "*" respectively indicate the positions of the chemical bonds targeted by the two-step transesterification reaction of the linear precursor during the cyclization. The "UG" and "AA" shown in bold are the dinucleotides that determine the self-splicing sites in the 3' intron fragment and the 5' intron fragment, respectively. For this specific sequence, the nucleotide residues 1 to 151 located 5' upstream of "#" constitute its 5' end arm, and the nucleotide residues 673 to 807 located 3' downstream of "*" constitute its 3' end arm; the circular RNA formed by it will be cyclized by the sequence represented by the nucleotides 152 to 672 located between "#" and "*", that is, the cyclized region specifically includes: 3' exon fragment (positions 152 to 202), 5' internal homology arm (positions 203 to 227), insert fragment (positions 228 to 635), 3' internal homology arm (positions 636 to 657), 5' exon fragment (positions 658 to 672).

图1以示意图的方式示出了该线状RNA前体,并将各元件的名称和起止位置编号示于相应位置上方。FIG1 schematically shows the linear RNA precursor, and shows the name and start and end position numbers of each element above the corresponding position.

表1:PIE系统序列

Table 1: PIE system sequence

实施例2:在单侧或双侧端臂中具有纯化标签的线状RNA前体及其IVT产量、环化效率测试Example 2: Linear RNA precursors with purification tags in one or both end arms and IVT yield and circularization efficiency test

使用与实施例1相同的方法,制备具有如SEQ ID NO:26至SEQ ID NO:49所示核苷酸序列的线状RNA前体,分别作为制备例1至24。如下表2所示,与SEQ ID NO:11所示核苷酸序列相比,制备例1至24的线状RNA前体分别在SEQ ID NO:11的不同位置包含长度为50nt的poly(A)作为纯化标签。同时,以实施例1制备的SEQ ID NO:11作为对比例(CK)。Using the same method as in Example 1, linear RNA precursors having nucleotide sequences as shown in SEQ ID NO: 26 to SEQ ID NO: 49 were prepared as Preparation Examples 1 to 24, respectively. As shown in Table 2 below, compared with the nucleotide sequence shown in SEQ ID NO: 11, the linear RNA precursors of Preparation Examples 1 to 24 respectively contained a poly (A) with a length of 50 nt as a purification tag at different positions of SEQ ID NO: 11. At the same time, SEQ ID NO: 11 prepared in Example 1 was used as a comparative example (CK).

对于插入位置,示例性解释如下:对于插入位置为1时,poly A位于线性RNA前体第1位核苷酸残基的3’端且与第一位核苷酸残基直接相连。For the insertion position, an exemplary explanation is as follows: when the insertion position is 1, poly A is located at the 3' end of the first nucleotide residue of the linear RNA precursor and is directly connected to the first nucleotide residue.

表2:

Table 2:

使用已知的方法,使制备例1~5、7、11、24和对比例1在相同条件下进行环化反应。例如,将实施例1和2的体外转录体系于37℃孵育4h后,添加2mm GTP并转移至55℃孵育15min,再根据生产商的说明,将孵育后的混合物用RNA Cleanup Kit纯化,得到RNA酶R处理前的环化反应产物(-)。使用Nanodrop法,测定上述环化反应产物的RNA含量。向100μl反应体系中,先后加入20U RNase R和2μg RNA,37℃孵育15min,得到RNA酶R处理后的环化反应产物(+)。将上述两种反应产物在相同条件下通过1.0%琼脂糖凝胶进行检测。其中,单侧端臂含有poly(A)标签的产物电泳结果示出于图5A,双侧端臂含有poly(A)标签的产物电泳结果示出于图5B。Using known methods, Preparation Examples 1 to 5, 7, 11, 24 and Comparative Example 1 were subjected to cyclization reaction under the same conditions. For example, after incubating the in vitro transcription system of Examples 1 and 2 at 37°C for 4 hours, 2 mm GTP was added and transferred to 55°C for incubation for 15 minutes, and then the incubated mixture was purified with RNA Cleanup Kit according to the manufacturer's instructions to obtain the cyclization reaction product before RNase R treatment (-). The RNA content of the above cyclization reaction product was determined using the Nanodrop method. 20 U RNase R and 2 μg RNA were added to a 100 μl reaction system, incubated at 37°C for 15 minutes, and the cyclization reaction product (+) after RNase R treatment was obtained. The above two reaction products were detected by 1.0% agarose gel under the same conditions. Among them, the electrophoresis results of the product with a poly (A) tag on the single-side end arm are shown in Figure 5A, and the electrophoresis results of the product with a poly (A) tag on the double-side end arm are shown in Figure 5B.

RNase R属于3’-5’核酸外切酶,只能降解线状RNA,不降解环状RNA。图5显示,制备例1~5、7、11、24和对比例1(CK)在经过RNaseR酶切处理后,均可观察到与环状RNA对应的明显条带,表明poly(A)的插入并未影响这些线状RNA前体的环化。然而,在制备例1~5、7、11、24所对应的泳道中,不论是RNase R酶切处理之前(-)还是之后(+),除主条带外,均未观察到明显的次要条带。与之相对,在作为对照的对比例1所对应的泳道中,在RNase R酶切处理之前(-)可以观察到两条分离的明显条带,其中迁移率高的条带在RNase R酶切处理之后(+)消失,表明该条带主要对应于环状RNA随机断裂产生的Nicked RNA杂质。RNase R is a 3’-5’ exonuclease that can only degrade linear RNA, not circular RNA. Figure 5 shows that after RNase R digestion, obvious bands corresponding to circular RNA can be observed in Preparation Examples 1 to 5, 7, 11, 24 and Comparative Example 1 (CK), indicating that the insertion of poly(A) did not affect the cyclization of these linear RNA precursors. However, in the lanes corresponding to Preparation Examples 1 to 5, 7, 11, and 24, no obvious secondary bands were observed except for the main band, whether before (-) or after (+) RNase R digestion. In contrast, in the lane corresponding to Comparative Example 1 as a control, two separate obvious bands can be observed before RNase R digestion (-), among which the band with high mobility disappears after RNase R digestion (+), indicating that the band mainly corresponds to Nicked RNA impurities generated by random breakage of circular RNA.

综上,本实施例表明,不受任何理论机制的约束,在线状RNA前体的两侧端臂(尤其是5’外部同源臂、5’内含子或3’外部同源臂的区域)中插入纯化标签通常不会显著干扰环化反应,反而能够令人惊讶地改善线状RNA前体的环化反应效率,显著提高所期望的完全环化产物在反应产物中的占比,同时明显降低、甚至消除了nicked RNA等不期望的副产物,从而能够显著提高环化反应粗产物的纯度,减少甚至消除环状RNA纯化方法对RNase R酶处理步骤的依赖性,最终在不影响甚至明显提高环化产物质量的同时,极大地简化后续纯化工艺。 In summary, this example shows that, without being constrained by any theoretical mechanism, inserting a purification tag into the two end arms of a linear RNA precursor (especially the region of the 5' external homology arm, the 5' intron or the 3' external homology arm) generally does not significantly interfere with the cyclization reaction, but can surprisingly improve the cyclization reaction efficiency of the linear RNA precursor, significantly increase the proportion of the desired complete cyclization product in the reaction products, and significantly reduce or even eliminate undesirable by-products such as nicked RNA, thereby significantly improving the purity of the crude product of the cyclization reaction, reducing or even eliminating the dependence of the circular RNA purification method on the RNase R enzyme treatment step, and ultimately greatly simplifying the subsequent purification process while not affecting or even significantly improving the quality of the cyclization product.

实施例3:在双侧端臂中具有纯化标签的线状RNA前体及其IVT产量和环化效率测试Example 3: Linear RNA precursor with purification tags in both side arms and IVT yield and circularization efficiency test

使用与实施例2相同的方法,制备具有如SEQ ID NO:50所示核苷酸序列的线状RNA前体作为对比例2,该线状RNA前体中的各元件序列除表3所示的5’外部同源臂,3’外部同源臂外,其余元件同表1。Using the same method as Example 2, a linear RNA precursor having a nucleotide sequence as shown in SEQ ID NO:50 was prepared as Comparative Example 2. Except for the 5' external homology arm and the 3' external homology arm shown in Table 3, the remaining element sequences in the linear RNA precursor were the same as those in Table 1.

表3
Table 3

使用相同的方法,制备具有如SEQ ID NO:51至55所示核苷酸序列的线状RNA前体,分别作为制备例25至29。如下表4所示,与对比例2相比,制备例25至29的线状RNA前体分别在SEQ ID NO:50的不同位置包含长度为50nt的poly(A)作为纯化标签。Using the same method, linear RNA precursors having nucleotide sequences as shown in SEQ ID NO: 51 to 55 were prepared as Preparation Examples 25 to 29, respectively. As shown in Table 4 below, compared with Comparative Example 2, the linear RNA precursors of Preparation Examples 25 to 29 respectively contained poly (A) with a length of 50 nt as a purification tag at different positions of SEQ ID NO: 50.

表4:
Table 4:

使用与实施例2相同的方法,测量在相同条件下的制备例25至29与对比例2(CK)相比的相对产量,结果显示于图7。Using the same method as Example 2, the relative yields of Preparation Examples 25 to 29 compared with Comparative Example 2 (CK) under the same conditions were measured, and the results are shown in FIG. 7 .

本实施例仍表明,具有纯化标签的线状RNA前体都能够通过体外环化反应产生所需的环化产物。改变线状RNA前体的具体序列,包括改变端臂序列(如改变5’和/3’外部同源臂),不影响具有纯化标签的前体分子进行体外环化。但是,纯化标签在线状RNA前体中的部分位置插入时,IVT产量将明显降低,具体而言,是5'外部同源臂的5’端上游紧邻线状RNA前体的残基位置(第0位),和5'外部同源臂的5’端末端的首位残基位置(第1位)。但所有情况下均可观察到所产生的非目的产物的量减少。This example still shows that linear RNA precursors with purification tags can produce the desired cyclized products through in vitro cyclization reactions. Changing the specific sequence of the linear RNA precursor, including changing the end arm sequence (such as changing the 5' and/or 3' external homology arms), does not affect the in vitro cyclization of the precursor molecule with the purification tag. However, when the purification tag is inserted at a certain position in the linear RNA precursor, the IVT yield will be significantly reduced, specifically, the residue position of the linear RNA precursor immediately upstream of the 5' end of the 5' external homology arm (position 0), and the first residue position (position 1) at the end of the 5' end of the 5' external homology arm. However, a reduction in the amount of non-target products produced can be observed in all cases.

实施例4:其他示例性的线状RNA前体Example 4: Other exemplary linear RNA precursors

根据与实施例1相同的方法,制备具有图2~图4所示的元件结构的线状RNA前体。图中示出的外部同源臂、内含子片段、编码区片段、翻译起始元件、外显子片段、内部同源臂、间隔序列、插入序列均可以使用本领域已知具有相应功能、或者能够合理推断其应当具有相应功能的元件,包括但不限于CN112399860A、CN115404240A中所描述的各种功能元件。上述申请或专利文本在此全文并入本文中。 According to the same method as in Example 1, a linear RNA precursor having the element structure shown in Figures 2 to 4 is prepared. The external homology arms, intron fragments, coding region fragments, translation initiation elements, exon fragments, internal homology arms, spacer sequences, and insertion sequences shown in the figures can all use elements known in the art to have corresponding functions, or elements that can be reasonably inferred to have corresponding functions, including but not limited to various functional elements described in CN112399860A and CN115404240A. The above-mentioned application or patent text is hereby incorporated herein in its entirety.

实施例5:亲和层析环化反应产物的纯化效率Example 5: Purification efficiency of affinity chromatography cyclization reaction products

测量双侧端臂中插入了poly(A)的线状RNA前体相对于不含poly(A)的对照的亲和层析的纯化效率。The purification efficiency of linear RNA precursors with poly(A) inserted into both end arms was measured by affinity chromatography relative to a control without poly(A).

从制备例24和对比例1的环化反应产物中各取含有5μg RNA的反应终溶液进行纯化。在未添加RNase R的条件下,将终溶液与200μl Oligo d(T)25磁珠孵育20min,使磁珠充分接触并吸附带有poly(A)标签的RNA。20min后,将样品放入磁力架静止10sec,使磁珠与上清分离,去除磁珠,将上清根据生产商的说明用RNA Cleanup Kit回收。使用安捷伦5200片段分析仪在默认设置下分别检测RNA酶R处理前的环化反应产物(-)和RNA酶R处理后的环化反应产物(+)的纯度。结果示于图8A至8D。The final reaction solution containing 5 μg of RNA was taken from the cyclization reaction products of Preparation Example 24 and Comparative Example 1 for purification. Without adding RNase R, the final solution was incubated with 200 μl of Oligo d(T)25 magnetic beads for 20 minutes to allow the magnetic beads to fully contact and adsorb the RNA with the poly(A) label. After 20 minutes, the sample was placed on a magnetic stand and allowed to stand for 10 seconds to separate the magnetic beads from the supernatant, the magnetic beads were removed, and the supernatant was recovered using the RNA Cleanup Kit according to the manufacturer's instructions. The purity of the cyclization reaction product before RNase R treatment (-) and the cyclization reaction product after RNase R treatment (+) was detected using an Agilent 5200 fragment analyzer under default settings. The results are shown in Figures 8A to 8D.

图8显示,在用Oligo d(T)25磁珠孵育前,对比例1和制备例24的样品中的环状RNA纯度分别为81.3%和92.7%。而在用磁珠孵育后,对比例1的环状RNA纯度测定为78.1%,制备例24的纯度则提高至100%。上述结果表明,在线状RNA前体的两侧端臂中插入poly(A)后,能够藉由亲和层析介质(如Oligo d(T)25磁珠),去除环化反应体系中与目的环状RNA共存的各种杂质,获得高纯度的环状RNA。FIG8 shows that before incubation with Oligo d(T)25 magnetic beads, the purity of circular RNA in the samples of Comparative Example 1 and Preparation Example 24 was 81.3% and 92.7%, respectively. After incubation with magnetic beads, the purity of circular RNA in Comparative Example 1 was determined to be 78.1%, and the purity of Preparation Example 24 was increased to 100%. The above results show that after inserting poly(A) into the two end arms of the linear RNA precursor, various impurities coexisting with the target circular RNA in the cyclization reaction system can be removed by affinity chromatography media (such as Oligo d(T)25 magnetic beads), and high-purity circular RNA can be obtained.

值得注意的是,即使在与Oligo d(T)25磁珠孵育之前,仍可以观察到制备例24和对比例1的环化反应产物的纯度存在明显区别(92.7%vs 81.3%)。该结果与实施例3中观察到的电泳结果一致,表明插入至端臂中的poly(A)可以独立于亲和纯化机制、直接改善环化反应的产物纯度。It is noteworthy that even before incubation with Oligo d(T)25 magnetic beads, a significant difference in the purity of the cyclization reaction products of Preparation Example 24 and Comparative Example 1 can be observed (92.7% vs 81.3%). This result is consistent with the electrophoresis results observed in Example 3, indicating that the poly(A) inserted into the end arm can directly improve the purity of the cyclization reaction product independently of the affinity purification mechanism.

本申请以上实施例中使用的序列示于序列表中。应当理解,这些序列仅为本申请实施方案的示例性序列,而非对本申请方案的任何限制。尽管在电子序列表中表明的是DNA序列,但本申请序列表中的核苷酸序列既可表示DNA序列,也可表示RNA序列,当其表示RNA序列时,其中的“T”代表尿苷。








The sequences used in the above examples of the present application are shown in the sequence table. It should be understood that these sequences are only exemplary sequences of the present application's embodiments, rather than any restriction to the present application's scheme. Although what is indicated in the electronic sequence table is a DNA sequence, the nucleotide sequence in the present application's sequence table can represent both a DNA sequence and an RNA sequence, and when it represents an RNA sequence, "T" therein represents uridine.








Claims (66)

一种制备环状RNA的方法,所述方法包括:A method for preparing circular RNA, the method comprising: 步骤A:产生线状RNA前体,所述线状前体按5’至3’方向依次以彼此可操作地连接的方式包含5’端臂、3’自剪接位点、环化区域、5’自剪接位点和3’端臂,其中,所述5’端臂中插入了第一标签,所述3’端臂中插入了第二标签;Step A: generating a linear RNA precursor, wherein the linear precursor comprises a 5' terminal arm, a 3' self-splicing site, a circularization region, a 5' self-splicing site and a 3' terminal arm in a manner operably connected to each other in sequence from 5' to 3' direction, wherein a first tag is inserted into the 5' terminal arm, and a second tag is inserted into the 3' terminal arm; 步骤B:使所述线状RNA前体处于适合所述3’自剪接位点和所述5’自剪接位点发生自剪接的条件下,得到包含带有所述第一标签和/或所述第二标签的线状RNA片段以及由所述环化区域环化而得的环状RNA的混合物;Step B: placing the linear RNA precursor under conditions suitable for self-splicing of the 3' self-splicing site and the 5' self-splicing site, to obtain a mixture comprising a linear RNA fragment with the first tag and/or the second tag and a circular RNA obtained by cyclization of the cyclization region; 步骤C:使所述混合物与能够同时结合所述第一标签和所述第二标签的亲和层析介质接触一定时间,至足以使所述亲和层析介质结合包含至少一种所述标签的线状RNA片段;和Step C: contacting the mixture with an affinity chromatography medium capable of simultaneously binding the first tag and the second tag for a period of time sufficient to allow the affinity chromatography medium to bind to the linear RNA fragment containing at least one of the tags; and 步骤D:将所述亲和层析介质和与之接触后的混合物分离,收集上清液,得到环状RNA,Step D: separating the affinity chromatography medium and the mixture after contacting the affinity chromatography medium, collecting the supernatant, and obtaining the circular RNA. 其中,所述第一标签和/或所述第二标签独立地选自由20~100个、优选30~90个,更优选40~80个、进一步优选45~70个,最优选50~65个连续的腺嘌呤核苷酸组成的poly(A)标签或其功能性变体,并且所述第一标签和所述第二标签在所述环化区域和所述环状RNA中均不存在。Wherein, the first tag and/or the second tag are independently selected from a poly(A) tag consisting of 20 to 100, preferably 30 to 90, more preferably 40 to 80, further preferably 45 to 70, and most preferably 50 to 65 consecutive adenine nucleotides or a functional variant thereof, and the first tag and the second tag are not present in the cyclization region and the circular RNA. 根据权利要求1所述的方法,其中,所述poly(A)标签的功能性变体为在所述poly(A)标签中插入1个或多个非A碱基,优选插入1~20个非A碱基,更优选插入1~10个非A碱基。The method according to claim 1, wherein the functional variant of the poly(A) tag is to insert one or more non-A bases into the poly(A) tag, preferably insert 1 to 20 non-A bases, and more preferably insert 1 to 10 non-A bases. 根据前述任一项权利要求所述的方法,其中,所述poly(A)标签的功能性变体包含The method according to any of the preceding claims, wherein the functional variant of the poly(A) tag comprises (1)唯一的元件a、至少一个元件b,以及至少一个元件c,(1) a single element a, at least one element b, and at least one element c, (2)唯一的元件a,至少一个元件b,以及至少一个元件d;或者(2) only one element a, at least one element b, and at least one element d; or (3)唯一的元件a,至少一个元件b,以及至少一个元件c和至少一个元件d,(3) a single element a, at least one element b, at least one element c and at least one element d, 其中,所述元件a是由20个以上连续的腺嘌呤核苷酸组成,所述元件b是由3个以上且少于20个连续的腺嘌呤核苷酸组成,所述元件c是由一个选自尿嘧啶核苷酸、胞嘧啶核苷酸、鸟嘌呤核苷酸的核苷酸组成,所述元件d是由2个以上至20个以下的核苷酸组成,所述核苷酸任意地选自腺嘌呤核苷酸、尿嘧啶核苷酸、胞嘧啶核苷酸、鸟嘌呤核苷酸,并且所述元件d不包含3个以上连续腺嘌呤核苷酸,且其5’及3’最末端的核苷酸不是腺嘌呤核苷酸,wherein the element a is composed of more than 20 consecutive adenine nucleotides, the element b is composed of more than 3 and less than 20 consecutive adenine nucleotides, the element c is composed of a nucleotide selected from uracil nucleotides, cytosine nucleotides, and guanine nucleotides, the element d is composed of more than 2 and less than 20 nucleotides, the nucleotides are arbitrarily selected from adenine nucleotides, uracil nucleotides, cytosine nucleotides, and guanine nucleotides, and the element d does not contain more than 3 consecutive adenine nucleotides, and the 5' and 3' terminal nucleotides are not adenine nucleotides, 其中,当所述Poly(A)标签中同时包含两个以上的所述元件b、所述元件c或所述元件d时,每两个元件b的序列可以相同或不同、每两个元件c的序列可以相同或不同,以及每两个元件d的序列可以相同或不同,Wherein, when the Poly(A) tag contains two or more of the element b, the element c or the element d at the same time, the sequences of every two elements b may be the same or different, the sequences of every two elements c may be the same or different, and the sequences of every two elements d may be the same or different. 并且,所述元件a和所述元件b之间、所述元件c和所述元件d之间、所述元件b彼此之间、所述元件c彼此之间、所述元件d彼此之间均不相邻。Furthermore, the element a and the element b, the element c and the element d, the elements b, the elements c, and the elements d are not adjacent to each other. 根据前述任一项权利要求所述的方法,其中所述元件a由20个以上、80个以下连续的腺嘌呤核苷酸组成,优选由30至70个、35至65个、40至60个、或45至55个连续的腺嘌呤核苷 酸组成,更优选由60个连续的腺嘌呤核苷酸组成。The method according to any of the preceding claims, wherein the element a consists of more than 20 and less than 80 consecutive adenine nucleotides, preferably 30 to 70, 35 to 65, 40 to 60, or 45 to 55 consecutive adenine nucleotides. The nucleotide sequence is preferably composed of 60 consecutive adenine nucleotides. 根据前述任一项权利要求所述的方法,其中所述元件b由3至10个、10至19个、12至15个、14至17个、或者16至19个,优选由19个连续的腺嘌呤核苷酸组成。A method according to any preceding claim, wherein the element b consists of 3 to 10, 10 to 19, 12 to 15, 14 to 17, or 16 to 19, preferably 19 consecutive adenine nucleotides. 根据前述任一项权利要求所述的方法,其中所述元件b的个数为2~10个,优选2~5个,进一步优选3个。The method according to any one of the preceding claims, wherein the number of the elements b is 2 to 10, preferably 2 to 5, and more preferably 3. 根据前述任一项权利要求所述的方法,其中所述元件c为鸟嘌呤核苷酸。A method according to any preceding claim, wherein the element c is a guanine nucleotide. 根据前述任一项权利要求所述的方法,其中所述元件c的个数为2至10个、3个至8个、4至6个、或者2至5个,优选为2个。The method according to any of the preceding claims, wherein the number of the elements c is 2 to 10, 3 to 8, 4 to 6, or 2 to 5, preferably 2. 根据前述任一项权利要求所述的方法,其中所述元件d由3至18个、5至16个、4至10个、或者6至12个核苷酸组成,优选由6个核苷酸组成。The method according to any of the preceding claims, wherein the element d consists of 3 to 18, 5 to 16, 4 to 10, or 6 to 12 nucleotides, preferably consists of 6 nucleotides. 根据前述任一项权利要求所述的方法,其中所述元件d选自GAUAUC、GUAUAC、GAAUCU、GCAUAUGACU或GAUAUCGUAUAC中的任一者。A method according to any preceding claim, wherein the element d is selected from any of GAUAUC, GUAUAC, GAAUCU, GCAUAUGACU or GAUAUCGUAUAC. 根据前述任一项权利要求所述的方法,其中所述元件d的个数为0~5个,优选1~3个,更优选1个。The method according to any one of the preceding claims, wherein the number of the elements d is 0 to 5, preferably 1 to 3, more preferably 1. 根据前述任一项权利要求所述的方法,其中当元件c和元件d同时存在时,所述元件c和元件d的个数总和为2~15个,优选3~5个,更优选3个。The method according to any of the preceding claims, wherein when element c and element d are present at the same time, the total number of the elements c and d is 2 to 15, preferably 3 to 5, more preferably 3. 根据前述任一项权利要求所述的方法,其中,所述poly(A)标签的功能性变体具有选自如下所示结构中的任一种结构,The method according to any of the preceding claims, wherein the functional variant of the poly(A) tag has any one structure selected from the following structures: 元件a-元件c-元件b-元件c-元件b-元件c-元件b-元件c-元件b,component a-component c-component b-component c-component b-component c-component b-component c-component b, 元件b-元件c-元件b-元件c-元件a-元件d-元件b-元件c-元件b-元件c-元件b,Element b-element c-element b-element c-element a-element d-element b-element c-element b-element c-element b, 元件b-元件c-元件b-元件c-元件b-元件d-元件a-元件c,Element b-element c-element b-element c-element b-element d-element a-element c, 元件a-元件d-元件b-元件c-元件b-元件c-元件b,或element a-element d-element b-element c-element b-element c-element b, or 元件b-元件c-元件b-元件c-元件b-元件d-元件a。Element b-element c-element b-element c-element b-element d-element a. 根据前述任一项权利要求所述的方法,其中,所述5’端臂按5’至3’方向包含5’外部同源臂和3’内含子片段,所述第一标签插入在所述5’外部同源臂中,或者插入在所述3’内含子片段中靠近所述5’外部同源臂的5’末端区域中,所述5’末端区域优选为所述3’内含子片段的5’末端的20个核苷酸,更优选15个核苷酸,进一步优选10个核苷酸,或者插入在5'外部同源臂的5’端上游,The method according to any of the preceding claims, wherein the 5' end arm comprises a 5' external homology arm and a 3' intron fragment in the 5' to 3' direction, and the first tag is inserted in the 5' external homology arm, or inserted in the 5' terminal region of the 3' intron fragment close to the 5' external homology arm, the 5' terminal region is preferably 20 nucleotides at the 5' end of the 3' intron fragment, more preferably 15 nucleotides, and further preferably 10 nucleotides, or inserted upstream of the 5' end of the 5' external homology arm, 所述3’端臂按5’至3’方向包含5’内含子片段和3’外部同源臂,所述第二标签插入在所述3’外部同源臂中,或者插入在所述5’内含子片段中靠近所述3’外部同源臂的3’末端区域中,所述3’末端区域优选为所述5’内含子片段的3’末端的20个核苷酸,更优选15个核苷酸,进一步优选10个核苷酸,或者插入在3'外部同源臂的3’端下游,The 3' end arm comprises a 5' intron fragment and a 3' external homology arm in the 5' to 3' direction, and the second tag is inserted in the 3' external homology arm, or inserted in the 3' terminal region of the 5' intron fragment close to the 3' external homology arm, the 3' terminal region is preferably 20 nucleotides at the 3' end of the 5' intron fragment, more preferably 15 nucleotides, and further preferably 10 nucleotides, or inserted downstream of the 3' end of the 3' external homology arm, 优选地, Preferably, 所述第一标签插入在所述5’外部同源臂中,且所述第二标签插入在所述3'外部同源臂的3’端下游;The first tag is inserted into the 5' outer homology arm, and the second tag is inserted downstream of the 3' end of the 3' outer homology arm; 所述第一标签插入在所述5’外部同源臂中,且所述第二标签插入在所述3'外部同源臂中;或The first tag is inserted in the 5' outer homology arm and the second tag is inserted in the 3' outer homology arm; or 所述第一标签插入在所述5’外部同源臂中,且所述第二标签插入在所述5’内含子片段的3’末端区域中;The first tag is inserted in the 5' external homology arm, and the second tag is inserted in the 3' terminal region of the 5' intron fragment; 更优选地,所述第一标签插入在所述5’外部同源臂中除5’端第1个核苷酸残基以外的任意位置,且所述第二标签插入在所述3'外部同源臂的3’端下游;More preferably, the first tag is inserted at any position in the 5' external homology arm except the first nucleotide residue at the 5' end, and the second tag is inserted downstream of the 3' end of the 3' external homology arm; 所述第一标签插入在所述5’外部同源臂中除5’端第1个核苷酸残基以外的任意位置,且所述第二标签插入在所述3'外部同源臂中;或The first tag is inserted in any position of the 5' outer homology arm except the first nucleotide residue at the 5' end, and the second tag is inserted in the 3' outer homology arm; or 所述第一标签插入在所述5’外部同源臂中除5’端第1个核苷酸残基以外的任意位置,且所述第二标签插入在所述5’内含子片段的3’末端区域。The first tag is inserted at any position in the 5' external homology arm except the first nucleotide residue at the 5' end, and the second tag is inserted at the 3' terminal region of the 5' intron fragment. 根据前述任一项权利要求所述的方法,其中,所述环化区域按5’至3’方向依次以彼此可操作地连接的方式包含3’编码区片段、翻译起始元件、5’编码区片段。The method according to any of the preceding claims, wherein the circularization region comprises a 3' coding region fragment, a translation initiation element, and a 5' coding region fragment in a manner operably linked to each other in sequence from 5' to 3' direction. 根据前述任一项权利要求所述的方法,其中,所述环化区域按5’至3’方向依次以彼此可操作地连接的方式包含3’外显子片段、5’内部同源臂、插入片段、3’内部同源臂和5’外显子片段,任选地,所述环化区域在所述插入片段与所述5’内部同源臂之间包含第一间隔区,在所述插入片段与所述3’内部同源臂之间包含第二间隔区。A method according to any of the preceding claims, wherein the circularization region comprises a 3' exon fragment, a 5' internal homology arm, an insert fragment, a 3' internal homology arm and a 5' exon fragment in sequence in a manner operably connected to each other from 5' to 3' direction, and optionally, the circularization region comprises a first spacer region between the insert fragment and the 5' internal homology arm, and a second spacer region between the insert fragment and the 3' internal homology arm. 根据权利要求16所述的方法,其中,所述插入片段包含翻译起始元件,或者包含翻译起始元件和编码区,其中,所述翻译起始元件优选为IRES序列。The method according to claim 16, wherein the inserted fragment comprises a translation initiation element, or comprises a translation initiation element and a coding region, wherein the translation initiation element is preferably an IRES sequence. 根据前述任一项权利要求所述的方法,其中,所述插入片段包含结构基因或其功能性片段或者非编码RNA的序列或其互补序列,所述结构基因编码选自多肽、蛋白质亚单位、蛋白质活性中心、蛋白质或非天然催化基团的蛋白质杂合体、经重组的蛋白质活性亚单位或活性中心、经重组的人工酶或其他主要由氨基酸构成的生物学效应单元,所述非编码RNA选自微小RNA(miRNA)、小干扰RNA(siRNA)、PIWI蛋白相互作用RNA(piRNA)、转运RNA衍生小RNA(tsRNA)、核小RNA(snRNA)、核仁小RNA(snoRNA)、长链非编码RNA(lncRNA)、假基因、ceRNA(competing endogenous RNAs)、microRNA海绵或其他种类非mRNA的RNA。The method according to any of the preceding claims, wherein the inserted fragment comprises a structural gene or a functional fragment thereof or a sequence of a non-coding RNA or its complementary sequence, wherein the structural gene encodes a polypeptide, a protein subunit, a protein active center, a protein or a protein hybrid of a non-natural catalytic group, a recombinant protein active subunit or active center, a recombinant artificial enzyme or other biological effect units mainly composed of amino acids, and the non-coding RNA is selected from microRNA (miRNA), small interfering RNA (siRNA), PIWI protein-interacting RNA (piRNA), transfer RNA-derived small RNA (tsRNA), small nuclear RNA (snRNA), small nucleolar RNA (snoRNA), long non-coding RNA (lncRNA), pseudogene, ceRNA (competing endogenous RNAs), microRNA sponge or other types of non-mRNA RNA. 根据前述任一项权利要求所述的方法,其中,所述5’和3’外部同源臂的长度各自独立地为大于5nt、大于10nt、大于15nt、大于20nt、大于25nt、大于30nt、大于40nt、大于50nt、大于60nt、小于5nt、小于10nt、小于15nt、小于20nt、小于25nt、小于30nt、小于40nt、小于50nt、小于60nt、5~60nt、10~55nt、15~50nt、20~45nt、25~40nt、30~35nt、10nt、15nt、20nt、25nt、30nt、35nt、40nt、45nt、或50nt, The method according to any of the preceding claims, wherein the lengths of the 5' and 3' outer homology arms are each independently greater than 5 nt, greater than 10 nt, greater than 15 nt, greater than 20 nt, greater than 25 nt, greater than 30 nt, greater than 40 nt, greater than 50 nt, greater than 60 nt, less than 5 nt, less than 10 nt, less than 15 nt, less than 20 nt, less than 25 nt, less than 30 nt, less than 40 nt, less than 50 nt, less than 60 nt, 5-60 nt, 10-55 nt, 15-50 nt, 20-45 nt, 25-40 nt, 30-35 nt, 10 nt, 15 nt, 20 nt, 25 nt, 30 nt, 35 nt, 40 nt, 45 nt, or 50 nt, 任选所述5’和3’内含子片段的长度各自独立地为大于5nt、大于10nt、大于15nt、大于20nt、大于25nt、大于30nt、大于40nt、大于50nt、大于60nt、小于5nt、小于10nt、小于15nt、小于20nt、小于25nt、小于30nt、小于40nt、小于50nt、小于60nt、小于70nt、小于80nt、小于90nt、小于100nt、小于150nt、小于200nt、5~200nt、10~150nt、50~200nt、50~150nt、5~60nt、10~55nt、15~50nt、20~45nt、25~40nt、30~35nt、10nt、15nt、20nt、25nt、30nt、35nt、40nt、45nt、或50nt,Optionally, the lengths of the 5' and 3' intron fragments are each independently greater than 5 nt, greater than 10 nt, greater than 15 nt, greater than 20 nt, greater than 25 nt, greater than 30 nt, greater than 40 nt, greater than 50 nt, greater than 60 nt, less than 5 nt, less than 10 nt, less than 15 nt, less than 20 nt, less than 25 nt, less than 30 nt, less than 40 nt, less than 50 nt, less than 60 nt, less than 70 nt, less than 80 nt. , less than 90nt, less than 100nt, less than 150nt, less than 200nt, 5-200nt, 10-150nt, 50-200nt, 50-150nt, 5-60nt, 10-55nt, 15-50nt, 20-45nt, 25-40nt, 30-35nt, 10nt, 15nt, 20nt, 25nt, 30nt, 35nt, 40nt, 45nt, or 50nt, 任选所述5’和3’外显子片段的长度各自独立地为大于5nt、大于10nt、大于15nt、大于20nt、大于25nt、大于30nt、大于40nt、大于50nt、大于60nt、小于5nt、小于10nt、小于15nt、小于20nt、小于25nt、小于30nt、小于40nt、小于50nt、小于60nt、5~60nt、10~55nt、15~50nt、20~45nt、25~40nt、30~35nt、10nt、15nt、20nt、25nt、30nt、35nt、40nt、45nt、或50nt,Optionally, the length of the 5' and 3' exon fragments are each independently greater than 5 nt, greater than 10 nt, greater than 15 nt, greater than 20 nt, greater than 25 nt, greater than 30 nt, greater than 40 nt, greater than 50 nt, greater than 60 nt, less than 5 nt, less than 10 nt, less than 15 nt, less than 20 nt, less than 25 nt, less than 30 nt, less than 40 nt, less than 50 nt, less than 60 nt, 5-60 nt, 10-55 nt, 15-50 nt, 20-45 nt, 25-40 nt, 30-35 nt, 10 nt, 15 nt, 20 nt, 25 nt, 30 nt, 35 nt, 40 nt, 45 nt, or 50 nt, 任选所述5’和3’内部同源臂的长度各自独立地为大于5nt、大于10nt、大于15nt、大于20nt、大于25nt、大于30nt、大于40nt、大于50nt、大于60nt、小于5nt、小于10nt、小于15nt、小于20nt、小于25nt、小于30nt、小于40nt、小于50nt、小于60nt、5~60nt、10~55nt、15~50nt、20~45nt、25~40nt、30~35nt、10nt、15nt、20nt、25nt、30nt、35nt、40nt、45nt、或50nt,Optionally, the lengths of the 5' and 3' internal homology arms are each independently greater than 5 nt, greater than 10 nt, greater than 15 nt, greater than 20 nt, greater than 25 nt, greater than 30 nt, greater than 40 nt, greater than 50 nt, greater than 60 nt, less than 5 nt, less than 10 nt, less than 15 nt, less than 20 nt, less than 25 nt, less than 30 nt, less than 40 nt, less than 50 nt, less than 60 nt, 5-60 nt, 10-55 nt, 15-50 nt, 20-45 nt, 25-40 nt, 30-35 nt, 10 nt, 15 nt, 20 nt, 25 nt, 30 nt, 35 nt, 40 nt, 45 nt, or 50 nt, 任选所述插入片段的长度为大于50nt、大于100nt、大于150nt、大于200nt、大于250nt、大于300nt、大于400nt、大于500nt、大于600nt、大于1k nt、大于1.5k nt、大于2k nt、大于3k nt、小于50nt、小于100nt、小于150nt、小于200nt、小于250nt、小于300nt、小于400nt、小于500nt、小于600nt、小于600nt、小于1k nt、小于1.5k nt、小于2k nt、小于3k nt、50~5knt、50~5k nt、50~4k nt、50~3k nt、50~2k nt、50~1.5k nt、50~1k nt、50~600nt、100~550nt、150~500nt、200~450nt、250~400nt、300~350nt。The length of the inserted fragment is optionally greater than 50 nt, greater than 100 nt, greater than 150 nt, greater than 200 nt, greater than 250 nt, greater than 300 nt, greater than 400 nt, greater than 500 nt, greater than 600 nt, greater than 1k nt, greater than 1.5k nt, greater than 2k nt, greater than 3k nt, less than 50 nt, less than 100 nt, less than 150 nt, less than 200 nt, less than 250 nt, less than 300 nt, less than 400 nt, less than 50 0nt, less than 600nt, less than 600nt, less than 1k nt, less than 1.5k nt, less than 2k nt, less than 3k nt, 50~5knt, 50~5k nt, 50~4k nt, 50~3k nt, 50~2k nt, 50~1.5k nt, 50~1k nt, 50~600nt, 100~550nt, 150~500nt, 200~450nt, 250~400nt, 300~350nt. 根据前述任一项权利要求所述的方法,其中,所述5’外部同源臂具有如SEQ ID NO:1或56所示的序列,所述3’外部同源臂具有如SEQ ID NO:2或57所示的序列。A method according to any of the preceding claims, wherein the 5’ external homology arm has a sequence as shown in SEQ ID NO:1 or 56, and the 3’ external homology arm has a sequence as shown in SEQ ID NO:2 or 57. 根据前述任一项权利要求所述的方法,其中,所述3’内含子片段和所述5’内含子片段来自I型内含子,优选来自蓝藻鱼腥藻属pre-tRNA基因或T4噬菌体Td基因,更优选所述3’内含子片段具有如SEQ ID NO:3所示的序列,所述5’内含子片段具有如SEQ ID NO:4所示的序列。The method according to any of the preceding claims, wherein the 3' intron fragment and the 5' intron fragment are derived from type I introns, preferably from the cyanobacterium Anabaena pre-tRNA gene or the T4 phage Td gene, more preferably the 3' intron fragment has a sequence as shown in SEQ ID NO:3, and the 5' intron fragment has a sequence as shown in SEQ ID NO:4. 根据前述任一项权利要求所述的方法,其中,所述3’内含子片段和所述5’内含子片段来自II型内含子,优选来自梭菌属(Clostridium)如破伤风梭菌(Clostridium tetani)的II型内含子,或者芽孢杆菌属(Bacillus)如苏云金芽孢杆菌(Bacillus thuringiensis)的II型内含子,更优选所述II型内含子为SEQ ID NO:5或6所示核苷酸序列中包含的II型内含子。The method according to any of the preceding claims, wherein the 3' intron fragment and the 5' intron fragment are derived from type II introns, preferably type II introns from Clostridium such as Clostridium tetani, or type II introns from Bacillus such as Bacillus thuringiensis, more preferably the type II intron is a type II intron contained in the nucleotide sequence shown in SEQ ID NO: 5 or 6. 根据前述任一项权利要求所述的方法,其中,所述3’外显子片段和所述5’外显子片段 分别来自天然外显子的3’末端区域和5末端区域,优选来自蓝藻鱼腥藻属pre-tRNA基因或T4噬菌体Td基因,更优选所述3’外显子片段具有如SEQ ID NO:7所示的序列,所述5’外显子片段具有如SEQ ID NO:8所示的序列。The method according to any of the preceding claims, wherein the 3' exon fragment and the 5' exon fragment The 3' terminal region and the 5' terminal region are respectively derived from natural exons, preferably from the cyanobacterium Anabaena pre-tRNA gene or the T4 phage Td gene, more preferably the 3' exon fragment has the sequence shown in SEQ ID NO:7, and the 5' exon fragment has the sequence shown in SEQ ID NO:8. 根据前述任一项权利要求所述的方法,其中,优选所述5’内部同源臂具有如SEQ ID NO:9所示的序列,所述3’内部同源臂具有如SEQ ID NO:10所示的序列。A method according to any of the preceding claims, wherein preferably the 5’ internal homology arm has a sequence as shown in SEQ ID NO:9, and the 3’ internal homology arm has a sequence as shown in SEQ ID NO:10. 根据前述任一项权利要求所述的方法,其中,所述第一间隔区与第二间隔区相同或不同。A method according to any preceding claim, wherein the first spacer is the same as or different from the second spacer. 根据前述任一项权利要求所述的方法,其中,所述亲和层析介质上可操作地连接有能够特异性结合所述第一标签和/或第二标签的亲和配基,优选所述亲和配基选自由多聚X1、多聚X1-X2、多聚X1-X2-X3、多聚X1-X2-X3-X4组成的群组,其中X1、X2、X3、X4分别独立地为A、G、C、T、U中的任一者,更优选选自由Oligo dT、Oligo dC、Oligo dG、Oligo dU组成的群组中的任一者。The method according to any of the preceding claims, wherein an affinity ligand capable of specifically binding to the first tag and/or the second tag is operably linked to the affinity chromatography medium, preferably the affinity ligand is selected from the group consisting of polymer X1, polymer X1 - X2 , polymer X1 - X2 - X3 , polymer X1 - X2 - X3-X4 , wherein X1 , X2 , X3 , X4 are independently any one of A, G, C, T, U, more preferably any one of the group consisting of Oligo dT, Oligo dC, Oligo dG, Oligo dU. 根据前述任一项权利要求所述的方法,其中,所述亲和层析介质选自由磁珠、葡聚糖分子、聚丙烯氨酰胺大分子、大分子的纤维素分子、壳聚糖材料、改性的聚乳酸材料、PET材料、无机硅酸盐材料或其他高分子聚合物组成的群组中的任一者。The method according to any of the preceding claims, wherein the affinity chromatography medium is selected from any one of the group consisting of magnetic beads, dextran molecules, polyacrylamide macromolecules, macromolecular cellulose molecules, chitosan materials, modified polylactic acid materials, PET materials, inorganic silicate materials or other high molecular polymers. 根据前述任一项权利要求所述的方法,其不包括向所述混合物中添加RNA酶R以去除线状RNA的步骤。A method according to any preceding claim, which does not include the step of adding RNase R to the mixture to remove linear RNA. 根据前述任一项权利要求所述的方法,当其包括向所述混合物中添加RNA酶R以去除线状RNA的步骤时,所添加的RNA酶R的量降低至用于纯化未插入所述poly(A)标签或其功能性变体的情况下所用酶量的30~50%,优选40%或50%。The method according to any of the preceding claims, when it includes the step of adding RNase R to the mixture to remove linear RNA, the amount of RNase R added is reduced to 30-50%, preferably 40% or 50% of the amount of enzyme used for purification when the poly (A) tag or its functional variant is not inserted. 根据权利要求1至29中任一项所述的方法制备得到的环状RNA,其中基本不含非环状的RNA分子。The circular RNA prepared by the method according to any one of claims 1 to 29, which substantially does not contain non-circular RNA molecules. 用于权利要求1至29中任一项所述方法中的线状RNA前体。A linear RNA precursor for use in the method according to any one of claims 1 to 29. 线状RNA前体,其特征在于,所述线状RNA前体按5’至3’方向依次以彼此可操作地连接的方式包含5’端臂、3’自剪接位点、环化区域、5’自剪接位点和3’端臂,其中,所述5’端臂中插入了第一标签,所述3’端臂中插入了第二标签;A linear RNA precursor, characterized in that the linear RNA precursor comprises a 5' end arm, a 3' self-splicing site, a cyclization region, a 5' self-splicing site and a 3' end arm in a manner operably connected to each other in sequence from 5' to 3' direction, wherein a first tag is inserted into the 5' end arm and a second tag is inserted into the 3' end arm; 任选地,所述第一标签和/或所述第二标签独立地选自由20~100个、优选30~90个,更优选40~80个、进一步优选45~70个,最优选50~65个连续的腺嘌呤核苷酸组成的poly(A)标签或其功能性变体,并且所述第一标签和所述第二标签在所述环化区域和所述环状RNA中均不存在。Optionally, the first tag and/or the second tag are independently selected from a poly(A) tag consisting of 20 to 100, preferably 30 to 90, more preferably 40 to 80, further preferably 45 to 70, and most preferably 50 to 65 consecutive adenine nucleotides or a functional variant thereof, and the first tag and the second tag are not present in the cyclization region and the circular RNA. 根据权利要求32所述的线状RNA前体,其中,所述poly(A)标签的功能性变体为在所述poly(A)标签中插入1个或多个非A碱基,优选插入1~20个非A碱基,更优选插入1~10个非A碱基。 The linear RNA precursor according to claim 32, wherein the functional variant of the poly (A) tag is an insertion of one or more non-A bases into the poly (A) tag, preferably an insertion of 1 to 20 non-A bases, and more preferably an insertion of 1 to 10 non-A bases. 根据权利要求32或33所述的线状RNA前体,其中,所述poly(A)标签的功能性变体包含The linear RNA precursor according to claim 32 or 33, wherein the functional variant of the poly (A) tag comprises (1)唯一的元件a、至少一个元件b,以及至少一个元件c,(1) a single element a, at least one element b, and at least one element c, (2)唯一的元件a,至少一个元件b,以及至少一个元件d;或者(2) only one element a, at least one element b, and at least one element d; or (3)唯一的元件a,至少一个元件b,以及至少一个元件c和至少一个元件d,(3) a single element a, at least one element b, at least one element c and at least one element d, 其中,所述元件a是由20个以上连续的腺嘌呤核苷酸组成,所述元件b是由3个以上且少于20个连续的腺嘌呤核苷酸组成,所述元件c是由一个选自尿嘧啶核苷酸、胞嘧啶核苷酸、鸟嘌呤核苷酸的核苷酸组成,所述元件d是由2个以上至20个以下的核苷酸组成,所述核苷酸任意地选自腺嘌呤核苷酸、尿嘧啶核苷酸、胞嘧啶核苷酸、鸟嘌呤核苷酸,并且所述元件d不包含3个以上连续腺嘌呤核苷酸,且其5’及3’最末端的核苷酸不是腺嘌呤核苷酸,wherein the element a is composed of more than 20 consecutive adenine nucleotides, the element b is composed of more than 3 and less than 20 consecutive adenine nucleotides, the element c is composed of a nucleotide selected from uracil nucleotides, cytosine nucleotides, and guanine nucleotides, the element d is composed of more than 2 and less than 20 nucleotides, the nucleotides are arbitrarily selected from adenine nucleotides, uracil nucleotides, cytosine nucleotides, and guanine nucleotides, and the element d does not contain more than 3 consecutive adenine nucleotides, and the 5' and 3' terminal nucleotides are not adenine nucleotides, 其中,当所述Poly(A)标签中同时包含两个以上的所述元件b、所述元件c或所述元件d时,每两个元件b的序列可以相同或不同、每两个元件c的序列可以相同或不同,以及每两个元件d的序列可以相同或不同,Wherein, when the Poly(A) tag contains two or more of the element b, the element c or the element d at the same time, the sequences of every two elements b may be the same or different, the sequences of every two elements c may be the same or different, and the sequences of every two elements d may be the same or different. 并且,所述元件a和所述元件b之间、所述元件c和所述元件d之间、所述元件b彼此之间、所述元件c彼此之间、所述元件d彼此之间均不相邻。Furthermore, the element a and the element b, the element c and the element d, the elements b, the elements c, and the elements d are not adjacent to each other. 根据权利要求32至34中任一项所述的线状RNA前体,其中所述元件a由20个以上、80个以下连续的腺嘌呤核苷酸组成,优选由30至70个、35至65个、40至60个、或45至55个连续的腺嘌呤核苷酸组成,更优选由60个连续的腺嘌呤核苷酸组成。The linear RNA precursor according to any one of claims 32 to 34, wherein the element a consists of more than 20 and less than 80 consecutive adenine nucleotides, preferably consists of 30 to 70, 35 to 65, 40 to 60, or 45 to 55 consecutive adenine nucleotides, more preferably consists of 60 consecutive adenine nucleotides. 根据权利要求32至35中任一项所述的线状RNA前体,其中所述元件b由3至10个、10至19个、12至15个、14至17个、或者16至19个,优选由19个连续的腺嘌呤核苷酸组成。The linear RNA precursor according to any one of claims 32 to 35, wherein the element b consists of 3 to 10, 10 to 19, 12 to 15, 14 to 17, or 16 to 19, preferably 19 consecutive adenine nucleotides. 根据权利要求32至36中任一项所述的线状RNA前体,其中所述元件b的个数为2~10个,优选2~5个,进一步优选3个。The linear RNA precursor according to any one of claims 32 to 36, wherein the number of the element b is 2 to 10, preferably 2 to 5, and more preferably 3. 根据权利要求32至37中任一项所述的线状RNA前体,其中所述元件c为鸟嘌呤核苷酸。The linear RNA precursor according to any one of claims 32 to 37, wherein the element c is a guanine nucleotide. 根据权利要求32至38中任一项所述的线状RNA前体,其中所述元件c的个数为2至10个、3个至8个、4至6个、或者2至5个,优选为2个。The linear RNA precursor according to any one of claims 32 to 38, wherein the number of the elements c is 2 to 10, 3 to 8, 4 to 6, or 2 to 5, preferably 2. 根据权利要求32至39中任一项所述的线状RNA前体,其中所述元件d由3至18个、5至16个、4至10个、或者6至12个核苷酸组成,优选由6个核苷酸组成。The linear RNA precursor according to any one of claims 32 to 39, wherein the element d consists of 3 to 18, 5 to 16, 4 to 10, or 6 to 12 nucleotides, preferably consists of 6 nucleotides. 根据权利要求32至40中任一项所述的线状RNA前体,其中所述元件d选自GAUAUC、GUAUAC、GAAUCU、GCAUAUGACU或GAUAUCGUAUAC中的任一者。The linear RNA precursor according to any one of claims 32 to 40, wherein the element d is selected from any one of GAUAUC, GUAUAC, GAAUCU, GCAUAUGACU or GAUAUCGUAUAC. 根据权利要求32至41中任一项所述的线状RNA前体,其中所述元件d的个数为0~5个,优选1~3个,更优选1个。The linear RNA precursor according to any one of claims 32 to 41, wherein the number of the element d is 0 to 5, preferably 1 to 3, and more preferably 1. 根据权利要求32至42中任一项所述的线状RNA前体,其中当元件c和元件d同时存在时,所述元件c和元件d的个数总和为2~15个,优选3~5个,更优选3个。 The linear RNA precursor according to any one of claims 32 to 42, wherein when element c and element d exist at the same time, the total number of element c and element d is 2 to 15, preferably 3 to 5, and more preferably 3. 根据权利要求32至43中任一项所述的线状RNA前体,其中,所述poly(A)标签的功能性变体具有选自如下所示结构中的任一种结构,The linear RNA precursor according to any one of claims 32 to 43, wherein the functional variant of the poly (A) tag has any one structure selected from the following structures, 元件a-元件c-元件b-元件c-元件b-元件c-元件b-元件c-元件b,component a-component c-component b-component c-component b-component c-component b-component c-component b, 元件b-元件c-元件b-元件c-元件a-元件d-元件b-元件c-元件b-元件c-元件b,Element b-element c-element b-element c-element a-element d-element b-element c-element b-element c-element b, 元件b-元件c-元件b-元件c-元件b-元件d-元件a-元件c,Element b-element c-element b-element c-element b-element d-element a-element c, 元件a-元件d-元件b-元件c-元件b-元件c-元件b,或element a-element d-element b-element c-element b-element c-element b, or 元件b-元件c-元件b-元件c-元件b-元件d-元件a;Element b-element c-element b-element c-element b-element d-element a; 优选地,所述poly(A)标签的功能性变体具有选自SEQ ID NO:12至25中任一者所述的核苷酸序列。Preferably, the functional variant of the poly(A) tag has a nucleotide sequence selected from any one of SEQ ID NO:12 to 25. 根据权利要求32至44中任一项所述的线状RNA前体,其中,所述5’端臂按5’至3’方向包含5’外部同源臂和3’内含子片段,所述第一标签插入在所述5’外部同源臂中,或者插入在所述3’内含子片段中靠近所述5’外部同源臂的5’末端区域中,所述5’末端区域优选为所述3’内含子片段的5’末端的20个核苷酸,更优选15个核苷酸,进一步优选10个核苷酸,或者插入在5'外部同源臂的5’端上游,The linear RNA precursor according to any one of claims 32 to 44, wherein the 5' end arm comprises a 5' external homology arm and a 3' intron fragment in the 5' to 3' direction, and the first tag is inserted in the 5' external homology arm, or inserted in the 5' terminal region of the 3' intron fragment close to the 5' external homology arm, the 5' terminal region is preferably 20 nucleotides at the 5' end of the 3' intron fragment, more preferably 15 nucleotides, and further preferably 10 nucleotides, or inserted upstream of the 5' end of the 5' external homology arm, 所述3’端臂按5’至3’方向包含5’内含子片段和3’外部同源臂,所述第二标签插入在所述3’外部同源臂中,或者插入在所述5’内含子片段中靠近所述3’外部同源臂的3’末端区域中,所述3’末端区域优选为所述5’内含子片段的3’末端的20个核苷酸,更优选15个核苷酸,进一步优选10个核苷酸,或者插入在3'外部同源臂的3’端下游,The 3' end arm comprises a 5' intron fragment and a 3' external homology arm in the 5' to 3' direction, and the second tag is inserted in the 3' external homology arm, or inserted in the 3' terminal region of the 5' intron fragment close to the 3' external homology arm, the 3' terminal region is preferably 20 nucleotides at the 3' end of the 5' intron fragment, more preferably 15 nucleotides, and further preferably 10 nucleotides, or inserted downstream of the 3' end of the 3' external homology arm, 优选地,Preferably, 所述第一标签插入在所述5’外部同源臂中,且所述第二标签插入在所述3'外部同源臂的3’端下游;The first tag is inserted into the 5' outer homology arm, and the second tag is inserted downstream of the 3' end of the 3' outer homology arm; 所述第一标签插入在所述5’外部同源臂中,且所述第二标签插入在所述3'外部同源臂中;或The first tag is inserted in the 5' outer homology arm and the second tag is inserted in the 3' outer homology arm; or 所述第一标签插入在所述5’外部同源臂中,且所述第二标签插入在所述5’内含子片段的3’末端区域中;The first tag is inserted in the 5' external homology arm, and the second tag is inserted in the 3' terminal region of the 5' intron fragment; 更优选地,所述第一标签插入在所述5’外部同源臂中除5’端第1个核苷酸残基以外的任意位置,且所述第二标签插入在所述3'外部同源臂的3’端下游;More preferably, the first tag is inserted at any position in the 5' external homology arm except the first nucleotide residue at the 5' end, and the second tag is inserted downstream of the 3' end of the 3' external homology arm; 所述第一标签插入在所述5’外部同源臂中除5’端第1个核苷酸残基以外的任意位置,且所述第二标签插入在所述3'外部同源臂中;或The first tag is inserted in any position of the 5' outer homology arm except the first nucleotide residue at the 5' end, and the second tag is inserted in the 3' outer homology arm; or 所述第一标签插入在所述5’外部同源臂中除5’端第1个核苷酸残基以外的任意位置,且所述第二标签插入在所述5’内含子片段的3’末端区域。The first tag is inserted at any position in the 5' external homology arm except the first nucleotide residue at the 5' end, and the second tag is inserted at the 3' terminal region of the 5' intron fragment. 根据权利要求32至45中任一项所述的线状RNA前体,其中,所述环化区域按5’至3’方向依次以彼此可操作地连接的方式包含3’编码区片段、翻译起始元件、5’编码区片段。 The linear RNA precursor according to any one of claims 32 to 45, wherein the cyclization region comprises a 3' coding region fragment, a translation initiation element, and a 5' coding region fragment in a manner operably linked to each other in sequence from 5' to 3' direction. 根据权利要求32至46中任一项所述的线状RNA前体,其中,所述环化区域按5’至3’方向依次以彼此可操作地连接的方式包含3’外显子片段、5’内部同源臂、插入片段、3’内部同源臂和5’外显子片段,任选地,所述环化区域在所述插入片段与所述5’内部同源臂之间包含第一间隔区,在所述插入片段与所述3’内部同源臂之间包含第二间隔区。The linear RNA precursor according to any one of claims 32 to 46, wherein the cyclization region comprises a 3' exon fragment, a 5' internal homology arm, an insert fragment, a 3' internal homology arm and a 5' exon fragment in a manner operably connected to each other in sequence from 5' to 3' direction, and optionally, the cyclization region comprises a first spacer region between the insert fragment and the 5' internal homology arm, and a second spacer region between the insert fragment and the 3' internal homology arm. 根据权利要求47所述的线状RNA前体,其中,所述插入片段包含翻译起始元件,或者包含翻译起始元件和编码区,其中,所述翻译起始元件优选为IRES序列。The linear RNA precursor according to claim 47, wherein the inserted fragment comprises a translation initiation element, or comprises a translation initiation element and a coding region, wherein the translation initiation element is preferably an IRES sequence. 根据权利要求32至48中任一项所述的线状RNA前体,其中,所述插入片段包含结构基因或其功能性片段的编码序列或者非编码RNA的序列或其互补序列,所述结构基因选自多肽、蛋白质亚单位、蛋白质活性中心、蛋白质或非天然催化基团的蛋白质杂合体、经重组的蛋白质活性亚单位或活性中心/、经重组的人工酶或其他主要由氨基酸构成的生物学效应单元,所述非编码RNA选自微小RNA(miRNA)、小干扰RNA(siRNA)、PIWI蛋白相互作用RNA(piRNA)、转运RNA衍生小RNA(tsRNA)、核小RNA(snRNA)、核仁小RNA(snoRNA)、长链非编码RNA(lncRNA)、假基因、ceRNA(competing endogenous RNAs)、microRNA海绵或其他种类非mRNA的RNA。The linear RNA precursor according to any one of claims 32 to 48, wherein the inserted fragment comprises the coding sequence of a structural gene or its functional fragment or the sequence of a non-coding RNA or its complementary sequence, the structural gene is selected from a polypeptide, a protein subunit, a protein active center, a protein or a protein hybrid of a non-natural catalytic group, a recombinant protein active subunit or active center/, a recombinant artificial enzyme or other biological effect units mainly composed of amino acids, and the non-coding RNA is selected from microRNA (miRNA), small interfering RNA (siRNA), PIWI protein-interacting RNA (piRNA), transfer RNA-derived small RNA (tsRNA), small nuclear RNA (snRNA), small nucleolar RNA (snoRNA), long non-coding RNA (lncRNA), pseudogene, ceRNA (competing endogenous RNAs), microRNA sponge or other types of non-mRNA RNA. 根据权利要求32至49中任一项所述的线状RNA前体,其中,所述5’和3’外部同源臂的长度各自独立地为大于5nt、大于10nt、大于15nt、大于20nt、大于25nt、大于30nt、大于40nt、大于50nt、大于60nt、小于5nt、小于10nt、小于15nt、小于20nt、小于25nt、小于30nt、小于40nt、小于50nt、小于60nt、5~60nt、10~55nt、15~50nt、20~45nt、25~40nt、30~35nt、10nt、15nt、20nt、25nt、30nt、35nt、40nt、45nt、或50nt,The linear RNA precursor according to any one of claims 32 to 49, wherein the lengths of the 5' and 3' outer homology arms are each independently greater than 5 nt, greater than 10 nt, greater than 15 nt, greater than 20 nt, greater than 25 nt, greater than 30 nt, greater than 40 nt, greater than 50 nt, greater than 60 nt, less than 5 nt, less than 10 nt, less than 15 nt, less than 20 nt, less than 25 nt, less than 30 nt, less than 40 nt, less than 50 nt, less than 60 nt, 5-60 nt, 10-55 nt, 15-50 nt, 20-45 nt, 25-40 nt, 30-35 nt, 10 nt, 15 nt, 20 nt, 25 nt, 30 nt, 35 nt, 40 nt, 45 nt, or 50 nt, 任选所述5’和3’内含子片段的长度各自独立地为大于5nt、大于10nt、大于15nt、大于20nt、大于25nt、大于30nt、大于40nt、大于50nt、大于60nt、小于70nt、小于80nt、小于90nt、小于100nt、小于150nt、小于200nt、5~200nt、10~150nt、50~200nt、50~150nt、小于5nt、小于10nt、小于15nt、小于20nt、小于25nt、小于30nt、小于40nt、小于50nt、小于60nt、5~60nt、10~55nt、15~50nt、20~45nt、25~40nt、30~35nt、10nt、15nt、20nt、25nt、30nt、35nt、40nt、45nt、或50nt,The lengths of the 5' and 3' intron fragments are each independently greater than 5 nt, greater than 10 nt, greater than 15 nt, greater than 20 nt, greater than 25 nt, greater than 30 nt, greater than 40 nt, greater than 50 nt, greater than 60 nt, less than 70 nt, less than 80 nt, less than 90 nt, less than 100 nt, less than 150 nt, less than 200 nt, 5-200 nt, 10-150 nt, 50-200 nt, 50-1 50nt, less than 5nt, less than 10nt, less than 15nt, less than 20nt, less than 25nt, less than 30nt, less than 40nt, less than 50nt, less than 60nt, 5-60nt, 10-55nt, 15-50nt, 20-45nt, 25-40nt, 30-35nt, 10nt, 15nt, 20nt, 25nt, 30nt, 35nt, 40nt, 45nt, or 50nt, 任选所述5’和3’外显子片段的长度各自独立地为大于5nt、大于10nt、大于15nt、大于20nt、大于25nt、大于30nt、大于40nt、大于50nt、大于60nt、小于5nt、小于10nt、小于15nt、小于20nt、小于25nt、小于30nt、小于40nt、小于50nt、小于60nt、5~60nt、10~55nt、15~50nt、20~45nt、25~40nt、30~35nt、10nt、15nt、20nt、25nt、30nt、35nt、40nt、45nt、或50nt,Optionally, the length of the 5' and 3' exon fragments are each independently greater than 5 nt, greater than 10 nt, greater than 15 nt, greater than 20 nt, greater than 25 nt, greater than 30 nt, greater than 40 nt, greater than 50 nt, greater than 60 nt, less than 5 nt, less than 10 nt, less than 15 nt, less than 20 nt, less than 25 nt, less than 30 nt, less than 40 nt, less than 50 nt, less than 60 nt, 5-60 nt, 10-55 nt, 15-50 nt, 20-45 nt, 25-40 nt, 30-35 nt, 10 nt, 15 nt, 20 nt, 25 nt, 30 nt, 35 nt, 40 nt, 45 nt, or 50 nt, 任选所述5’和3’内部同源臂的长度各自独立地为大于5nt、大于10nt、大于15nt、大于20nt、大于25nt、大于30nt、大于40nt、大于50nt、大于60nt、小于5nt、小于10nt、小于15nt、小于20nt、小于25nt、小于30nt、小于40nt、小于50nt、小于60nt、5~60nt、10~55nt、 15~50nt、20~45nt、25~40nt、30~35nt、10nt、15nt、20nt、25nt、30nt、35nt、40nt、45nt、或50nt,Optionally, the lengths of the 5' and 3' internal homology arms are each independently greater than 5 nt, greater than 10 nt, greater than 15 nt, greater than 20 nt, greater than 25 nt, greater than 30 nt, greater than 40 nt, greater than 50 nt, greater than 60 nt, less than 5 nt, less than 10 nt, less than 15 nt, less than 20 nt, less than 25 nt, less than 30 nt, less than 40 nt, less than 50 nt, less than 60 nt, 5 to 60 nt, 10 to 55 nt, 15-50 nt, 20-45 nt, 25-40 nt, 30-35 nt, 10 nt, 15 nt, 20 nt, 25 nt, 30 nt, 35 nt, 40 nt, 45 nt, or 50 nt, 任选所述插入片段的长度为大于50nt、大于100nt、大于150nt、大于200nt、大于250nt、大于300nt、大于400nt、大于500nt、大于600nt、大于1k nt、大于1.5k nt、大于2k nt、大于3k nt、小于50nt、小于100nt、小于150nt、小于200nt、小于250nt、小于300nt、小于400nt、小于500nt、小于600nt、小于600nt、小于1k nt、小于1.5k nt、小于2k nt、小于3k nt、50~5knt、50~5k nt、50~4k nt、50~3k nt、50~2k nt、50~1.5k nt、50~1k nt、50~600nt、100~550nt、150~500nt、200~450nt、250~400nt、300~350nt。The length of the inserted fragment is optionally greater than 50 nt, greater than 100 nt, greater than 150 nt, greater than 200 nt, greater than 250 nt, greater than 300 nt, greater than 400 nt, greater than 500 nt, greater than 600 nt, greater than 1k nt, greater than 1.5k nt, greater than 2k nt, greater than 3k nt, less than 50 nt, less than 100 nt, less than 150 nt, less than 200 nt, less than 250 nt, less than 300 nt, less than 400 nt, less than 50 0nt, less than 600nt, less than 600nt, less than 1k nt, less than 1.5k nt, less than 2k nt, less than 3k nt, 50~5knt, 50~5k nt, 50~4k nt, 50~3k nt, 50~2k nt, 50~1.5k nt, 50~1k nt, 50~600nt, 100~550nt, 150~500nt, 200~450nt, 250~400nt, 300~350nt. 根据权利要求32至50中任一项所述的线状RNA前体,其中,所述5’外部同源臂具有如SEQ ID NO:1或56所示的序列,所述3’外部同源臂具有如SEQ ID NO:2或57所示的序列。A linear RNA precursor according to any one of claims 32 to 50, wherein the 5' external homologous arm has a sequence as shown in SEQ ID NO:1 or 56, and the 3' external homologous arm has a sequence as shown in SEQ ID NO:2 or 57. 根据权利要求32至51中任一项所述的线状RNA前体,其中,所述3’内含子片段和所述5’内含子片段来自I型内含子,优选来自蓝藻鱼腥藻属pre-tRNA基因或T4噬菌体Td基因,更优选所述3’内含子片段具有如SEQ ID NO:3所示的序列,所述5’内含子片段具有如SEQ ID NO:4所示的序列。The linear RNA precursor according to any one of claims 32 to 51, wherein the 3' intron fragment and the 5' intron fragment are derived from type I introns, preferably from the cyanobacterium Anabaena pre-tRNA gene or the T4 phage Td gene, more preferably the 3' intron fragment has a sequence as shown in SEQ ID NO:3, and the 5' intron fragment has a sequence as shown in SEQ ID NO:4. 根据权利要求32至52中任一项所述的线状RNA前体,其中,所述3’内含子片段和所述5’内含子片段来自II型内含子,优选来自梭菌属(Clostridium)如破伤风梭菌(Clostridium tetani)的II型内含子,或者芽孢杆菌属(Bacillus)如苏云金芽孢杆菌(Bacillus thuringiensis)的II型内含子,更优选所述II型内含子为SEQ ID NO:5或6所示核苷酸序列中包含的II型内含子。The linear RNA precursor according to any one of claims 32 to 52, wherein the 3' intron fragment and the 5' intron fragment are derived from type II introns, preferably type II introns from Clostridium such as Clostridium tetani, or type II introns from Bacillus such as Bacillus thuringiensis, more preferably the type II intron is a type II intron contained in the nucleotide sequence shown in SEQ ID NO: 5 or 6. 根据权利要求32至53中任一项所述的线状RNA前体,其中,所述3’外显子片段和所述5’外显子片段分别来自天然外显子的3’末端区域和5末端区域,优选来自蓝藻鱼腥藻属pre-tRNA基因或T4噬菌体Td基因,更优选所述3’外显子片段具有如SEQ ID NO:7所示的序列,所述5’外显子片段具有如SEQ ID NO:8所示的序列。The linear RNA precursor according to any one of claims 32 to 53, wherein the 3' exon fragment and the 5' exon fragment are respectively derived from the 3' terminal region and the 5' terminal region of the natural exon, preferably from the cyanobacterium Anabaena pre-tRNA gene or the T4 phage Td gene, more preferably the 3' exon fragment has a sequence as shown in SEQ ID NO:7, and the 5' exon fragment has a sequence as shown in SEQ ID NO:8. 根据权利要求32至54中任一项所述的线状RNA前体,其中,优选所述5’内部同源臂具有如SEQ ID NO:9所示的序列,所述3’内部同源臂具有如SEQ ID NO:10所示的序列。The linear RNA precursor according to any one of claims 32 to 54, wherein the 5' internal homology arm preferably has a sequence as shown in SEQ ID NO:9, and the 3' internal homology arm has a sequence as shown in SEQ ID NO:10. 根据权利要求32至55中任一项所述的线状RNA前体,其中,所述第一间隔区与第二间隔区相同或不同。The linear RNA precursor according to any one of claims 32 to 55, wherein the first spacer is the same as or different from the second spacer. 根据权利要求32至56中任一项所述的线状RNA前体,其具有选自SEQ ID NO:26~49和51~55中任一者所述的核苷酸序列。The linear RNA precursor according to any one of claims 32 to 56, which has a nucleotide sequence selected from any one of SEQ ID NO: 26 to 49 and 51 to 55. 一种核酸序列,其能够转录产生用于权利要求1至29中任一项所述方法中的线状RNA前体或权利要求31至57中任一项所述的线状RNA前体,其优选还以可操作地连接的方式包含转录产生所述线状RNA前体所必需的调控序列,例如启动子、终止子、转录因子结 合位点、非编译区(UTR)、增强子、回文序列、顺式作用元件、反式作用元件、TATA框、CAAT框、操纵子、转座子。A nucleic acid sequence capable of being transcribed to produce a linear RNA precursor for use in the method of any one of claims 1 to 29 or a linear RNA precursor of any one of claims 31 to 57, which preferably further comprises in an operably linked manner a regulatory sequence necessary for transcription to produce the linear RNA precursor, such as a promoter, a terminator, a transcription factor binding site ... Zygote site, untranslated region (UTR), enhancer, palindromic sequence, cis-acting element, trans-acting element, TATA box, CAAT box, operator, transposon. 一种载体,其包含权利要求58所述的核酸序列。A vector comprising the nucleic acid sequence of claim 58. 根据权利要求59所述的载体,其中,所述载体为线状DNA、质粒、病毒核酸片段或细胞基因组DNA片段。The vector according to claim 59, wherein the vector is a linear DNA, a plasmid, a viral nucleic acid fragment or a cell genomic DNA fragment. 一种工程化细胞,其包含用于权利要求1至29中任一项所述方法中的线状RNA前体、权利要求30所述的环状RNA、权利要求31至57中任一项所述的线状RNA前体、权利要求58所述的核酸序列、或者权利要求59或60所述的载体。An engineered cell comprising the linear RNA precursor used in the method of any one of claims 1 to 29, the circular RNA of claim 30, the linear RNA precursor of any one of claims 31 to 57, the nucleic acid sequence of claim 58, or the vector of claim 59 or 60. 一种组合物,其包含用于权利要求1至29中任一项所述方法中的线状RNA前体、权利要求30所述的环状RNA、权利要求31至57中任一项所述的线状RNA前体、权利要求58所述的核酸序列、权利要求59或60所述的载体、或者权利要求61所述的工程化细胞。A composition comprising a linear RNA precursor used in the method of any one of claims 1 to 29, a circular RNA according to claim 30, a linear RNA precursor according to any one of claims 31 to 57, a nucleic acid sequence according to claim 58, a vector according to claim 59 or 60, or an engineered cell according to claim 61. 根据用于权利要求1至29中任一项所述方法中的线状RNA前体、权利要求30所述的环状RNA、权利要求31至57中任一项所述的线状RNA前体、权利要求58所述的核酸序列、权利要求59或60所述的载体、或者权利要求61所述的工程化细胞用于制备环状RNA的用途。Use of a linear RNA precursor used in the method of any one of claims 1 to 29, a circular RNA according to claim 30, a linear RNA precursor according to any one of claims 31 to 57, a nucleic acid sequence according to claim 58, a vector according to claim 59 or 60, or an engineered cell according to claim 61 for preparing circular RNA. 根据用于权利要求1至29中任一项所述方法中的线状RNA前体、权利要求30所述的环状RNA、权利要求31至57中任一项所述的线状RNA前体、权利要求58所述的核酸序列、权利要求59或60所述的载体、或者权利要求61所述的工程化细胞用于制备药物、细胞毒性剂或免疫调节制剂中的用途。Use of a linear RNA precursor used in the method of any one of claims 1 to 29, a circular RNA according to claim 30, a linear RNA precursor according to any one of claims 31 to 57, a nucleic acid sequence according to claim 58, a vector according to claim 59 or 60, or an engineered cell according to claim 61 for preparing a drug, a cytotoxic agent or an immunomodulatory preparation. 根据权利要求64所述的用途,所述治疗性药物、细胞毒性剂或免疫调节制剂选自病毒、多能或专能干细胞、iPS、工程化免疫细胞、抗体或抗体片段、偶联药物的抗体或抗体片段、化疗剂、免疫抑制或调节剂、抗感染药物、抗癌剂、降糖药物、心脑血管系统疾病治疗药物、退行性神经疾病药物、肥胖症治疗药物、血液学疾病治疗药物、呼吸系统疾病治疗医药、或逆转录病毒疾病治疗药物。According to the use of claim 64, the therapeutic drug, cytotoxic agent or immunomodulatory preparation is selected from viruses, pluripotent or multipotent stem cells, iPS, engineered immune cells, antibodies or antibody fragments, drug-conjugated antibodies or antibody fragments, chemotherapeutic agents, immunosuppressive or modulatory agents, anti-infective drugs, anticancer agents, hypoglycemic drugs, cardiovascular and cerebrovascular disease therapeutic drugs, degenerative neurological disease drugs, obesity therapeutic drugs, hematological disease therapeutic drugs, respiratory disease therapeutic drugs, or retroviral disease therapeutic drugs. 施用环状RNA的方法,其包括向有需要的生物体施用有效量的用于权利要求1至29中任一项所述方法中的线状RNA前体、根据权利要求30所述的环状RNA、权利要求31至57中任一项所述的线状RNA前体、权利要求58所述的核酸序列、权利要求59或60所述的载体、权利要求61所述的工程化细胞、或者使用权利要求62所述的组合物制备得到的环状RNA。 A method for administering circular RNA, comprising administering to an organism in need thereof an effective amount of a linear RNA precursor used in the method of any one of claims 1 to 29, a circular RNA according to claim 30, a linear RNA precursor according to any one of claims 31 to 57, a nucleic acid sequence according to claim 58, a vector according to claim 59 or 60, an engineered cell according to claim 61, or a circular RNA prepared using the composition according to claim 62.
PCT/CN2024/104101 2023-07-07 2024-07-05 Method for preparing circular rna and nucleic acid sequence for method WO2025011502A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CNPCT/CN2023/106434 2023-07-07
CN2023106434 2023-07-07

Publications (1)

Publication Number Publication Date
WO2025011502A1 true WO2025011502A1 (en) 2025-01-16

Family

ID=94214649

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2024/104101 WO2025011502A1 (en) 2023-07-07 2024-07-05 Method for preparing circular rna and nucleic acid sequence for method

Country Status (1)

Country Link
WO (1) WO2025011502A1 (en)

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110283895A (en) * 2019-07-16 2019-09-27 福建农林大学 The method of efficiently purifying and identification circular rna based on two generation Illumina microarray datasets
CN114507691A (en) * 2022-03-02 2022-05-17 深圳市瑞吉生物科技有限公司 Carrier for preparing circular RNA and application thereof
CN115052635A (en) * 2019-12-04 2022-09-13 奥纳治疗公司 Circular RNA compositions and methods
CN116034114A (en) * 2020-03-20 2023-04-28 奥纳治疗公司 Circular RNA compositions and methods
WO2023073228A1 (en) * 2021-10-29 2023-05-04 CureVac SE Improved circular rna for expressing therapeutic proteins
CN116113419A (en) * 2020-05-08 2023-05-12 奥纳治疗公司 Circular RNA compositions and methods
CN116322788A (en) * 2020-05-19 2023-06-23 奥纳治疗公司 Cyclic RNA compositions and methods

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110283895A (en) * 2019-07-16 2019-09-27 福建农林大学 The method of efficiently purifying and identification circular rna based on two generation Illumina microarray datasets
CN115052635A (en) * 2019-12-04 2022-09-13 奥纳治疗公司 Circular RNA compositions and methods
CN116034114A (en) * 2020-03-20 2023-04-28 奥纳治疗公司 Circular RNA compositions and methods
CN116113419A (en) * 2020-05-08 2023-05-12 奥纳治疗公司 Circular RNA compositions and methods
CN116322788A (en) * 2020-05-19 2023-06-23 奥纳治疗公司 Cyclic RNA compositions and methods
WO2023073228A1 (en) * 2021-10-29 2023-05-04 CureVac SE Improved circular rna for expressing therapeutic proteins
CN114507691A (en) * 2022-03-02 2022-05-17 深圳市瑞吉生物科技有限公司 Carrier for preparing circular RNA and application thereof

Similar Documents

Publication Publication Date Title
US11203767B2 (en) Circular RNA for translation in eukaryotic cells
US20210299280A1 (en) Compositions and methods for transient gene therapy with enhanced stability
JP7618577B2 (en) Compositions Comprising Modified Circular Polyribonucleotides and Uses Thereof
JP2023134670A (en) CRISPR-based compositions and methods of use
CN114787357A (en) Antisense guide RNA to which functional region is added for editing target RNA
KR101605932B1 (en) Organic compositions to treat hsf1-related diseases
CA3215367A1 (en) Fn3 domain-sirna conjugates and uses thereof
WO2022138929A1 (en) Guide rna for editing polyadenylation signal sequence of target rna
TW202408542A (en) Methods of making circular RNA
JP2025515996A (en) FN3 domain-siRNA conjugates for enzyme replacement therapy
WO2025011502A1 (en) Method for preparing circular rna and nucleic acid sequence for method
JP2024513131A (en) oligonucleotide
CN118652889B (en) A circular ribozyme system and its application
HK40076284A (en) Antisense guide rna with added functional region for editing target rna
CN113151275A (en) shRNA for inhibiting expression of hsa _ circ _0001610 and expression vector thereof
HK1253674B (en) Compositions and methods for transient gene therapy with enhanced stability

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 24838755

Country of ref document: EP

Kind code of ref document: A1