Detailed Description
It should be noted that, without conflict, the embodiments of the present application and features of the embodiments may be combined with each other. The present application will be described in detail with reference to examples.
Term interpretation:
N+1 impurities nucleic acid impurities having a single nucleotide attached in addition to the target synthetic sequence.
N-1 impurities nucleic acid impurities having a single nucleotide deletion compared to the target synthetic sequence.
As mentioned in the background art, the preparation of QPI-1002 in the prior art is carried out by adopting a chemical synthesis preparation method, so that the process is complex, the cost is high, and the generated N+1 and N-1 impurities are more, thereby influencing the subsequent purification of the product. In the present application, the inventors have attempted to develop a method for preparing QPI-1002 by preparing QPI-1002 using enzymatic synthesis, and thus have proposed a series of protection schemes of the present application.
In a first exemplary embodiment of the application, a method of preparing QPI-1002 is provided, the QPI-1002 being a double stranded RNA consisting of complementarily paired sense and antisense strands; the preparation method comprises the steps of mixing a sense strand substrate fragment, an antisense strand substrate fragment and RNA ligase, wherein the sense strand substrate fragment can form a sense strand, the antisense strand substrate fragment can form an antisense strand, the sense strand substrate fragment and the antisense strand substrate fragment are connected through hydrogen bonds formed by base complementation, the head base and the tail base of the sense strand substrate fragment and the antisense strand substrate fragment are not connected with each other to form a double-stranded nucleotide structure containing a notch, the bases at the two ends of the notch are connected through a phosphodiester bond by utilizing the RNA ligase to form QPI-1002, the bases at the two ends of the notch are respectively a 5 'end and a 3' end of different substrate fragments, the 5 'end is a phosphate radical, the 3' end is a hydroxyl radical, the RNA ligase is utilized to connect the phosphate radical at the 5 'end at the upstream and the downstream of the notch and the hydroxyl at the 3' end of the antisense strand substrate fragment, the RNA ligase comprises RNA ligase family 1 and/or RNA ligase of RNA ligase family 2, preferably, the RNA ligase of the RNA ligase family 1 is RNA ligase of the RNA ligase, the RNA ligase has the RNA ligase of the RNA ligase family 1, the RNA ligase of the RNA ligase 1 is the RNA ligase of the SEQ ID 2, and the RNA ligase has the RNA ligase 1 or the RNA ligase 1 shown in the SEQ 1-5-3-5-SEQ 1.
In the above preparation method, the sense strand substrate fragment is a nucleotide sequence of 2 or more that can constitute the sense strand, i.e., a plurality of nucleotide sequences of the sense strand substrate fragment can be spliced to constitute the same sequence as the sense strand sequence, and is different from the sense strand in that there is a nick between the sense strand substrate fragments, which are not linked by a phosphodiester bond. Similarly, the antisense strand substrate fragment and antisense strand have the features described above. The sense strand and the antisense strand of QPI-1002 are obtained by connecting 2 or more sense strand substrate fragments or antisense strand substrate fragments with each other by phosphodiester bonds by RNA ligase.
In the above preparation method, QPI-1002 can be prepared by mixing the sense strand substrate fragment and the antisense strand substrate fragment with RNA ligase. In the preparation method, the sense strand substrate fragment and the antisense strand substrate fragment can be complementarily paired to form a double-stranded structure nucleotide containing a sticky end, the double-stranded structure nucleotide containing the sticky end is continuously combined with other substrate fragments to form a double-stranded nucleotide containing an nick, and the RNA ligase can recognize the nick of the double-stranded structure and connect the nick by a phosphodiester bond, so that the target product QPI-1002 is prepared.
In a preferred embodiment, the nucleotide sequence of the sense strand is a polynucleotide having the nucleotide sequence shown in SEQ ID NO. 23 and the nucleotide sequence of the antisense strand is a polynucleotide having the nucleotide sequence shown in SEQ ID NO. 24.
SEQ ID NO:23:
GAmGAmAUmAUmUUmCAmCCmCUmUCmA。
SEQ ID NO:24:
UmGAmAGmGGmUGmAAmAUmAUmUCmUCm。
In the present application, A, C, G or m after U represents 2' methoxy modification of the ribonucleotide.
In a preferred embodiment, the RNA ligase comprises RNA ligase of RNA ligase family 1 and/or RNA ligase family 2, preferably the RNA ligase of RNA ligase family 1 is the RNA ligase shown in SEQ ID NO:5, the RNA ligase of RNA ligase family 2 is selected from one or more of the RNA ligases shown in SEQ ID NO:1, SEQ ID NO:2, SEQ ID NO:3 or SEQ ID NO:4, or an enzyme having more than 70% identity, including but not limited to 75%, 80%, 85%, 90%, 95%, 99% (such as 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98.5%, 99.5%, 99.6%, 99.7%, 99.8% or more, even 99.9% or more) and having catalytic phosphodiester bond formation activity with any of the RNA ligases shown in SEQ ID NO: 1-SEQ ID NO: 5.
The RNA ligase can recognize a double-chain structure with a notch formed by complementary pairing of substrate fragments, so that the formation of a phosphodiester bond between a phosphate group and a hydroxyl group is catalyzed.
SEQ ID NO:1(Ligase 25,Vibrio phage NT-1):
MSFVKYTSLENSYRQAFVDKCDMLGVRDWVALEKIHGANFSFIVEFDGGYTVTPAKRTSIIGATATGDYDFYGCTSVVEAHKEKVELVANFLWLNEYINLYEPIIIYGELAGKGIQKEVNYGDKDFWAFDIFLPQREEFVDWDTCVAAFTNAEIKYTKELARGTLDELLRIDPLFKSLHTPAEHEGDNVAEGFVVKQLHSEKRLQSGSRAILKVKNEKFKEKKKKEGKTPTKLVLTPEQEKLHAEFSCYLTENRLKNVLSKLGTVNQKQFGMISGLFVKDAKDEFERDELNEVAIDRDDWNAIRRSLTNIANEILRKNWLNILDGNF.
SEQ ID NO:2(Ligase 26,Escherichia phage AR1):
MQELFNNLMELCKDSQRKFFYSDDVSASGRTYRIFSYNYASYSDWLLPDALECRGIMFEMDGEKPVRIASRPMEKFFNLNENPFTMNIDLNDVDYILTKEDGSLVSTYLDGDEILFKSKGSIKSEQALMANGILMNINHHQLRDRLKELAEDGFTANFEFVAPTNRIVLAYQEMKIILLNIRENETGEYISYDDIYKDAALRPYLVERYEIDSPKWVEEAKNAENIEGYVAVMKDGSHFKIKSDWYVSLHSTKSSLDNPEKLFKTIIDGASDDLKAMYADDEYSYRKIEAFETTYLKYLDRALFLVLDCHNKHCGKDRKTYAMEAQGVAKGAGMDHLFGIIMSLYQGYDSQEKVMCEIEQNFLKNYKKFIPEGY.
SEQ ID NO:3(Ligase 31,Vibrio phage VH12019):
MTTQELYNHLMTLTDDAEGKFFFADHISPLGEKLRVFSYHIASYSDWLLPGALEARGIMFQLDEQDKMVRIVSRPMEKFFNLNENPFTMDLDLTTTVQLMDKADGSLISTYLTGENFALKSKTSIFSEQAVAANRYIKLPENRDLWEFCDDLTQAGCTVNMEWCAPNNRIVLEYPEAKLVILNIRDNETGDYVSFDDIPLPALMRVKKWLVDEYDPETAHADDFVEKLRATKGIEGMILRLANGQSVKIKTQWYVDLHSQKDSVNVPKKLVTTILNNNHDDLYALFADDKPTIDRIREFDSHVSKTVSASFHAVSQFYVKNRHMSRKDYAIAGQKTLKPWEFGVAMIAYQNQTVEGVYEALVGAYLKRPELLIPEKYLNEA.
SEQ ID NO:4:(Ligase 41,Vibrio phage VH7D):
MNVQELYKNLMSLADDAEGKFFFADHLSPLGEKFRVFSYHIASYSDWLLPGALEARGIMFQLDDNDEMIRIVSRPMEKFFNLNENPFTMELDLTTTVQLMDKADGSLISTYLSGENFALKSKTSIFSEQAVAANRYIKKPENRDLWEFCDDCTQAGLTVNMEWCAPNNRIVLEYPEAKLVILNIRDNETGDYVSFDDIPQSALMRVKQWLVDEYDPATAHEPDFVEKLRDTKGIEGMILRLANGQSVKIKTQWYVDLHSQKDSVNVPKKLVTTILNGNHDDLYALFADDKPTIERIREFDSHVTKTLTNSFNAVRQFYARNRHLARKDYAIAGQKVLKPWEFGVAMIAYQKQTVEGVYESLVTAYLKRPELAIPEKYLNGV.
SEQ ID NO:5(Ligase 42,Escherichia phage JN02):
MEKLYYNLLSLCKSSSDRKFFYSDDVSPIGKKYRIFSYNFASYSDWLLPDALECRGIMFEMDGETPVRIASRPMEKFFNLNENPFTLSINLDDVKYLMTKEDGSLVSTYLDGGTVRFKSKGSIKSDQAVSATSILLDIDHKNLADRLLELCNDGFTANFEYVAPTNKIVLTYPEKRLILLNIRDNNTGEYIEYDDIYLDPVFRKYLVDRFEVPEGDWTSDVKSSTNIEGYVAVMKDGSHFKLKTDWYVALHTTRDSISSPEKLFLAIVNGASDDLKAMYADDEFSFKKVELFEKAYLDFLDRSFYICLDTYDKHKGKDRKTYAIEAQAVCKGAQTPWLFGIIMNLYQGGSKEQMMTALESVFIKNHKNFIPEGY.
SEQ ID NO:6:(Ligase 11,Thermococcus):
MVSSYFRNLLLKLGLPEERLEVLEGKGALAEDEFEGIRYVRFRDSARNFRRGTVVFETGEAVLGFPHIKRVVQLENGIRRVFKNKPFYVEEKVDGYNVRVVKVKDKILAITRGGFVCPFTTERIEDFVNFDFFKDYPNLVLVGEMAGPESPYLVEGPPYVKEDIEFFLFDIQEKGTGRSLPAEERYRLAEEYGIPQVERFGLYDSSKVGELKELIEWLSEEKREGIVMKSPDMRRIAKYVTPYANINDIKIGSHIFFDLPHGYFMGRIKRLAFYLAENHVRGEEFENYAKALGTALLRPFVESIHEVANGGEVDETFTVRVKNITTAHKMVTHFERLGVKIHIEDIEDLGNGYWRITFKRVYPDATREIRELWNGLAFVD.
SEQ ID NO:7:(Ligase 20,Archaea):
MVVPLKRIDKIRWEIPKFDKRMRVPGRVYADEVLLEKMKNDRTLEQATNVAMLPGIYKYSIVMPDGHQGYGFPIGGVAAFDVKEGVISPGGIGYDINCGVRLIRTNLTEKEVRPRIKQLVDTLFKNVPSGVGSQGRIKLHWTQIDDVLVDGAKWAVDNGYGWERDLERLEEGGRMEGADPEAVSQRAKQRGAPQLGSLGSGNHFLEVQVVDKIFDPEVAKAYGLFEGQVVVMVHTGSRGLGHQVASDYLRIMERAIRKYRIPWPDRELVSVPFQSEEGQRYFSAMKAAANFAWANRQMITHWVRESFQEVFKQDPEGDLGMDIVYDVAHNIGKVEEHEVDGKRVKVIVHRKGATRAFPPGHEAVPRLYRDVGQPVLIPGSMGTASYILAGTEGAMKETFGSTCHGAGRVLSRKAATRQYRGDRIRQELLNRGIYVRAASMRVVAEEAPGAYKNVDNVVKVVSEAGIAKLVARMRPIGVAKGAAALEH.
SEQ ID NO:8:(Ligase 32,Bacteria):
MVSLHFKHILLKLGLDKERIEILEMKGGIVEDEFEGLRYLRFKDSAKGLRRGTVVFNESDIILGFPHIKRVVHLRNGVKRIFKSKPFYVEEKVDGYNVRVAKVGEKILALTRGGFVCPFTTERIGDFINEQFFKDHPNLILCGEMAGPESPYLVEGPPYVEEDIQFFLFDIQEKRTGRSIPVEERIKLAEEYGIQSVEIFGLYSYEKIDELYELIERLSKEGREGVVMKSPDMKKIVKYVTPYANVNDIKIGSRIFFDLPHGYFMQRIKRLAFYIAEKRIRREDFDEYAKALGKALLQPFVESIWDVAAGEMIAEIFTVRVKKIETAYKMVSHFERMGLNIHIDDIEELGNGYWKITFKRVYDDATKEIRELWNGHAFVD.
Identity (Identity) in the present application refers to "Identity" between amino acid sequences or nucleotide sequences, i.e. the sum of the ratios of amino acid residues or nucleotides of the same kind in an amino acid sequence or nucleotide sequence. The identity of amino acid sequences or nucleotide sequences can be determined using the alignment programs BLAST (Basic Local ALIGNMENT SEARCH Tool), FASTA, etc.
Proteins that are 70%, 75%, 80%, 85%, 90%, 95%, 99% or more (e.g., 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 98.5%, 99%, 99.5%, 99.6%, 99.7%, 99.8% or more, or even 99.9% or more) identical and have the same function have the same active site, active pocket, active mechanism, protein structure, etc. as those provided by the above sequences with a high probability.
As used herein, amino acid residues are abbreviated as alanine (Ala; A), asparagine (Asn; N), aspartic acid (Asp; D), arginine (Arg; R), cysteine (Cys; C), glutamic acid (Glu; E), glutamine (Gln; Q), glycine (Gly; G), histidine (His; H), isoleucine (Ile; I), leucine (Leu; L), lysine (Lys; K), methionine (Met; M), phenylalanine (Phe; F), proline (Pro; P), serine (Ser; S), threonine (Thr; T), tryptophan (Trp; W), tyrosine (Tyr; Y) and valine (Val; V).
The rules of substitution, replacement, etc., generally, which amino acids are similar in nature, and the effect after replacement is similar. For example, in the above homologous proteins, conservative amino acid substitutions may occur. "conservative amino acid substitutions" include, but are not limited to:
The hydrophobic amino acid (Ala, cys, gly, pro, met, val, ile, leu) is substituted with other hydrophobic amino acids;
The hydrophobic amino acid (Phe, tyr, trp) with a coarse side chain is replaced by other hydrophobic amino acids with a coarse side chain;
The positively charged amino acid (Arg, his, lys) of the side chain is replaced by other positively charged amino acids of the side chain;
The amino acid (Ser, thr, asn, gln) with a side chain having a polarity that is uncharged is substituted with other amino acids with a side chain having a polarity that is uncharged.
The amino acids may also be conservatively substituted by those skilled in the art according to amino acid substitution rules well known to those skilled in the art as the "blosum62 scoring matrix" in the art.
In the application, the phosphodiester bond is formed by the phosphate group and the hydroxyl group of the substrate only through the RNA ligase shown in SEQ ID NO. 1-SEQ ID NO. 5 or the enzyme which has more than 70% of the same degree with any RNA ligase shown in SEQ ID NO. 1-SEQ ID NO. 5, so that the product QPI-1002 is obtained. In the related experiments of the application, the inventors obtained RNA ligase shown by SEQ ID NO. 1-SEQ ID NO. 5 capable of synthesizing QPI-1002 by screening from a large number of enzymes. A negative result with a large proportion in the experiment shows that most RNA ligases are difficult to catalyze the synthesis of QPI-1002, including but not limited to the RNA ligases shown in SEQ ID NO. 6-SEQ ID NO. 8, and the RNA ligases without activity of catalyzing the synthesis of QPI-1002 are only shown by taking SEQ ID NO. 6-SEQ ID NO. 8 as an example in the specification of the application.
In a preferred embodiment, the sense strand substrate fragment comprises 2 or more, the antisense strand substrate fragment comprises 2 or more, preferably the sense strand substrate fragment has a length of 5 to 12nt, more preferably 6 to 11nt, preferably the antisense strand substrate fragment has a length of 2 to 16nt, more preferably 3 to 14nt.
In a preferred embodiment, the sense strand substrate fragment and the antisense strand substrate fragment each comprise 2, the sense strand substrate fragment comprises a first sense strand substrate fragment and a second sense strand substrate fragment, and the antisense strand substrate fragment comprises a first antisense strand substrate fragment and a second antisense strand substrate fragment; the preparation method comprises the steps of S1) mixing a first sense strand substrate fragment, a second sense strand substrate fragment, a first antisense strand substrate fragment and a second antisense strand substrate fragment, S2) connecting the first sense strand substrate fragment and the second sense strand substrate fragment under the catalysis of RNA ligase to form a sense strand, catalyzing the first antisense strand substrate fragment and the second antisense strand substrate fragment to connect to form an antisense strand, and forming QPI-1002 by complementary base pairing of the sense strand and the antisense strand, wherein the sense strand substrate fragment and the antisense strand substrate fragment are annealed and then mixed with RNA ligase to obtain QPI-1002, the first sense strand substrate fragment and the first antisense strand substrate fragment are annealed to form a first double-stranded RNA fragment, the second sense strand substrate fragment and the second antisense strand substrate fragment are annealed to form a second double-stranded RNA fragment, and complementary pairing of 3nt or more is formed in the first double-stranded RNA fragment and the second double-stranded RNA fragment, and the cohesive end between the first double-stranded RNA fragment and the second double-stranded RNA fragment can form a cohesive end or more than or equal to 2nt.
In the preparation method, the sense strand substrate fragment and the antisense strand substrate fragment can be mixed for annealing, and a double-stranded RNA structure can be formed by base complementary pairing between the sense strand substrate fragment and the antisense strand substrate fragment, and the double-stranded RNA structure has a notch between different substrate fragments. And mixing the annealed reaction system with RNA ligase, connecting phosphate groups and hydroxyl groups at two sides of the notch by using the RNA ligase through a phosphodiester bond, and repairing the notch, thereby obtaining a target product QPI-1002 with complete double-chain structure.
In a preferred embodiment, S1) the nucleotide sequence of the first sense strand substrate fragment is the nucleotide sequence shown in SEQ ID NO. 9 and the nucleotide sequence of the second sense strand substrate fragment is the nucleotide sequence shown in SEQ ID NO. 10. Preferably, the nucleotide sequence of the first antisense strand substrate is the nucleotide sequence shown as SEQ ID NO. 12 and the nucleotide sequence of the second antisense strand substrate is the nucleotide sequence shown as SEQ ID NO. 11.
In a preferred embodiment, S1) the nucleotide sequence of the first sense strand substrate is the nucleotide sequence shown in SEQ ID NO. 13 and the nucleotide sequence of the second sense strand substrate is the nucleotide sequence shown in SEQ ID NO. 14. Preferably, the nucleotide sequence of the first antisense strand substrate is the nucleotide sequence shown as SEQ ID NO. 16 and the nucleotide sequence of the second antisense strand substrate is the nucleotide sequence shown as SEQ ID NO. 15.
In a preferred embodiment, S2), the 3 'end of the first sense strand substrate fragment is linked to the 5' end of the second sense strand substrate fragment under the catalysis of RNA ligase to form a sense strand, the 3 'end of the first antisense strand substrate fragment is linked to the 5' end of the second antisense strand substrate fragment under the catalysis of RNA ligase to form an antisense strand, preferably the 5 'end of the first sense strand substrate fragment is a hydroxyl group, the 3' end is a hydroxyl group, the 5 'end of the second sense strand substrate fragment is a phosphate group, the 3' end is a hydroxyl group, preferably the 5 'end of the first antisense strand substrate fragment is a hydroxyl group, the 3' end is a hydroxyl group, and the 5 'end of the second antisense strand substrate fragment is a phosphate group, the 3' end is a hydroxyl group.
QPI-1002 can be prepared using the preparation method described above and the substrate fragment shown as SEQ ID NO 9-SEQ ID NO 12 or SEQ ID NO 13-SEQ ID NO 16. However, it should be noted that the selection of the substrate fragment is not limited to the substrate fragments shown in SEQ ID NOS.9-12 or SEQ ID NOS.13-16, and the substrate fragments capable of being combined to form the sense strand and the antisense strand can be applied to the preparation method described above, which is applicable to the preparation of QPI-1002, but not limited to the difference in the ligation positions of the substrate fragments, and the preparation described above has a good ligation effect for the ligation of the sense strand sequence and the antisense strand sequence of QPI-1002. The number of sense strand substrate fragments or antisense strand substrate fragments includes, but is not limited to, 2,3, 4, or even more.
SEQ ID NO:9:GAmGAmAUmAUmUUmCAmC。
SEQ ID NO:10:CmCUmUCmA。
SEQ ID NO:11:AUmAUmUCmUCm。
SEQ ID NO:12:UmGAmAGmGGmUGmAAm。
SEQ ID NO:13:GAmGAmAUmAUm。
SEQ ID NO:14:UUmCAmCCmCUmUCmA。
SEQ ID NO:15:UmUCmUCm。
SEQ ID NO:16:UmGAmAGmGGmUGmAAmAUmA。
In a preferred embodiment, the sense strand substrate fragment and the antisense strand substrate fragment each comprise 3, the sense strand substrate fragment comprising a first sense strand substrate fragment, a second sense strand substrate fragment, and a third sense strand substrate fragment; the antisense strand substrate fragments comprise a first antisense strand substrate fragment, a second antisense strand substrate fragment and a third antisense strand substrate fragment, preferably the preparation method comprises S1) mixing the first sense strand substrate fragment, the second sense strand substrate fragment, the third sense strand substrate fragment, the first antisense strand substrate fragment, the second antisense strand substrate fragment, the third antisense strand substrate fragment and the RNA ligase, S2) under the catalysis of the RNA ligase, the first sense strand substrate fragment and the second sense strand substrate fragment are connected to form a sense strand, the first antisense strand substrate fragment and the second antisense strand substrate fragment are catalyzed to form an antisense strand, the sense strand and the antisense strand are formed into QPI-1002 through base complementation pairing, preferably the sense strand substrate fragment and the antisense strand substrate fragment are annealed and then mixed with the RNA ligase to obtain QPI-1002, preferably the nucleotide sequence of the first sense strand substrate fragment is the nucleotide sequence shown in SEQ ID NO:17, the second sense strand substrate fragment is the nucleotide sequence shown in SEQ ID NO:17, the nucleotide sequence of the third sense strand substrate fragment is the nucleotide sequence shown in SEQ ID NO: 19), the nucleotide sequence of the first antisense strand substrate fragment is shown as SEQ ID NO. 22, the nucleotide sequence of the second antisense strand substrate fragment is shown as SEQ ID NO. 21, and the nucleotide sequence of the third antisense strand substrate fragment is shown as SEQ ID NO. 20.
SEQ ID NO:17:GAmGAmAUm。
SEQ ID NO:18:AUmUUmCAmC。
SEQ ID NO:19:CmCUmUCmA。
SEQ ID NO:20:CmUCm。
SEQ ID NO:21:AAmAUmAUmU。
SEQ ID NO:22:UmGAmAGmGGmUGm。
In a preferred embodiment, the concentration of the sense strand substrate fragment and the antisense strand substrate fragment is respectively selected from 0.1-4.5mM, preferably 0.8-1.6mM, preferably, the reaction system formed by mixing the sense strand substrate fragment, the antisense strand substrate fragment and the RNA ligase further comprises ATP, tris-HCl, mgCl 2 and DTT, preferably, the reaction temperature of the preparation method is 0-60 ℃, more preferably 4-37 ℃, preferably, the reaction time of the preparation method is 0.5-24 h, more preferably 12-16h, and the pH of the preparation method is 6-8.5.
The concentration of the sense strand substrate fragment and the antisense strand substrate fragment is selected from the group consisting of, but not limited to, 0.1, 0.5, 0.8, 0.9, 1, 1.1, 1.2, 1.3, 1.4, 1.5, 1.6, 1.7, 1.8, 1.9, 2.0, 2.5, 3.0, 3.5, 4.0, or 4.5mM, the reaction temperature of the preparation method includes, but not limited to, 10, 15, 16, 20, 25, 30, 35, or 40 ℃, the reaction time of the preparation method includes, but not limited to, 2, 5, 10, 15, 16, 20, 24, 25, 30, 35, 40, 45, or 48 hours, and the pH of the preparation method includes, but not limited to, 6, 6.5, 7, 7.5, 8, or 8.5.
The advantageous effects of the present application will be explained in further detail below in connection with specific examples.
Example 1
The 4 single stranded RNA fragments shown in Table 1 were designed based on the QPI-1002 sequence in nt length units.
TABLE 1
。
Wherein A, C, G or m after U represents 2' methoxy modification of the ribonucleotide.
Ribonucleotides at positions 2, 4, 6, 8, 10 and 12 of substrate 1 have 2' methoxy modifications.
Ribonucleotides at positions 1, 3 and 5 of substrate 2 have 2' methoxy modifications.
Ribonucleotides at positions 2, 4, 6 and 8 of substrate 3 have 2' methoxy modifications.
Ribonucleotides at positions 1, 3,5, 7, 9 and 11 of substrate 4 have 2' methoxy modifications.
The above 4 single-stranded RNA fragments were prepared using a solid phase synthesis method.
The 4 single-stranded RNA fragments were mixed in an equimolar ratio to obtain a substrate mixture having a final concentration of 2.5mM (2.5 mM for each substrate), and annealed to obtain an annealed RNA fragment mixture. The annealed RNA fragment mixture was subjected to enzyme-catalyzed ligation reaction, the reaction system was set to 10. Mu.L, and the reaction system included reaction buffer (50 mM Tris-HCl, pH 7.5), adenosine triphosphate (adenosine triphosphate, ATP), mgCl 2, dithiothreitol (DTT), and RNA Ligase Ligase 25, ligase 26, ligase 31, ligase 41, ligase 42, ligase 11, ligase 20 and Ligase 32 were added, respectively. The reaction was reacted at 16 ℃ for 16h. The resulting reaction system was subjected to enzyme inactivation of ligase at 80℃for 5min and centrifugation at 12000rpm to remove precipitate. The enzyme-catalyzed ligation reaction is schematically depicted in FIG. 1.
SDS-PAGE detection was performed on products obtained by catalyzing with RNA ligases Ligase25, ligase 26, ligase31, ligase 41, ligase 42 Ligase11, ligase 20 and Ligase 32. The electrophoresis results of the products obtained by catalyzing the RNA Ligase31, ligase25 and Ligase11 are shown in FIG. 2, wherein the lane M in FIG. 2 represents the RNA molecular standard (marker), the lane 1 represents the reaction system of Ligase31, the lane 2 represents the reaction system of Ligase25, and the lane 3 represents the reaction system of Ligase 11. The yields were estimated from the gray scale analysis of the target bands in the Urea-PAGE results, and the final yield results are shown in Table 2.
The activity screening of 10 RNA ligases shows that the Ligase25 has better connection effect and can convert most substrates into QPI-1002.
The sense strand of QPI-1002 prepared was GAmGAmAUmAUmUUmCAmCCmCUmUCmA (SEQ ID NO: 23) and the antisense strand was UmGAmAGmGGmUGmAAmAUmAUmUCmUCm (SEQ ID NO: 24).
The reaction results are shown in Table 2.
TABLE 2
。
Remarks:
1) The reaction conditions were 100. Mu.M for the substrate fragment, 10eq for ATP, 2 eq for MgCl, 10eq for DTT (1 eq=100. Mu.M) for a final concentration of 0.2mg/mL enzyme, 2385V, 50 mM Tris-HCl pH 7.5,16 ℃for 16 h;
2) N.d. indicates that no product formation is detected, ++indicates 25-50% (excluding 50% of the end point values), ++ represents 50-75%, 50-75 percent (a).
The yield calculation formula in this example is yield = product gray data/(product gray data + substrate gray data).
Example 2:
Ligase 25 with better reactivity was selected and the annealed substrate fragment was used for enzyme-catalyzed ligation under the following conditions, reaction buffer (50 mM Tris-HCl, pH 7.5), adenosine triphosphate (adenosine triphosphate, ATP), mgCl 2, dithiothreitol (DTT) and RNA Ligase were added sequentially in a 50. Mu.L reactor and reacted for 16h at 16 ℃.
Heating at 80deg.C for 5min to deactivate protein, and centrifuging to obtain supernatant. The HPLC and LC-MS measurements were performed, wherein the HPLC results of the Ligase 25 catalytic product are shown in FIG. 3, and the "yield was measured as the roughly estimated ratio of the product peak in the HPLC data of the reaction system sample, and the results are shown in Table 3.
TABLE 3 Table 3
。
Remarks:
1) The reaction conditions were 800. Mu.M substrate fragment, ATP 4eq, mgCl 2 eq, DTT 10eq (1 eq=800. Mu.M), final concentration of 0.2mg/mL enzyme, 298V, 50mM Tris-HCl pH 7.5,16 ℃for 16h;
2) + represents <50%.
LC-MS is used for identifying that the molecular weight of a sense strand product is 6089.9, the molecular weight of an antisense strand product is 6223.9, the theoretical value of the sense strand product is 6089.96 +/-8, and the theoretical value of the antisense strand product is 6224.0 +/-8, which shows that the ligation of Ligase 25 generates QPI-1002, and the detection result is shown in a graph in FIG. 4.
Example 3
The annealed substrate fragment and Ligase 25 were used for the enzyme-catalyzed ligation reaction under the following conditions of sequentially adding a reaction buffer (50 mM Tris-HCl, pH 7.5), adenosine triphosphate (adenosine triphosphate, ATP), mgCl 2, dithiothreitol (DTT) and RNA Ligase in a 10mL reactor, and reacting at 16℃for 16 hours. After the overnight reaction, heating at 50deg.C for 15min to inactivate protein, centrifuging to obtain supernatant, purifying with Nano-Q column, gradient eluting with NaCl, desalting with membrane bag (molecular weight cut-off 1 kDa), lyophilizing in a lyophilizer to obtain dry powder, and calculating yield 68.73% and purity (HPLC detection) 95.44%.
Example 4
The 4 single stranded RNA fragments shown in Table 4 were designed based on the QPI-1002 sequence in nt length units.
TABLE 4 Table 4
。
Wherein A, C, G or m after U represents 2' methoxy modification of the ribonucleotide.
Ribonucleotides at positions 2, 4, 6 and 8 of substrate 5 have 2' methoxy modifications.
Ribonucleotides at positions 2, 4, 6, 8 and 10 of substrate 6 have 2' methoxy modifications.
Ribonucleotides at positions 1, 3 and 5 of substrate 7 have 2' methoxy modifications.
Ribonucleotides at positions 1,3, 5, 7, 9, 11 and 13 of substrate 8 have 2' methoxy modifications.
The above 4 single-stranded RNA fragments were prepared using a solid phase synthesis method.
The sense strand of QPI-1002 prepared was GAmGAmAUmAUmUUmCAmCCmCUmUCmA (SEQ ID NO: 23) and the antisense strand was UmGAmAGmGGmUGmAAmAUmAUmUCmUCm (SEQ ID NO: 24).
The enzyme-catalyzed ligation was performed using the annealed substrate fragment and Ligase Ligase 25 under the conditions in which the reaction system was set to 50. Mu.L, and 800. Mu.M substrate fragment, ATP 4eq, mgCl 2 12.5eq,DTT 1.25eq (1 eq=800. Mu.M) were sequentially added to the reactor, and Ligase 25,239V 50mM Tris-HCl pH 7.5 was reacted at 16℃and 16 h at a final concentration of 0.2 mg/mL. Heating at 80deg.C for 5min to deactivate protein, and centrifuging to obtain supernatant. The assay HPLC was run and the yield was measured as the coarsely estimated occupancy of the product peak in the HPLC data of the reaction system samples. The results showed a target peak ratio of 72.3% in the sample, i.e. a yield of++.
Example 5
The 6 single stranded RNA fragments shown in Table 5 were designed based on the QPI-1002 sequence in nt length units.
TABLE 5
。
Wherein A, C, G or m after U represents 2' methoxy modification of the ribonucleotide.
Ribonucleotides at positions 2, 4 and 6 of substrate 9 have 2' methoxy modifications.
Ribonucleotides at positions 2,4 and 6 of substrate 10 have 2' methoxy modifications.
Ribonucleotides at positions 1, 3 and 5 of substrate 11 have 2' methoxy modifications.
Ribonucleotides at positions 1 and 3 of substrate 12 have 2' methoxy modifications.
Ribonucleotides at positions 2,4 and 6 of substrate 13 have 2' methoxy modifications.
Ribonucleotides at positions 1, 3, 5, 7 and 9 of substrate 14 have 2' methoxy modifications.
The above 6 single-stranded RNA fragments were prepared using a solid phase synthesis method.
The sense strand of QPI-1002 prepared was GAmGAmAUmAUmUUmCAmCCmCUmUCmA (SEQ ID NO: 23) and the antisense strand was UmGAmAGmGGmUGmAAmAUmAUmUCmUCm (SEQ ID NO: 24).
The enzyme-catalyzed ligation was performed using the annealed substrate fragment and Ligase Ligase 25 under the conditions in which the reaction system was set to 50. Mu.L, 800. Mu.M substrate fragment, ATP 4eq, mgCl 2 12.5eq,DTT 1.25eq (1 eq=800. Mu.M) were added sequentially to the reactor, and LigSase 25,239V 50mM Tris-HCl pH 7.5 was reacted at 16℃to h at a final concentration of 0.2 mg/mL. Heating at 80deg.C for 5min to deactivate protein, and centrifuging to obtain supernatant. The assay HPLC was run and the yield was measured as the coarsely estimated occupancy of the product peak in the HPLC data of the reaction system samples. The results showed that the target peak ratio in the sample was 79.6%, i.e. the yield was++.
Comparative example 1
The average yield of the full length QPI-1002 product synthesized using the solid phase was 29.5% with a total of 1.50% N+1 and N-1 impurities.
The yield of QPI-1002 obtained by the enzyme-linked method is 66.18%, the average value of the solid phase synthesis yield of the used substrate is 45.9%, the yield of the multiplied integral process is 30.3%, the product is higher than the average yield of the product obtained by the solid phase synthesis method, and the total impurity ratio of N+1 and N-1 is 0.38% which is lower than the impurity ratio in the process of synthesizing QPI-1002 by the solid phase synthesis method.
From the above description, it can be seen that the above embodiment of the present application achieves the technical effects that in the preparation method of the present application, 4 short substrate fragments (length is less than 19 nt) are synthesized first, and then the full-length QPI-1002 is synthesized by enzyme ligation, thereby realizing the preparation of the siRNA drug by using the biosynthesis method, since the length of the synthesized fragments is shortened, the N+1 and N-1 impurities generated during the synthesis process are correspondingly reduced, the ligation efficiency of the N+1 and N-1 impurities in the substrate fragments is reduced, the N+1 and N-1 impurities in the full-length product are further reduced, and in addition, the chain length of the N+1 and N-1 impurities in the substrate fragments is greatly different from that of the full-length product, and is easy to remove during the purification process, and the content of the final N+1 and N-1 impurities is less than 0.5%. Compared with a chemical synthesis preparation method, the preparation method provided by the application has the advantages that the purity of the obtained product is high, the generated impurities are less, the preparation process is simple, the reaction condition is mild, the dosage of organic reagent is low, the production cost is reduced, and the large-scale industrial production is convenient to realize.
The above description is only of the preferred embodiments of the present invention and is not intended to limit the present invention, but various modifications and variations can be made to the present invention by those skilled in the art. Any modification, equivalent replacement, improvement, etc. made within the spirit and principle of the present invention should be included in the protection scope of the present invention.