CN108315320A - A kind of preparation method of RNA high-throughput sequencing libraries - Google Patents
A kind of preparation method of RNA high-throughput sequencing libraries Download PDFInfo
- Publication number
- CN108315320A CN108315320A CN201710029640.9A CN201710029640A CN108315320A CN 108315320 A CN108315320 A CN 108315320A CN 201710029640 A CN201710029640 A CN 201710029640A CN 108315320 A CN108315320 A CN 108315320A
- Authority
- CN
- China
- Prior art keywords
- rna
- sequence
- throughput sequencing
- library
- preparation
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000012165 high-throughput sequencing Methods 0.000 title claims abstract description 45
- 238000002360 preparation method Methods 0.000 title claims abstract description 41
- 108091032973 (ribonucleotides)n+m Proteins 0.000 claims abstract description 102
- 238000000034 method Methods 0.000 claims abstract description 87
- 108091028075 Circular RNA Proteins 0.000 claims abstract description 30
- 238000006243 chemical reaction Methods 0.000 claims abstract description 23
- 108091036407 Polyadenylation Proteins 0.000 claims abstract description 19
- 102100034343 Integrase Human genes 0.000 claims abstract description 13
- 108010092799 RNA-directed DNA polymerase Proteins 0.000 claims abstract description 13
- 229920001519 homopolymer Polymers 0.000 claims abstract description 11
- 238000010839 reverse transcription Methods 0.000 claims abstract description 7
- 239000002299 complementary DNA Substances 0.000 claims abstract description 4
- 238000012408 PCR amplification Methods 0.000 claims abstract description 3
- 239000000203 mixture Substances 0.000 claims description 13
- 238000000746 purification Methods 0.000 claims description 5
- 230000000295 complement effect Effects 0.000 claims description 2
- IQFYYKKMVGJFEH-XLPZGREQSA-N Thymidine Chemical compound O=C1NC(=O)C(C)=CN1[C@@H]1O[C@H](CO)[C@@H](O)C1 IQFYYKKMVGJFEH-XLPZGREQSA-N 0.000 claims 2
- ISAKRJDGNUQOIC-UHFFFAOYSA-N Uracil Chemical compound O=C1C=CNC(=O)N1 ISAKRJDGNUQOIC-UHFFFAOYSA-N 0.000 claims 2
- UYTPUPDQBNUYGX-UHFFFAOYSA-N guanine Chemical compound O=C1NC(N)=NC2=C1N=CN2 UYTPUPDQBNUYGX-UHFFFAOYSA-N 0.000 claims 2
- 229930024421 Adenine Natural products 0.000 claims 1
- GFFGJBXGBJISGV-UHFFFAOYSA-N Adenine Chemical compound NC1=NC=NC2=C1N=CN2 GFFGJBXGBJISGV-UHFFFAOYSA-N 0.000 claims 1
- DWRXFEITVBNRMK-UHFFFAOYSA-N Beta-D-1-Arabinofuranosylthymine Natural products O=C1NC(=O)C(C)=CN1C1C(O)C(O)C(CO)O1 DWRXFEITVBNRMK-UHFFFAOYSA-N 0.000 claims 1
- 101100191561 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) PRP3 gene Proteins 0.000 claims 1
- 229960000643 adenine Drugs 0.000 claims 1
- IQFYYKKMVGJFEH-UHFFFAOYSA-N beta-L-thymidine Natural products O=C1NC(=O)C(C)=CN1C1OC(CO)C(O)C1 IQFYYKKMVGJFEH-UHFFFAOYSA-N 0.000 claims 1
- 229940104230 thymidine Drugs 0.000 claims 1
- 229940035893 uracil Drugs 0.000 claims 1
- 230000035945 sensitivity Effects 0.000 abstract description 6
- 108091070501 miRNA Proteins 0.000 description 71
- 239000002679 microRNA Substances 0.000 description 69
- 230000029142 excretion Effects 0.000 description 23
- 239000000523 sample Substances 0.000 description 20
- 238000012163 sequencing technique Methods 0.000 description 18
- 230000009182 swimming Effects 0.000 description 18
- 210000002381 plasma Anatomy 0.000 description 14
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 11
- 108020004414 DNA Proteins 0.000 description 10
- 238000006062 fragmentation reaction Methods 0.000 description 10
- 239000000243 solution Substances 0.000 description 10
- 238000013467 fragmentation Methods 0.000 description 9
- 210000002966 serum Anatomy 0.000 description 9
- 239000006228 supernatant Substances 0.000 description 9
- 108091032955 Bacterial small RNA Proteins 0.000 description 8
- 238000000018 DNA microarray Methods 0.000 description 8
- 238000001962 electrophoresis Methods 0.000 description 8
- 238000002156 mixing Methods 0.000 description 8
- 230000008569 process Effects 0.000 description 8
- 239000000047 product Substances 0.000 description 8
- 108090000623 proteins and genes Proteins 0.000 description 8
- 238000011160 research Methods 0.000 description 8
- 210000004369 blood Anatomy 0.000 description 7
- 239000008280 blood Substances 0.000 description 7
- 238000002474 experimental method Methods 0.000 description 7
- 150000002500 ions Chemical class 0.000 description 7
- 238000005516 engineering process Methods 0.000 description 6
- 239000007788 liquid Substances 0.000 description 6
- 238000002493 microarray Methods 0.000 description 6
- 102000004169 proteins and genes Human genes 0.000 description 6
- 108091092562 ribozyme Proteins 0.000 description 6
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 5
- 206010028980 Neoplasm Diseases 0.000 description 5
- 239000013614 RNA sample Substances 0.000 description 5
- 239000003153 chemical reaction reagent Substances 0.000 description 5
- 238000001914 filtration Methods 0.000 description 5
- 239000000872 buffer Substances 0.000 description 4
- 238000005119 centrifugation Methods 0.000 description 4
- 230000002596 correlated effect Effects 0.000 description 4
- 230000000875 corresponding effect Effects 0.000 description 4
- 238000011161 development Methods 0.000 description 4
- 238000000605 extraction Methods 0.000 description 4
- 230000014509 gene expression Effects 0.000 description 4
- 239000013642 negative control Substances 0.000 description 4
- 238000001556 precipitation Methods 0.000 description 4
- 238000011002 quantification Methods 0.000 description 4
- 238000000926 separation method Methods 0.000 description 4
- 108091030146 MiRBase Proteins 0.000 description 3
- 108700011259 MicroRNAs Proteins 0.000 description 3
- 108091034057 RNA (poly(A)) Proteins 0.000 description 3
- 102000003661 Ribonuclease III Human genes 0.000 description 3
- 108010057163 Ribonuclease III Proteins 0.000 description 3
- 108020003224 Small Nucleolar RNA Proteins 0.000 description 3
- 102000042773 Small Nucleolar RNA Human genes 0.000 description 3
- 108020004459 Small interfering RNA Proteins 0.000 description 3
- 230000003321 amplification Effects 0.000 description 3
- 239000011324 bead Substances 0.000 description 3
- 238000007622 bioinformatic analysis Methods 0.000 description 3
- 238000010219 correlation analysis Methods 0.000 description 3
- 201000010099 disease Diseases 0.000 description 3
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 3
- 230000000694 effects Effects 0.000 description 3
- 239000004615 ingredient Substances 0.000 description 3
- 238000007689 inspection Methods 0.000 description 3
- 238000003199 nucleic acid amplification method Methods 0.000 description 3
- 238000011017 operating method Methods 0.000 description 3
- 238000012545 processing Methods 0.000 description 3
- 238000010008 shearing Methods 0.000 description 3
- 238000001228 spectrum Methods 0.000 description 3
- 238000012360 testing method Methods 0.000 description 3
- GUAHPAJOXVYFON-ZETCQYMHSA-N (8S)-8-amino-7-oxononanoic acid zwitterion Chemical compound C[C@H](N)C(=O)CCCCCC(O)=O GUAHPAJOXVYFON-ZETCQYMHSA-N 0.000 description 2
- HEDRZPFGACZZDS-UHFFFAOYSA-N Chloroform Chemical compound ClC(Cl)Cl HEDRZPFGACZZDS-UHFFFAOYSA-N 0.000 description 2
- 229920002527 Glycogen Polymers 0.000 description 2
- KFZMGEQAYNKOFK-UHFFFAOYSA-N Isopropanol Chemical compound CC(C)O KFZMGEQAYNKOFK-UHFFFAOYSA-N 0.000 description 2
- 238000010220 Pearson correlation analysis Methods 0.000 description 2
- 101710124239 Poly(A) polymerase Proteins 0.000 description 2
- 238000010802 RNA extraction kit Methods 0.000 description 2
- 238000003559 RNA-seq method Methods 0.000 description 2
- 108090000638 Ribonuclease R Proteins 0.000 description 2
- 239000003146 anticoagulant agent Substances 0.000 description 2
- 229940127219 anticoagulant drug Drugs 0.000 description 2
- 238000003149 assay kit Methods 0.000 description 2
- 230000008901 benefit Effects 0.000 description 2
- 230000033228 biological regulation Effects 0.000 description 2
- 239000000090 biomarker Substances 0.000 description 2
- 210000004027 cell Anatomy 0.000 description 2
- 230000009514 concussion Effects 0.000 description 2
- 238000007796 conventional method Methods 0.000 description 2
- 238000001514 detection method Methods 0.000 description 2
- 238000010586 diagram Methods 0.000 description 2
- 230000029087 digestion Effects 0.000 description 2
- 238000010790 dilution Methods 0.000 description 2
- 239000012895 dilution Substances 0.000 description 2
- 238000001035 drying Methods 0.000 description 2
- 235000019441 ethanol Nutrition 0.000 description 2
- 238000000703 high-speed centrifugation Methods 0.000 description 2
- 238000001727 in vivo Methods 0.000 description 2
- 238000011528 liquid biopsy Methods 0.000 description 2
- 239000002105 nanoparticle Substances 0.000 description 2
- 239000002773 nucleotide Substances 0.000 description 2
- 125000003729 nucleotide group Chemical group 0.000 description 2
- 239000012071 phase Substances 0.000 description 2
- 239000008363 phosphate buffer Substances 0.000 description 2
- 230000026731 phosphorylation Effects 0.000 description 2
- 238000006366 phosphorylation reaction Methods 0.000 description 2
- 239000002244 precipitate Substances 0.000 description 2
- 238000001303 quality assessment method Methods 0.000 description 2
- 238000013441 quality evaluation Methods 0.000 description 2
- 239000008223 sterile water Substances 0.000 description 2
- 230000014616 translation Effects 0.000 description 2
- 101000942941 Arabidopsis thaliana DNA ligase 6 Proteins 0.000 description 1
- 201000001320 Atherosclerosis Diseases 0.000 description 1
- 241000244203 Caenorhabditis elegans Species 0.000 description 1
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 1
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 1
- 101100408379 Drosophila melanogaster piwi gene Proteins 0.000 description 1
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 1
- 102000004190 Enzymes Human genes 0.000 description 1
- 108090000790 Enzymes Proteins 0.000 description 1
- 241000628997 Flos Species 0.000 description 1
- 241000244206 Nematoda Species 0.000 description 1
- 108091007412 Piwi-interacting RNA Proteins 0.000 description 1
- 101710086015 RNA ligase Proteins 0.000 description 1
- 108010083644 Ribonucleases Proteins 0.000 description 1
- 102000006382 Ribonucleases Human genes 0.000 description 1
- VMHLLURERBWHNL-UHFFFAOYSA-M Sodium acetate Chemical compound [Na+].CC([O-])=O VMHLLURERBWHNL-UHFFFAOYSA-M 0.000 description 1
- UELITFHSCLAHKR-UHFFFAOYSA-N acibenzolar-S-methyl Chemical compound CSC(=O)C1=CC=CC2=C1SN=N2 UELITFHSCLAHKR-UHFFFAOYSA-N 0.000 description 1
- 230000009471 action Effects 0.000 description 1
- 238000004458 analytical method Methods 0.000 description 1
- 239000008346 aqueous phase Substances 0.000 description 1
- 238000012742 biochemical analysis Methods 0.000 description 1
- 230000031018 biological processes and functions Effects 0.000 description 1
- 230000015572 biosynthetic process Effects 0.000 description 1
- 210000001124 body fluid Anatomy 0.000 description 1
- 239000010839 body fluid Substances 0.000 description 1
- 201000011510 cancer Diseases 0.000 description 1
- 238000012512 characterization method Methods 0.000 description 1
- 239000003795 chemical substances by application Substances 0.000 description 1
- 238000004140 cleaning Methods 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 238000005859 coupling reaction Methods 0.000 description 1
- 238000007405 data analysis Methods 0.000 description 1
- 238000012350 deep sequencing Methods 0.000 description 1
- 230000007850 degeneration Effects 0.000 description 1
- 230000030609 dephosphorylation Effects 0.000 description 1
- 238000006209 dephosphorylation reaction Methods 0.000 description 1
- 206010012601 diabetes mellitus Diseases 0.000 description 1
- 238000003745 diagnosis Methods 0.000 description 1
- 238000004090 dissolution Methods 0.000 description 1
- 239000012154 double-distilled water Substances 0.000 description 1
- 238000009509 drug development Methods 0.000 description 1
- 230000005611 electricity Effects 0.000 description 1
- IDGUHHHQCWSQLU-UHFFFAOYSA-N ethanol;hydrate Chemical compound O.CCO IDGUHHHQCWSQLU-UHFFFAOYSA-N 0.000 description 1
- 230000004907 flux Effects 0.000 description 1
- 239000012634 fragment Substances 0.000 description 1
- 239000003292 glue Substances 0.000 description 1
- 229940096919 glycogen Drugs 0.000 description 1
- 238000011534 incubation Methods 0.000 description 1
- 108091023663 let-7 stem-loop Proteins 0.000 description 1
- 108091063478 let-7-1 stem-loop Proteins 0.000 description 1
- 108091049777 let-7-2 stem-loop Proteins 0.000 description 1
- 108091053735 lin-4 stem-loop Proteins 0.000 description 1
- 108091032363 lin-4-1 stem-loop Proteins 0.000 description 1
- 108091028008 lin-4-2 stem-loop Proteins 0.000 description 1
- 239000007791 liquid phase Substances 0.000 description 1
- KWGKDLIKAYFUFQ-UHFFFAOYSA-M lithium chloride Chemical class [Li+].[Cl-] KWGKDLIKAYFUFQ-UHFFFAOYSA-M 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 108020004999 messenger RNA Proteins 0.000 description 1
- 230000004060 metabolic process Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 210000005036 nerve Anatomy 0.000 description 1
- 230000010355 oscillation Effects 0.000 description 1
- 230000035479 physiological effects, processes and functions Effects 0.000 description 1
- 238000003752 polymerase chain reaction Methods 0.000 description 1
- 239000002243 precursor Substances 0.000 description 1
- 238000004321 preservation Methods 0.000 description 1
- 230000037452 priming Effects 0.000 description 1
- 239000011535 reaction buffer Substances 0.000 description 1
- 238000011084 recovery Methods 0.000 description 1
- 230000001105 regulatory effect Effects 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- 238000012216 screening Methods 0.000 description 1
- 238000011451 sequencing strategy Methods 0.000 description 1
- 239000004055 small Interfering RNA Substances 0.000 description 1
- 235000017281 sodium acetate Nutrition 0.000 description 1
- 239000001632 sodium acetate Substances 0.000 description 1
- 239000000725 suspension Substances 0.000 description 1
- 238000003786 synthesis reaction Methods 0.000 description 1
- 238000010257 thawing Methods 0.000 description 1
- 210000001519 tissue Anatomy 0.000 description 1
- 238000004454 trace mineral analysis Methods 0.000 description 1
- 238000013519 translation Methods 0.000 description 1
- 238000005406 washing Methods 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6806—Preparing nucleic acids for analysis, e.g. for polymerase chain reaction [PCR] assay
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6869—Methods for sequencing
-
- C—CHEMISTRY; METALLURGY
- C40—COMBINATORIAL TECHNOLOGY
- C40B—COMBINATORIAL CHEMISTRY; LIBRARIES, e.g. CHEMICAL LIBRARIES
- C40B50/00—Methods of creating libraries, e.g. combinatorial synthesis
- C40B50/06—Biochemical methods, e.g. using enzymes or whole viable microorganisms
Landscapes
- Chemical & Material Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Organic Chemistry (AREA)
- Health & Medical Sciences (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Biochemistry (AREA)
- Zoology (AREA)
- Engineering & Computer Science (AREA)
- Wood Science & Technology (AREA)
- Analytical Chemistry (AREA)
- Microbiology (AREA)
- Molecular Biology (AREA)
- Immunology (AREA)
- Biotechnology (AREA)
- Physics & Mathematics (AREA)
- Biophysics (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Engineering & Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Chemical Kinetics & Catalysis (AREA)
- General Chemical & Material Sciences (AREA)
- Medicinal Chemistry (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
Abstract
The present invention provides the method that one kind can carry out high-throughput sequencing library preparation to all RNA without poly (A) tail, and the method for the present invention covers all RNA without poly (A) tail, including linear rna and circular rna.The linear RNA without poly (A) tail can directly carry out high-throughput sequencing library preparation using the method for the present invention or after interrupting;Circular rna can carry out high-throughput sequencing library preparation after interrupting using the method for the present invention.First, end extension is carried out, above-mentioned RNA is made to extend one section of homopolymer sequence in 3 ' ends.Then, after being attached reaction, make its 5 ' end connection top connection R5A.After connection is reacted, above-mentioned RNA connection products are purified, recycles special reverse transcriptase primer RTP to carry out reverse transcription reaction, obtains mono- chains of corresponding cDNA.Finally, PCR amplification is carried out using with the matched primer of high-flux sequence platform, you can obtain high-throughput sequencing library.It is noted that the method for the present invention has obtained larger promotion in sensitivity, it is particularly suitable for the preparation of the high-throughput sequencing library of low initial amount RNA in micro-example.
Description
Technical field
The present invention relates to molecular biology field, the RNA high passes in particular to one kind without poly (A) tail measure
The high-flux sequence of low initial amount RNA in preface library preparation method, more particularly to micro-example such as excretion body and extracellular vesica
The preparation in library.
Technical background
MicroRNA (small RNA) is a major class regulatory molecule, is present in almost all of organism, length
Generally 18-40 nucleotide (nucleotide, nt).From 1993 for the first time in caenorhabditis elegan (Caenorhadits
Elegans after finding small RNA in), people recognize the important function of small RNA more and more.They are with difference
In the mode of action of " classical RNA ", participates in gene's expression and control, the processing of RNA and shearing, protein translation, heredity and " enter
Invade " inhibit, the biological processes such as gamete occurs, in the physiology mistake such as generation of gene expression, bion development, metabolism and disease
Journey plays an important role.Can be 4 classes according to the characterization of molecules of tiny RNA:Microrna (microRNA, miRNA),
PiRNA (piwi-associated RNA), siRNA (small interfering RNA, siRNA) and small nucleolar RNA
(snoRNA, small nucleolar RNA), the wherein research of miRNA is more, and progress is most fast.
As typically linearly without the RNA of poly (A) tail, miRNA be a kind of length be 19-25nt microRNA,
It is found in nematode earliest.This 2 kinds of miRNA of lin-4 and let-7 are attached to 3 ' non-codings of said target mrna by partial sequence complementarity
Area regulates and controls elegans development process.Before high throughput sequencing technologies birth, the discovery of miRNA and research speed are relatively slow always, this
It is to be spent greatly, sequencing depth is limited, less efficient since traditional Sanger PCR sequencing PCRs are complicated for operation.Since two thousand five, high
Flux sequencing technologies are widely used for excavating miRNA.High throughput sequencing technologies are fast, at low cost with speed, coverage is deep, output
The advantages that huge, therefore it is very suitable for miRNA sequencings, miRNA especially low to those copy numbers.End in December, 2016,
Sanger miRBase databases (http://www.mirbase.org/) included more than 2800 kinds of miRNA, wherein the mankind at
Ripe body miRNA has more than 2500 kinds.Studies have shown that the unconventionality expression of miRNA and specific cancer, the occurrence and development phase of disease
It closes.The express spectra of miRNA is analyzed using high-flux sequence means in tumor research field, finding can in certain tumours
The miRNA of unconventionality expression can occur, it is possible to find the new biomarkers as diagnosing tumor and Index for diagnosis.Currently,
A very important means of research miRNA are had become with high throughput sequencing technologies combination bioinformatic analysis.
Compared with linear rna, the concern that circular rna (circular RNA, circRNA) is subject to is fewer, also more difficult
In research.Circular rna is a kind of RNA with special unknown function, and is objective a large amount of existing.Circular rna is by precursor
Then RNA connects tail by shearing by the head of linear rna and is formed.Circular rna is difficult to be distinguished with other RNA, amplification or piece
Duan Huahui destroys RNA rings, and the parser of early RNA sequencing can filter out the significant sequence of circular rna.Due to technology
With methodology problem, circular rna is always treated as being rare shearing mistake, to which the RNA of this part objective reality is ignored
.As depth RNA sequencings and the development of scale biology information technology, researcher just really have found largely to deposit in vivo
In cricoid RNA molecule, circular rna is very stablized in vivo due to forming the ring-type being closed.
With the help of bioinformatics, biochemical analysis and deep sequencing, researchers have sky to these mysterious molecules
Preceding understanding.For the function of circular rna, there is the saying of several hypothesis at present:(1) circular rna can be used as the sea " sponge "
Silk floss inhales miRNA, inhibits its function;(2) circular rna passes through other rna levels of base pair complementarity direct regulation and control;(3) circular rna
It can be combined with protein, inhibit protein active, raise the component of protein complex or the activity of regulation protein;(4) ring
Shape RNA also can be used as the synthesis of the template guided protein of translation.On the basis of to circRNA structure and function researchs, have
Researcher has found that they all send out in the diseases generating process such as atherosclerosis, nerve problems, diabetes and tumour
Wave very important effect.Have great importance to circular rna research:(1) the unique competitiveness of circular rna is endogenous
(ceRNA) feature can provide new thinking for drug development;(2) tissue specificity of circular rna and stability there is a possibility that
CircRNA becomes a kind of good biomarker;(3) research of circular rna provides new direction for the Study on Evolution of life.
At present for there are mainly two types of the RNA high-throughput sequencing library preparation methods without poly (A) tail, i.e. random primer
Method and connector connection method.The first random priming is to utilize random primer reverse transcription at cDNA RNA, then synthesizes double-strand
DNA, in addition connector, to complete the preparation in library.Although this method is simple, during prepared by library, it is easy
The RNA of small fragment is caused to lose.Second method is connector connection method, and this method is also that current mainstream is built library kit and adopted
Method.This method is to add RNA connectors at the 3 ' ends of dephosphorylation RNA, then carry out phosphorylation to 5 ' ends of connection product
Processing makes it add the RNA connectors at 5 ' ends, reverse transcription reaction and PCR amplification is then carried out, to complete the preparation in library.
The main problem of this method is easy to happen between connector from connecting, to influence the quality and output of subsequent sequencing data.
In addition, the banking process initial amount of RNA is usually required that it is higher, it is general to require in 10ng or more.However, being led in liquid biopsy
The rna content in domain, body fluid source is usually all relatively low, usually needs considerable humoral sample using conventional RNA banking process.
For example, using traditional RNA banking process based on connector connection, whole blood sample at least needs 10ml that could meet to build library and want
It asks.Therefore, the sensitivity of traditional banking process cannot be satisfied the requirement for construction data base of such low abundance RNA.Liquid biopsy field
Urgently a kind of highly sensitive be especially can be suitably used for all high-throughput sequencing library preparation methods without poly (A) tails RNA.
Invention content
For existing in the prior art to carrying out building library sequencing sensitivity and effective percentage without the RNA of poly (A) tail
Relatively low problem, especially can not be efficiently to the RNA without poly (A) tail in the micro-examples such as excretion body and extracellular vesica
The problem of carrying out building library, the present invention provides a kind of can be directed to obtain all RNA without poly (A) tail, and to its efficient structure
The method for building sequencing library.
The preparation method of RNA high-throughput sequencing libraries of the present invention includes the following steps:(1) end is carried out to extend instead
It answers, poly (A)-RNA is made to extend one section of sequence in its 3 ' end;(2) it is attached reaction, above-mentioned RNA is made to connect at its 5 ' end
Connector R5A is connected, RNA connection products are obtained;(3) above-mentioned RNA connection products are purified;(4) special reverse transcription is utilized
Primer RTP carries out reverse transcription reaction to above-mentioned RNA connection products after purification, obtains mono- chains of corresponding cDNA;(5) PCR is carried out
Amplification enrichment is to get to high-throughput sequencing library.Fig. 1 illustrates to carry out RNA high-throughput sequencing library systems using the inventive method
Standby flow chart.
In the method, for RNA after the extension of end, 3 ' ends are one section of homopolymer.The homopolymer by A, T,
C, any one composition in this five kinds of bases of G or U, it is therefore preferable to base A.End extension method in the present invention theoretically can be with
Unbiased poorly makes all RNA all extend one section of homopolymer at 3 ' ends, thus can completely, accurately RNA in response sample
The time of day of transcript.
In the method, 5 ' end connector R5A are by two sections of Sequence compositions:Only positioned at the distinguished sequence at 5 ' ends and positioned at 3 ' ends
One identification sequence.Wherein, 5 ' terminal specific sequences are the 3 ' terminal sequences or intact full-length sequence of high-flux sequence plain adapter;3 ' ends are only
One identification sequence is the random sequence by these four base compositions of A, T, C and G, length 5nt.Particularly, in the method, by
In R5A 5 ' end be there is no 3 ' end connectors without phosphorylation modification, and in coupled reaction system, therefore will not cause connector it
Between connect certainly, so as to avoid the influence to sequencing data quality.In addition, one section of continuous stochastic ordering is contained at the ends R5A 3 '
On the one hand row can be used as unique recognition sequence, another convenient for rejecting the influence that PCR deviations are brought in subsequent data analysis
Aspect can also improve its joint efficiency between RNA to a certain extent.Further, R5A sequences include Seq ID
No:1。
In the method, distinguished sequence two parts sequence structure that reverse transcriptase primer RTP is held by the distinguished sequence at 5 ' ends and 3 '
At.Wherein, 5 ' terminal specific sequences are the 3 ' terminal sequences or intact full-length sequence of high-flux sequence plain adapter;The distinguished sequence at 3 ' ends
Sequence is captured by RNA connection products and anchor series two parts form.RNA connection products capture sequence in RTP and above-mentioned RNA
3 ' end extension sequences it is mutually complementary, the length of the capture sequence is 2-50nt, it is therefore preferable to 10-30nt.The 1st of anchor series
Base is the degeneracy base that homopolymer complementation is not held with RNA 3 ' (from 5 ' ends to 3 ' extreme directions);2nd bit base to 3 ' least significant ends
For by the random sequence of this 4 kinds of base compositions of A, T, C and G, sequence length 1-5nt, it is therefore preferable to 1-3nt, more preferably
1nt.Further, RTP sequences include Seq ID No:2、Seq ID No:3、Seq ID No:4 and Seq ID No:5 etc.
Sequence.
In the method, the PCR primer for being used for amplified library is the 5 ' terminal sequences or complete of high-flux sequence platform connector
Full length sequence.After PCR reactions, you can sequencing library be enriched with, complete.By the segment screening in library, after purification
With upper machine sequencing can be carried out after quality inspection.
One of feature of the method for the present invention is can be to carrying out building library sequencing without the RNA of poly (A) tail, and covering is all
Without the RNA of poly (A) tail, including linear rna and circular rna.The linear RNA without poly (A) tail can be utilized directly
The method of the present invention or as needed interrupt after carry out high-throughput sequencing library preparation;Nonlinear RNA, can into after Break Row
To carry out high-throughput sequencing library preparation using this method.The two of the feature of the method for the present invention are that sensitivity prepared by library obtains
To larger promotion, less (initial amount can be down to 1-10ng or even 0.5-2ng) is required to RNA initial amounts, can be realized to excretion
RNA in the micro-examples such as body and extracellular vesica carries out the preparation of high-throughput sequencing library.
Description of the drawings
The specific implementation mode of the present invention is described in further detail below in conjunction with the accompanying drawings.
The accompanying drawings which form a part of this application are used only to provide a further understanding of the present invention, not structure
At inappropriate limitation of the present invention.In the accompanying drawings:
Fig. 1 shows the flow chart that RNA high-throughput sequencing library preparations are carried out using the inventive method.
Fig. 2 shows the nano particle trace analysis instrument testing result figures of blood plasma excretion body.
Fig. 3 shows the analyzing biochips system detectio figure of blood plasma excretion body small RNA.
Fig. 4 shows the PAGE electrophoretograms (Illumina platforms) in library prepared by the method for the present invention and NEB kits.M
For 20bp DNA ladder;1-3 swimming lanes are library prepared by the method for the present invention, wherein No. 1 swimming lane is negative control, No. 2 swimming
Road and No. 3 swimming lane miRNA initial amounts are respectively 10ng and 2ng;4-6 swimming lanes are library prepared by NEB RNA isolation kits, wherein 4
Number swimming lane and No. 5 swimming lane miRNA are respectively 10ng and 2ng, and No. 6 swimming lanes are negative control.
Fig. 5 shows the analyzing biochips system detectio figure in the library prepared by the method for the present invention and NEB kits
(Illumina platforms).(A)-(B) utilizes library prepared by the method for the present invention, and miRNA initial amounts are respectively 10ng and 2ng;
(C)-(D) utilizes library prepared by NEB kits, and miRNA initial amounts are respectively 10ng and 2ng.
Fig. 6 shows the Venn diagram (Illumina platforms) in library prepared by the method for the present invention and NEB kits.(A)-
(B) library for utilizing the method for the present invention to prepare, miRNA initial amounts are respectively 10ng and 2ng;(C)-(D) utilizes NEB kit systems
Standby library, miRNA initial amounts are respectively 10ng and 2ng.
Fig. 7 shows the PAGE electrophoretograms (Illumina platforms) in library prepared by the method for the present invention.M is 20bp DNA
ladder;1-3 swimming lanes are library prepared by the method for the present invention, wherein No. 1 swimming lane is negative control, No. 2 swimming lanes and No. 3 swimming lanes
MiRNA initial amounts are respectively 10ng and 2ng.
Fig. 8 shows the analyzing biochips system detection results (Illumina platforms) of the circular rna after fragmentation;
Fig. 9 shows circular rna library electrophoresis pattern (Illumina platforms) prepared by the method for the present invention.M is 20bp
DNA ladder;1-2 swimming lanes are library prepared by the method for the present invention, and circular rna initial amount is respectively 50ng and 10ng;3-4
Number swimming lane is library prepared by NEB RNA isolation kits, and circular rna initial amount is respectively 50ng and 10ng.
Figure 10 shows the PAGE electrophoresis results (Ion Torrent platforms) in library prepared by the method for the present invention.M is 20bp
DNAladder;No. 1 swimming lane is negative control;No. 2 swimming lanes are that initial amount is 10ng miRNA;It is 2ng that No. 3 swimming lanes, which are initial amount,
miRNA。
Figure 11 shows analyzing biochips system detection results (the Ion Torrent in library prepared by the method for the present invention
Platform).(A) initial amount is the library of 10ng miRNA, and (B) initial amount is the library of 2ng miRNA.
Figure 12 shows the Venn diagram (Ion Torrent platforms) in the library prepared by the method for the present invention.(A) initial amount is
The library of 10ng miRNA, (B) initial amount are the library of 2ng miRNA.
Specific implementation mode
The embodiment and gain effect further illustrated the present invention below in conjunction with specific embodiment, example is only
Validity for explaining the present invention is not intended to limit use and the protection domain of the present invention.
Lifted microarray dataset is used only to the versatility of demonstration the method for the present invention in embodiment, is not intended to limit the present invention
Use and protection domain, the method for the present invention can be compatible with all high-flux sequence platforms.
Experimental method used in embodiment and experiment condition are unless otherwise specified conventional method and condition, or
Abide by corresponding instructions selection.
Room temperature described in this hair embodiment means normal temperature laboratory, i.e., 20-30 DEG C.
The materials, reagents and the like used in the following examples is commercially available unless otherwise specified.
Embodiment 1
MiRNA high-throughput sequencing library preparation process in the blood plasma excretion body that this example passes through a kind of exemplary embodiment
For, to describe the principle and feature that the method for the present invention prepares the linear libraries RNA without poly (A) tail.In this example,
Connector matches with Illumina microarray datasets, and 3 ' end end homopolymers are made of base A;The capture at 5 ' ends in reverse transcriptase primer
Sequence is by 20 T base compositions, anchor series VN(1)。
Correlated series involved in this example:
5 ' end connector R5A-1,5 '-CCUUGGCACCCGAGAAUUCCANNNNN;
Reverse transcriptase primer RTP-1,5 '-CAGAGTTCTACAGTCCGACGATCT(20)VN(1)。
1. the collection of plasma sample and its separation of excretion body
Inventor has collected in the first affiliated hospital of University Of Suzhou the complete of the normal physical examination Healthy People of 10ml in May, 2016
Blood.Whole blood sample is loaded on EDTA- anticoagulant tubes, has detached corresponding plasma sample respectively by the method for centrifugation, plasma sample is set
Next experiment is preserved or is directly entered under the conditions of -80 DEG C.
From plasma sample detach excretion body concrete operations be:
1) blood plasma of freeze thawing state 2000 × g at 4 DEG C completely is centrifuged 20 minutes, removes cell;
2) supernatant is taken to move to new centrifuge tube, 16500 × g is centrifuged 30 minutes under the conditions of 4 DEG C, and it is broken further to remove cell
Piece;
3) it takes supernatant liquid to be transferred to new centrifuge tube, the phosphate buffer of 0.5 times of volume, vortex mixing is added;
4) sample is passed through into 0.22um disposable filters, removes big vesica, collect clear liquid;
5) clear liquid of collection is centrifuged 70 minutes at 110000 × g, abandons supernatant, the PBS for adding 5 times of volumes of whole blood is molten
Liquid cleaning precipitation, 110000 × g are centrifuged 70 minutes again, reject supernatant;
6) use 100ul phosphate buffers that all precipitations are resuspended, -20 DEG C save backup.
7) take above-mentioned 1ul excretions weight suspension, after 1000 times of dilution after, take 1ml dilutions, utilize nano particle tracking point
Analyzer (Marlven companies, model NS300) is detected excretion body, analyzes separated excretion body grain size and quantity, specifically
Testing result is as shown in Figure 2.
2. the extraction of total serum IgE in excretion body
1) 2000 μ l TRIzol solution are added in above-mentioned remaining excretion weight hangs solution, firmly concussion is equal to presenting
One liquid phase is stored at room temperature 5 minutes;
2) 400 μ l of chloroform are added to be mixed well with forced oscillation centrifuge tube, are stored at room temperature 5 minutes;
3) under the conditions of 4 DEG C, 12000rpm high speed centrifugations are drawn after 15 minutes in upper strata aqueous phase to new centrifuge tube, are paid attention to not
The protein layer being drawn onto between two layers of water phase.
4) isometric isopropanol is added, fully reverse mixing stands two hours under the conditions of -20 DEG C;
5) under the conditions of 4 DEG C, 12000rpm high speed centrifugations carefully discard supernatant after 15 minutes, and the 75% of 1ml precoolings is added
Ethyl alcohol washing precipitates, and 7500rpm is centrifuged 5 minutes at 4 DEG C;
6) liquid is discarded, 5 minutes is placed at room temperature fully to dry precipitation, the sterile water without RNA enzyme of 20 μ l is added
It is dissolved;
7) analyzing biochips system Bioanalyzer 2100 is utilized to coordinate Small RNA analysis kits (Agilent
Company, article No. 5067-1548) detect Small RNA purity, concentration and segment distribution situation in total serum IgE, testing result such as Fig. 3
It is shown.RNA sample is placed in freeze preserves or is directly entered next experiment in -80 DEG C.
3. carrying out miRNA high-throughput sequencing library preparations using present patent application method
The preparation of high-throughput sequencing library is carried out with the above-mentioned RNA of the miRNA equivalents of 2ng and 10ng respectively.
1) same polyadenylic acid sequence is added at the ends RNA 3 '
It is formulated as follows reaction system:
Ingredient | Volume (ul) | Final concentration |
10x poly(A)polymerase buffer(NEB) | 0.5 | 1x |
ATP(10mM,NEB) | 0.5 | 100uM |
SuperaseIn(Ambion,20U/μL) | 0.1 | 2U |
miRNA | 3.8ul | 2ng/10ng |
poly(A)polymerase(20U/μL,NEB) | 0.1 | 2U |
Total volume | 5.0 |
Above-mentioned reaction solution is uniformly mixed, is reacted as follows on thermostat:37 DEG C 5 minutes then 65 DEG C 20 minutes.
2) 5 ' end connectors are connected
It is formulated as follows reaction solution:
Ingredient | Volume (ul) | Final concentration |
10x RNA ligase buffer(NEB) | 0.5 | 1x |
R5A-1(10μM) | 0.5 | 100nM |
PEG (50%, NEB) | 2 | 2% |
ATP(10Mm,NEB) | 1 | 200uM |
RNA ligase1(20U/μL,NEB) | 1 | 2U |
DTT(100Mm,NEB,) | 0.2 | 0.4mM |
SuperaseIn(20U/μL,Ambion) | 0.2 | 4U |
Total volume | 5ul |
Above-mentioned reaction solution is uniformly mixed, following procedure is executed on thermostat:22 DEG C 60 minutes.
3) purifying of connection product
It utilizesOligo(dT)25Magnetic bead (Thermo Fisher companies, article No. 61002) to connection product into
Row purifying, is used in combination the sterile water without RNA enzyme of 5.5ul to be eluted.
4) reverse transcription reaction
According to the form below prepares connection reaction solution.
Above-mentioned reaction solution is uniformly mixed, following procedure is executed on thermostat:42 DEG C 60 minutes then 95 DEG C 3 minutes.
5) the amplification enrichment in library
According to the form below prepares connection reaction solution.
Ingredient | Volume (ul) | Final concentration |
5x Phusion HF Buffer(NEB) | 10 | 1x |
SR Primer for Illumina(10μM,NEB) | 2.5 | 200nM |
Index(X)Primer(10μM,NEB) | 2.5 | 200nM |
dNTP Mix(10mM,NEB) | 1.25 | 250uM |
Phusion DNA Polymerase(2U/μL,NEB) | 0.5 | 1U |
ddH2O | 23.25 | - |
Total volume | 40 |
Above-mentioned reaction solution is uniformly mixed, following procedure is executed in PCR instrument:98 DEG C of 30 seconds pre-degenerations;98 DEG C 10 seconds, 62
DEG C 30 seconds, 72 DEG C 15 seconds, totally 15 cycles;72 DEG C 60 seconds.
4. carrying out miRNA high-throughput sequencing library preparations using NEB kits
As a comparison, the libraries NEB small RNA reagent preparation is selected in this example:Multiplex
Small RNA Library Prep Kit for Illumina (NEB companies, article No. E7560S) are carried out with the method for the present invention
Comparison, to illustrate that the method for the present invention has the advantage that in sensitivity.Respectively using the miRNA of 2ng and 10ng as initial amount into
The preparation of row high-throughput sequencing library, method and operating method step are carried out with reference to kit specification.
5. library PAGE is purified
PAGE glue (6%) electricity is carried out by invention method and using the libraries miRNA prepared by NEB kits respectively
Swimming, 300V 1 hour, after waiting for electrophoresis, the corresponding target stripe of recovery purifying.
The results are shown in Figure 4, and library band prepared by the method for the present invention is located near 170bp, prepared by NEB kits
Library band is located near the positions 140bp, with expected consistent.For the library that miRNA initial amounts are 2ng, the method for the present invention
The library band of preparation is clearly apparent, and the library band prepared by NEB kits is faint.
6. library quality inspection and the sequencing of upper machine
Using analyzing biochips system Bioanalyzer2100 cooperation high-sensitivity DNA assay kit (Agilent companies,
Article No. 5067-4626) distribution of lengths in library is determined.The results are shown in Figure 5, and library distribution of lengths is the same as expected consistent.
Utilize high-throughput sequencing library quantification kit:(Kapa is public by KAPA Library Quantification Kits
Department, article No. KK4824) copy number in library is determined, upper machine amount is sequenced to determine.It is carried out using illumina sequenators
Sequencing, sequencing strategy HiSeq2500SE50 are sequenced according to the applied sample amount of each library 8M Raw Reads.
7. bioinformatic analysis
1) Raw data quality assessment and filtration treatment
Quality evaluation and filtering are carried out to sequencing initial data using FastQC, cutadapt, FASTX-Toolkit software
Processing.Statistical result is as shown in table 1, the method for the present invention with NEB kits data in Clean Reads quantity and Q30 ratios
It is more consistent on the two major parameters.
1 sequencing data statistic of attribute result of table
(A)-(B) library that prepared by the method for the present invention, miRNA initial amounts are respectively 10ng and 2ng;(C)-(D) NEB reagents
Library prepared by box, miRNA initial amounts are respectively 10ng and 2ng.
2) reference sequences are compared
Compared with newest miRBase databases and the full reference gene group GRCh37 (hg19) of the mankind using Bowtie softwares
It is right.Statistical result is as shown in table 2, for the initial amount of 10ng miRNA, the text prepared by the method for the present invention and NEB kits
Library, comparison result is in desirable level;When miRNA initial amounts are 2ng, the comparison result in library prepared by the method for the present invention is still
It is more satisfactory, it is suitable with above-mentioned two groups of library results, and library comparison result prepared by NEB kits is poor.
2 clean reads comparison results of table count
(A)-(B) library that prepared by the method for the present invention, miRNA initial amounts are respectively 10ng and 2ng;(C)-(D) NEB reagents
Library prepared by box, miRNA initial amounts are respectively 10ng and 2ng.
3) different banking process and different being compared to each other between the library of library initial amount is built
Unique and overlapping miRNA relationships between four groups of libraries are as shown in fig. 6, utilize the method for the present invention system
Standby miRNA initial amounts are the library in the library and 2ng of 10ng, the text for being 10ng with miRNA initial amounts prepared by NEB kits
Library has preferable plyability between three;And for the initial amount of 2ng miRNA, library prepared by NEB kits,
MiRNA types are considerably less than other three kinds of libraries.Pearson correlation analysis is as shown in table 3, above-mentioned three kinds of libraries miRNA tables
Higher up to spectrum similarity, the initial amount for being substantially distinguished from the preparation of NEB kits is the library of 2ng miRNA.
Table 3 utilizes the Spearson correlation analysis in library prepared by the method for the present invention and NEB kits
(A)-(B) utilizes library prepared by the method for the present invention, and miRNA initial amounts are respectively 10ng and 2ng;(C)-(D) sharp
The library prepared with NEB kits, miRNA initial amounts are respectively 10ng and 2ng.
Embodiment 2
It is illustration the method for the present invention to high-flux sequence platform compatibility and compatibility, the present embodiment is still with blood plasma excretion
In body for miRNA high-throughput sequencing libraries preparation process, except that connector is substituted for and Ion Torrent microarray datasets
Sequence, other conditions are the same as shown in embodiment 1.
Correlated series involved in this example:
5 ' end connector R5A-2,5 '-CCUUGGCACCCGAGAAUUCCANNNNN;
Reverse transcriptase primer RTP-2,5 '-CAGAGTTCTACAGTCCGACGATCT(30)VN(1)。
1. the collection of plasma sample and its separation of middle excretion body are the same as example 1.
2. the extraction of excretion body total serum IgE is the same as example 1.
3. carrying out the preparation of miRNA high-throughput sequencing libraries with example 1 using present patent application method.
4. library PAGE is purified with embodiment 1.
Under the primer sequence combination of the present embodiment, library length is in 180bp or so.Electrophoresis result after PAGE is as schemed
Shown in 7, band is consistent with being expected, and illustrates successfully prepare miRNA high-throughput sequencing libraries in the case where the primer sequence combines.
Embodiment 3
It is prepared by the high-throughput sequencing library of the circular rna in the blood plasma excretion body that this example passes through a kind of exemplary embodiment
For process, to describe to prepare the principle and feature in the nonlinear libraries RNA without poly (A) tail using the method for the present invention.
In this example, connector matches with Illumina microarray datasets, and 3 ' end end homopolymers are made of base A;Reverse transcriptase primer
In 5 ' end capture sequences by 20 T base compositions, anchor series VN(1)。
Correlated series involved in this example:
5 ' end connector R5A-1,5 '-CCUUGGCACCCGAGAAUUCCANNNNN;
Reverse transcriptase primer RTP-1,5 '-CAGAGTTCTACAGTCCGACGATCT(20)VN(1)。
1. the collection of plasma sample and its separation of middle excretion body
Inventor has collected the complete of the normal physical examination Healthy Peoples of 50ml in May, 2016 in the first affiliated hospital of University Of Suzhou
Blood, whole blood sample are loaded on EDTA- anticoagulant tubes.The method and operating procedure of excretion body are the same as embodiment 1 in whole blood.
2. the extraction of total serum IgE in excretion body
Method and operating procedure are the same as embodiment 1.
3. the rRNA in the above-mentioned RNA of removal
Kit is removed using rRNA:Ribo-Zero rRNA Removal Kit (Human, Mouse, Rat) are tried
Agent box (Illumina companies, article No. K1550-01) removes the rRNA in above-mentioned total serum IgE, and method and steps is with reference to explanation
Book carries out, and is finally dissolved without RNA enzyme water with 20ul.The RNA sample of above-mentioned removal rRNA is placed in -80 DEG C
Save backup or be directly entered next experiment.
4. removing poly (A) RNA in total serum IgE
1) it mixes wellOligo(dT)25Magnetic bead (Thermo Fisher companies, article No. 61002), then
It takes in 30 μ l to 1.5ml centrifuge tubes, is placed on magnetic frame, stand 3 minutes, carefully drawn with pipette tips and abandon supernatant.
2) 100 μ l combination buffers are added and fully blow and beat mixing, brief centrifugation stands 3 minutes, use rifle on magnetic frame
Careful draw of head abandons supernatant.
3) it repeats the above steps primary.
4) 50 μ l combination buffers are added, mixing is fully blown and beaten.
5) the total serum IgE sample of above-mentioned removal rRNA is mended without RNA enzyme water to 50 μ l.
6) sample is added to above-mentionedOligo(dT)25In magnetic bead, mixing is fully blown and beaten, 65 DEG C are incubated 5 points
Clock is placed in 2 minutes on ice at once after incubation.
7) it is incubated at room temperature 10 minutes on blending instrument, brief centrifugation, 3 minutes is stood on magnetic frame.
8) clear liquid is transferred to another 1.5ml centrifuge tube, 180 μ l is adjusted to using no RNA enzyme water.The 3M of 18 μ l is added
Sodium acetate, the glycogen (10mg/ml) of 2 μ l, three times volume ice precooling absolute ethyl alcohol, slight vortex mixed, -20 DEG C are quiet
Set 1 hour.16000 × g is centrifuged 30 minutes under the conditions of 4 DEG C, and careful draw abandons supernatant.With 20 μ l without RNA after drying
Enzyme water dissolution RNA precipitate is to get having arrived Poly (A)-RNA sample.
5. then carrying out RNase R digestion
Linear rna is digested using RNase R (Epicentre companies, article No. RNR07250).According to the form below prepares reaction system:
Pressure-vaccum mixing is carried out to above-mentioned 20 μ l reaction systems, brief centrifugation is placed on 37 DEG C and is incubated 1 hour.After digestion
20 μ L phenol-chloroforms-isoamyl alcohol (Thermo Fisher companies, article No. 15593-031), concussion mixing is added to be digested to terminate,
16000 × g, 4 DEG C centrifuge 5 minutes.Supernatant is abandoned, 6 μ L 4M lithium chlorides, 1 μ L glycogens (10mg/ml), the nothing of 90 μ L precoolings are precipitated
Water-ethanol is reverse to be resuspended, and -80 DEG C of precipitations centrifuge 30 minutes for 1 hour, 16000 × g, 4 DEG C, and two are washed with 75% ethyl alcohol of precooling
It is secondary.It is dissolved to get to treated circular rna sample without RNase enzyme water with 20 μ L after drying.By above-mentioned ring-type
RNA sample preserves or is directly entered next experiment under the conditions of being placed in -80 DEG C.
6.RNA fragmentations
Utilize the RNA fragmentation reagents boxes of NEB:RNase III RNAFragmentation Module
(NEB companies, article No. E6146) carries out fragmentation processing to above-mentioned RNA.RNA fragmentation systems are established according to following table:
Component | Volume (μ l) |
Above-mentioned RNA after purification | 17 |
RNase III(1unit/μl) | 1 |
RNase III Reaction Buffer(10X) | 2 |
Total volume | 20 |
Above-mentioned reaction solution is uniformly mixed, is incubated 5 minutes for 37 DEG C on thermostat, the EDTA of 10mM is added after reaction
Terminate reaction.It is purified using RNA purification kits (Qiagen companies, article No. 74204), with the water for going RNA enzyme of 10 μ L
It is dissolved to get to the RNA of fragmentation.Coordinate pico RNA cores using analyzing biochips system Bioanalyzer2100
Total serum IgE sample after piece (Agilent companies, article No. 5067-1548) removal rRNA is detected.As a result such as Fig. 8 institutes
Show, the circular rna clip size after fragmentation is in 200-300bp.RNA sample after above-mentioned fragmentation is placed in -80 DEG C of preservations
Or it is directly entered next experiment.
7. the preparation in library
High-throughput sequencing library is carried out using the circular rna after the above-mentioned fragmentation of 10ng and 50ng as initial amount respectively
It prepares.As a comparison, library kit is built using the RNA of NEB simultaneously in this example:NEBNext Ultra Directional
RNA Library Prep Kit (NEB companies, article No. E7420S) carry out the preparation in library, are carried out pair in the method for the same present invention
Than.
8. the PAGE electrophoresis in library and purifying are the same as embodiment 1.
The results are shown in Figure 9, and prepared by the library that the present invention can be used successfully to the high-flux sequence of circular rna, the present invention
Library band prepared by method and NEB kits is respectively positioned at 300bp to the positions 400bp, it is expected the band obtained.
In addition, the circular rna sample for building the fragmentation that library initial amount is 10ng, the library band prepared using the method for the present invention are clear
It is clear, and utilize the library band prepared by NEB kits faint.Therefore, the method for the present invention compares conventional method, and sensitivity obtains
Larger promotion.
Embodiment 4
It is illustration the method for the present invention to high-flux sequence platform compatibility and compatibility, this example is with Ion Torrent
Microarray dataset illustrates for representative.The present embodiment is still with miRNA high-throughput sequencing library preparation process in blood plasma excretion body
For, except that connector matches with Ion Torrent microarray datasets, remaining condition is the same as embodiment 1.
Correlated series involved in this example:
5 ' end connector R5A-3, CCTCTCTATGGGCAGTCGGTGATNNNNN;
Reverse transcriptase primer RTP-3, CCAUCUCAUCCCUGCGUGUCUCCGACT(20)VN(1)。
1. the collection of plasma sample and its separation of excretion body are the same as embodiment 1.
2. the extraction of excretion body total serum IgE is the same as embodiment 1.
3. carrying out the preparation of miRNA high-throughput sequencing libraries with embodiment 1 using present patent application method.
4. library PAGE is purified with embodiment 1.
The results are shown in Figure 10 after PAGE electrophoresis, and library band is located near 140bp, clear apparent, with expected one
It causes.
5. library quality inspection and high-flux sequence
Using analyzing biochips system Bioanalyzer2100 cooperation high-sensitivity DNAs assay kit, (Agilent is public
Department, article No. 5067-4626) distribution of lengths in library is determined.As a result as shown in figure 11, library distribution of lengths is the same as expected one
It causes, concentration meets confidential ask.
Utilize high-throughput sequencing library quantification kit:(Kapa is public by KAPA Library Quantification Kits
Department, article No. KK4827) copy number in library is determined, upper machine amount is sequenced to determine.Utilize Ion ProtonTMSequenator into
Row sequencing, is sequenced according to the applied sample amount of each library 10M raw reads.
6. bioinformatic analysis
1) Raw data quality assessment and filtration treatment
Quality evaluation and filtering are carried out to sequencing initial data using FastQC, cutadapt, FASTX-Toolkit software
Processing.Treated that statistical result is as shown in table 4 through filtering for the lower machine data of sequencing, is 10ng and 2ng miRNA for initial amount
Library, the two in Clean Reads numbers and Q30 ratios the two mainly to assess result in parameter suitable.
4 Raw data quality statistical result of table
(A) initial amount is the library of 10ng miRNA, and (B) initial amount is the library of 2ng miRNA.
2) reference sequences are compared
Using Bowtie softwares by sequence with miRNA databases miRBase and the full reference gene group GRCh37 of the mankind
(hg19) it is compared.The results are shown in Table 5, and for the library that miRNA initial amounts are 10ng and 2ng, comparison result is equal
Reach desirable level.
5 clean reads comparison results of table count
(A) initial amount is the library of 10ng miRNA, and (B) initial amount is the library of 2ng miRNA.
3) correlation analysis between library
Unique and overlapping miRNA correlations between two groups of libraries are as shown in figure 12, risen for building library
Beginning amount is the library of 10ng miRNA and 2ng miRNA, and the miRNA that two kinds of libraries are identified has preferably consistent in type
Property.
Pearson correlation analysis is as shown in table 6, and the miRNA express spectra similarities in above two library are higher.
Spearson correlation analysis between library prepared by 6 the method for the present invention of table
(A) initial amount is the library of 10ng miRNA, and (B) initial amount is the library of 2ng miRNA.
<110>Li Hui brightness Liu Yue stars
<120>A kind of preparation method of RNA high-throughput sequencing libraries
<141> 2016-12-10
<160> 10
<210> 1
<211> 26
<212> RNA
<213>Artificial sequence
<220>
<221> misc_feature
<222> (22)..(26)
<223>N=a or g or c or t
<400> 1
ccuuggcacc cgagaauucc annnnn 26
<210> 2
<211> 30-77
<212> DNA
<213>Artificial sequence
<220>
<221> misc_feature
<222> (29)..(77)
<223>N=a or g or c or t, v=a or g or c
<400> 2
cagagttcta cagtccgacg atct(5-50)vn(1-3) 30-77
<210> 3
<211> 30-77
<212> DNA
<213>Artificial sequence
<220>
<221> misc_feature
<222> (29)..(77)
<223>N=a or g or c or t, d=a or g or t
<400> 3
cagagttcta cagtccgacg atcc(5-50)dn(1-3) 30-77
<210> 4
<211> 30-77
<212> DNA
<213>Artificial sequence
<220>
<221> misc_feature
<222> (29)..(77)
<223>N=a or g or c or t, h=a or c or t
<400> 4
cagagttcta cagtccgacg atcg(5-50)hn(1-3) 30-77
<210> 5
<211> 30-77
<212> DNA
<213>Artificial sequence
<220>
<221> misc_feature
<222> (29)..(77)
<223>N=a or g or c or t, b=g or c or t
<400> 5
cagagttcta cagtccgacg atca(5-50)bn(1-3) 30-77
Claims (19)
1. a kind of preparation method of RNA high-throughput sequencing libraries, which is characterized in that include the following steps:(1) end extension is carried out
Reaction, makes the RNA without poly (A) tail extend one section of sequence in 3 ' ends;(2) it is attached reaction, makes above-mentioned RNA at it
5 ' end connection top connection R5A, obtain RNA connection products;(3) above-mentioned RNA connection products are purified;(4) it utilizes specifically
Reverse transcriptase primer RTP carries out reverse transcription reaction to above-mentioned RNA connection products after purification, obtains mono- chains of corresponding cDNA;(5) into
The enrichment of row PCR amplification is to get to high-throughput sequencing library.
2. the preparation method of RNA high-throughput sequencing libraries according to claim 1, which is characterized in that 3 ' end extension sequences
For one section of homopolymer sequence.
3. homopolymer sequence according to claim 2, it is characterised in that homopolymer by adenine (A), thymidine (T),
Any one composition in this five kinds of bases of cytimidine (C), guanine (G) or uracil (U), it is therefore preferable to base A.
4. the preparation method of NRA high-throughput sequencing libraries according to claim 1, which is characterized in that 5 ' end connector R5A by
Two sections of Sequence compositions:One section of unique recognition sequence of one section of distinguished sequence at 5 ' ends and 3 ' ends.
5. 5 ' end connector R5A according to claim 4, which is characterized in that 5 ' terminal specific sequences flush for high-flux sequence
3 ' the terminal sequences or intact full-length sequence of head.
6. 5 ' end connector R5A according to claim 4, which is characterized in that 3 ' end unique recognition sequences are by A, T, C and G
The random sequence of these four base compositions, length 5nt.
7. 5 ' the end connector R5A according to claim 5 and claim 6, which is characterized in that its sequence includes Seq ID
No:1。
8. the preparation method of RNA high-throughput sequencing libraries according to claim 1, which is characterized in that reverse transcriptase primer RTP
By distinguished sequence two parts Sequence composition of the distinguished sequence and 3 ' ends at 5 ' ends.
9. reverse transcriptase primer RTP according to claim 7, which is characterized in that the distinguished sequence at 5 ' ends is high-flux sequence
3 ' the terminal sequences or intact full-length sequence of platform connector.
10. reverse transcriptase primer RTP according to claim 7, which is characterized in that the distinguished sequence at 3 ' ends is produced by RNA connections
Object captures sequence and anchor series two parts composition.
11. RNA connection products according to claim 10 capture sequence, which is characterized in that the sequence with the ends RNA3 ' by prolonging
Stretch the complementary base composition of sequence phase.
12. RNA connection products according to claim 10 capture sequence, which is characterized in that the length of the sequence is 2-
50nt, it is therefore preferable to 10-30nt.
13. anchor series according to claim 10, which is characterized in that its 1st bit base (from 5 ' ends to 3 ' extreme directions)
For the degeneracy base not with RNA homopolymer complementations.
14. anchor series according to claim 10, which is characterized in that its 2nd bit base (from 5 ' ends to 3 ' extreme directions)
To the sequences of 3 ' least significant ends, by A, T, C and G, these four bases form at random.
15. anchor series according to claim 10, which is characterized in that the sequence length is 1-5nt, it is therefore preferable to 1-
3nt, more preferably 1nt.
16. according to claim 9 and reverse transcriptase primer RTP according to any one of claims 10, which is characterized in that its sequence includes Seq
ID No:2、Seq ID No:3、Seq ID No:4 and Seq ID No:5.
17. the preparation method of RNA high-throughput sequencing libraries according to claim 1, which is characterized in that be used for amplified library
PCR primer be high-flux sequence platform connector 5 ' terminal sequences or intact full-length sequence.
18. the preparation method of RNA high-throughput sequencing libraries according to claim 1, which is characterized in that this method can be right
All RNA without poly (A) tail carry out high-throughput sequencing library preparation, including the linear RNA without poly (A) tail and
Circular rna.
19. the preparation method of RNA high-throughput sequencing libraries according to claim 19, which is characterized in that linear
It, can be directly using the method for the present invention or into carrying out high-throughput sequencing library system after Break Row without the RNA of poly (A) tail
It is standby;The nonlinear RNA without poly (A) tail can carry out high-throughput sequencing library preparation after interrupting using this method.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710029640.9A CN108315320A (en) | 2017-01-16 | 2017-01-16 | A kind of preparation method of RNA high-throughput sequencing libraries |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710029640.9A CN108315320A (en) | 2017-01-16 | 2017-01-16 | A kind of preparation method of RNA high-throughput sequencing libraries |
Publications (1)
Publication Number | Publication Date |
---|---|
CN108315320A true CN108315320A (en) | 2018-07-24 |
Family
ID=62891846
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201710029640.9A Pending CN108315320A (en) | 2017-01-16 | 2017-01-16 | A kind of preparation method of RNA high-throughput sequencing libraries |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN108315320A (en) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112585279A (en) * | 2018-09-05 | 2021-03-30 | 深圳华大生命科学研究院 | RNA library building method and kit |
US20230104375A1 (en) * | 2021-10-01 | 2023-04-06 | Sequmed Biotechnology, Inc. | Method for multiplexable strand-specific 3' end sequencing of mrna transcriptome primer set, kit and application thereof |
CN117417928A (en) * | 2023-10-13 | 2024-01-19 | 近岸蛋白质科技(菏泽)有限公司 | Treatment method beneficial to linear RNA digestion of complex structure and application thereof |
Citations (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102301011A (en) * | 2009-02-02 | 2011-12-28 | 埃克西库恩公司 | Method for quantification of small RNA species |
CN104711340A (en) * | 2013-12-17 | 2015-06-17 | 北京大学 | Transcriptome sequencing method |
CN105063209A (en) * | 2015-08-10 | 2015-11-18 | 北京吉因加科技有限公司 | Quantitative detection method of exosome miRNA (micro ribonucleic acid) |
CN105200530A (en) * | 2015-10-13 | 2015-12-30 | 北京百迈客生物科技有限公司 | Method for establishing multi-sample hybrid library suitable for high-flux whole-genome sequencing |
CN105297144A (en) * | 2015-10-27 | 2016-02-03 | 北京百迈客生物科技有限公司 | High throughput library construction method for small RNA of prokaryote |
CN105463585A (en) * | 2014-09-12 | 2016-04-06 | 清华大学 | Method for constructing sequencing library based on single-stranded DNA molecule, and applications thereof |
CN105734052A (en) * | 2016-04-19 | 2016-07-06 | 武汉生命之美科技有限公司 | Method for constructing mRNA 5'-termnal information library |
CN105985945A (en) * | 2015-01-30 | 2016-10-05 | 深圳华大基因研究院 | mRNA fragmentation method and method for constructing sequencing library based on same |
-
2017
- 2017-01-16 CN CN201710029640.9A patent/CN108315320A/en active Pending
Patent Citations (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102301011A (en) * | 2009-02-02 | 2011-12-28 | 埃克西库恩公司 | Method for quantification of small RNA species |
CN104711340A (en) * | 2013-12-17 | 2015-06-17 | 北京大学 | Transcriptome sequencing method |
CN105463585A (en) * | 2014-09-12 | 2016-04-06 | 清华大学 | Method for constructing sequencing library based on single-stranded DNA molecule, and applications thereof |
CN105985945A (en) * | 2015-01-30 | 2016-10-05 | 深圳华大基因研究院 | mRNA fragmentation method and method for constructing sequencing library based on same |
CN105063209A (en) * | 2015-08-10 | 2015-11-18 | 北京吉因加科技有限公司 | Quantitative detection method of exosome miRNA (micro ribonucleic acid) |
CN105200530A (en) * | 2015-10-13 | 2015-12-30 | 北京百迈客生物科技有限公司 | Method for establishing multi-sample hybrid library suitable for high-flux whole-genome sequencing |
CN105297144A (en) * | 2015-10-27 | 2016-02-03 | 北京百迈客生物科技有限公司 | High throughput library construction method for small RNA of prokaryote |
CN105734052A (en) * | 2016-04-19 | 2016-07-06 | 武汉生命之美科技有限公司 | Method for constructing mRNA 5'-termnal information library |
Non-Patent Citations (1)
Title |
---|
YUKO OGAWA等: "Small RNA Transcriptomes of Two Types of Exosomes in Human Whole Saliva Determined by Next Generation Sequencing", 《BIOL. PHARM. BULL》 * |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112585279A (en) * | 2018-09-05 | 2021-03-30 | 深圳华大生命科学研究院 | RNA library building method and kit |
US20230104375A1 (en) * | 2021-10-01 | 2023-04-06 | Sequmed Biotechnology, Inc. | Method for multiplexable strand-specific 3' end sequencing of mrna transcriptome primer set, kit and application thereof |
CN117417928A (en) * | 2023-10-13 | 2024-01-19 | 近岸蛋白质科技(菏泽)有限公司 | Treatment method beneficial to linear RNA digestion of complex structure and application thereof |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
AU2022200179B2 (en) | Error Suppression In Sequenced DNA Fragments Using Redundant Reads With Unique Molecular Indices (UMIs) | |
AU2018254595B2 (en) | Using cell-free DNA fragment size to detect tumor-associated variant | |
US20230340590A1 (en) | Method for verifying bioassay samples | |
CN107077537B (en) | Detection of repeat expansions using short-read sequencing data | |
CN114107458B (en) | Methods for sequencing immune repertoires | |
CA3220983A1 (en) | Optimal index sequences for multiplex massively parallel sequencing | |
CN110832087A (en) | Universal short adaptor for indexing of polynucleotide samples | |
CN114008199A (en) | High throughput single cell libraries and methods of making and using the same | |
WO2014164838A1 (en) | Quantification of cell-specific nucleic acid markers having a particular methylation state | |
CN108315320A (en) | A kind of preparation method of RNA high-throughput sequencing libraries | |
CN108315824A (en) | A kind of construction method of RNA high-throughput sequencing libraries | |
HK40040528B (en) | Error suppression in sequenced dna fragments using redundant reads with unique molecular indices (umis) | |
HK40040528A (en) | Error suppression in sequenced dna fragments using redundant reads with unique molecular indices (umis) | |
HK40068328A (en) | High-throughput single-cell libraries and methods of making and of using | |
HK1244513B (en) | Error suppression in sequenced dna fragments using redundant reads with unique molecular indices (umis) | |
HK1193847A (en) | Method for verifying bioassay samples |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
WD01 | Invention patent application deemed withdrawn after publication | ||
WD01 | Invention patent application deemed withdrawn after publication |
Application publication date: 20180724 |