CN119585439A

CN119585439A - Probe-based nucleic acid and protein analysis

Info

Publication number: CN119585439A
Application number: CN202380054475.6A
Authority: CN
Inventors: 凯瑟琳·法伊弗; A·S·科韦; A·J·希尔; 保罗·尤金·隆德; 贾瓦德·N·阿布桑德; 莎米拉·查特吉巴塔查尔吉; 瑞恩·斯托特
Original assignee: 10X Genomics Inc
Current assignee: 10X Genomics Inc
Priority date: 2022-06-29
Filing date: 2023-06-28
Publication date: 2025-03-07
Also published as: US20240002914A1; WO2024006392A1; US20240019422A1

Abstract

Provided herein are systems and methods for processing biomolecules (e.g., nucleic acid molecules, proteins) from a sample. Methods for treating biological molecules may include hybridizing a probe molecule to a target region of a nucleic acid molecule (e.g., ribonucleic acid (RNA) molecule) and barcoding the probe-nucleic acid molecule complex or derivative thereof. Such methods may include performing nucleic acid reactions, such as extension, denaturation, and amplification. Methods for treating a sample may include hybridizing a probe to (i) a target region of a nucleic acid molecule (e.g., an RNA molecule) and (ii) a reporter oligonucleotide that features a binding group, and adding a barcode to the molecule with which the probe is associated. One or more of the processes of the methods described herein may be performed within a partition, such as a droplet or a well.

Description

Probe-based nucleic acid and protein analysis

Cross reference

The present application claims the benefit of U.S. provisional application No. 63/356,887 filed on 29 th 6 and 2022 and U.S. provisional application No. 63/386,135 filed on 5 th 12 and 2022, each of which is incorporated herein by reference in its entirety.

Background

The sample may be processed for various purposes, such as identifying certain types of moieties within the sample. The sample may be a biological sample. Biological samples may be processed, such as to detect a disease (e.g., cancer) or to identify a particular species. There are various methods of processing the sample, such as Polymerase Chain Reaction (PCR) and sequencing.

Biological samples may be processed within various reaction environments, such as partitions. The partitions may be holes or droplets. The droplets or wells may be used to process the biological sample in a manner that enables the biological sample to be separately partitioned and processed. For example, such droplets may be fluidly isolated from other droplets, enabling accurate control of the respective environments in the droplets.

Various processes, such as chemical or physical processes, may be performed on the biological samples in the partitions. The sample in the partition may be heated or cooled or chemically reacted to produce species that can be processed qualitatively or quantitatively.

Biomolecules, such as nucleic acids and proteins, within a biological sample can be detected and/or processed for quantitative or qualitative assessment.

Disclosure of Invention

The present disclosure provides methods for sample processing and analysis. Methods provided herein can include hybridizing a probe to a molecule of interest (e.g., target protein, target nucleic acid molecule), and treating the probe-molecule complex. Such treatment may include barcoding probes, probe-molecule complexes or molecules, and/or performing nucleic acid reactions. The probes can comprise nucleic acid molecules, and further processing can include extension, denaturation, and amplification processes to provide nucleic acid molecules comprising sequences that are identical or substantially identical or complementary to a target region of a nucleic acid molecule of interest (e.g., a target nucleic acid molecule). The method may include hybridizing a first probe and a second probe to a first target region and a second target region of a nucleic acid molecule, ligating the first probe and the second probe to provide a probe-ligated nucleic acid molecule, and barcoding the probe-ligated nucleic acid molecule. The method may include hybridizing a first probe to a first target region of a nucleic acid molecule, adding a barcode to the probe, and hybridizing a second probe to a second target region of the nucleic acid molecule to produce a barcoded, probe-linked nucleic acid molecule. In some aspects, the method can include hybridizing a probe to a nucleic acid molecule attached to the characteristic binding moiety to provide a probe binding moiety complex, and adding a barcode to the probe. One or more processes of the methods provided herein may be performed within a partition (such as a droplet or a well). The methods of the present disclosure are useful, for example, for controlled analysis and processing of analytes such as biological particles, nucleic acids, and proteins. One or more of the methods described herein may allow for genomic, transcriptome, or exome profiling with greater sensitivity. The methods of the present disclosure can be used to detect variants and characterize nucleic acid molecules, e.g., for assessing Single Nucleotide Polymorphisms (SNPs), alternative splicing, insertions, deletions, V (D) J rearrangements, and the like. The methods of the present disclosure can be used for multiplex analysis of nucleic acids and proteins while minimizing reagent usage, for example, by reducing the number of unoccupied partitions for analysis.

In one aspect, disclosed herein is a method for multiplex nucleic acid assays comprising (a) contacting a cell, nucleus or cell bead with a first probe, a second probe, and a third probe under conditions sufficient to produce molecules associated with the first probe and molecules associated with the second probe, wherein the cell, nucleus or cell bead comprises (i) a nucleic acid molecule comprising a first target region and a second target region and (ii) a feature coupled to a feature binding group, wherein the feature binding group comprises (i) a reporter oligonucleotide associated with the feature and (ii) a feature probe binding sequence, wherein the first probe comprises (i) a first probe sequence complementary to the first target region and (ii) a probe capture sequence, wherein the second probe comprises (i) a third probe sequence complementary to the feature probe binding sequence, and (ii) a probe capture sequence, wherein in a first partition of the first set of partitions, the first set of nucleic acid molecules and the second nucleic acid molecules are co-coded under conditions sufficient to produce a first set of nucleic acid molecules and a second set of co-coded barcodes, wherein the first set of co-coded barcodes comprises the first nucleic acid molecules and the first set of co-coded barcodes and the first co-coded molecular barcodes, wherein the probe binding molecule comprises (i) a probe binding sequence complementary to the probe capture sequence and (ii) a barcode binding sequence complementary to the consensus sequence, and (c) in a second partition of the second set of partitions, (i) contacting the first barcode-added nucleic acid molecule or derivative thereof with a first capture molecule of the plurality of capture molecules under conditions sufficient to produce a third barcode-added nucleic acid molecule, and (ii) contacting the second barcode-added nucleic acid molecule or derivative thereof with a second capture molecule of the plurality of capture molecules under conditions sufficient to produce a fourth barcode-added nucleic acid molecule, wherein the plurality of capture molecules comprises a second barcode sequence, wherein each of the third barcode-added nucleic acid molecule and the fourth barcode-added molecule comprises a sequence corresponding to the first barcode sequence and a sequence corresponding to the second barcode sequence.

In some embodiments, the first target region and the second target region are located on the same strand of the nucleic acid molecule. In some embodiments, the probe capture sequence is common to a plurality of first probes including the first probe, wherein one or more additional partitions of the first set of partitions include one or more additional probe-associated nucleic acid molecules, wherein each of the one or more additional probe-associated nucleic acid molecules comprises the probe capture sequence. In some embodiments, the second probe comprises a second probe capture sequence complementary to the capture sequences of the plurality of capture molecules, and wherein (c) comprises hybridizing the second probe capture sequence to the capture sequences. In some embodiments, the barcode molecule comprises a capture binding sequence complementary to the capture sequences of the plurality of capture molecules, and wherein (c) comprises hybridizing the capture binding sequence to the capture sequences. In some embodiments, the first set of partitions is a plurality of wells. In some embodiments, the second set of partitions is a plurality of droplets. In some embodiments, the second set of partitions is a plurality of wells. In some embodiments, a plurality of capture molecules are coupled to the particle. In some embodiments, the particles are beads. In some embodiments, the beads are gel beads. In some embodiments, each capture molecule of the plurality of capture molecules coupled to the gel bead comprises a second barcode sequence. In some embodiments, the one or more additional partitions of the second set of partitions comprise one or more additional gel beads of the plurality of gel beads, and wherein the second barcode sequence is unique to the gel bead of the plurality of gel beads. In some embodiments, one capture molecule of the plurality of capture molecules comprises a third barcode sequence specific for the capture molecule of the plurality of capture molecules. In some embodiments, the one or more additional partitions of the second set of partitions include one or more additional capture molecules, and wherein the second barcode sequence is unique to a second partition of the second set of partitions. In some embodiments, (a) comprises hybridizing a first probe and a second probe to a first target region and a second target region, respectively. In some embodiments, the method further comprises subjecting the first probe-associated molecule to conditions sufficient to produce a probe-linked nucleic acid molecule comprising the first probe linked to a second probe. In some embodiments, the probe-linked nucleic acid molecule is produced via chemical or enzymatic ligation of the first probe and the second probe. In some embodiments, the chemical or enzymatic ligation occurs after (b). In some embodiments, the first target region and the second target region are adjacent. In some embodiments, the first target region and the second target region are not adjacent, and the method further comprises (i) extending a first probe or a second probe annealed to the first target region or the second target region, respectively, toward the second target region or the first target region, respectively, to produce an extended probe, and (ii) ligating the extended probe to the second probe or the first probe, respectively. In some embodiments, (a) comprises contacting the first probe and the second probe with a nucleic acid molecule within a cell or nucleus. In some embodiments, the first partition comprises a plurality of cells, cell beads, or nuclei. In some embodiments, the cell or nucleus is permeabilized. In some embodiments, the cell or nucleus is immobilized. In some embodiments, the method further comprises releasing the first probe-associated molecule or derivative thereof from the cell, nucleus, or cell bead. In some embodiments, the releasing comprises lysing the cells. In some embodiments, the reporter oligonucleotide comprises a characteristic probe binding sequence. In some embodiments, the method further comprises combining the first barcoded nucleic acid molecule, the second barcoded nucleic acid molecule, the additional first barcoded nucleic acid molecule from the first set of partitions, and the additional second barcoded nucleic acid molecule from the first set of partitions after (b) and before (c). In some embodiments, the method further comprises combining, after (c) and before sequencing, a third barcoded nucleic acid molecule, a fourth barcoded nucleic acid molecule, an additional third barcoded nucleic acid molecule from the second set of partitions, and an additional fourth barcoded nucleic acid molecule from the second set of partitions. In some embodiments, the probe capture sequence is 8 to 50bp.

In another aspect of the disclosure, provided herein is a method comprising (a) contacting a nucleic acid molecule with a first probe to produce a probe-associated nucleic acid molecule, wherein the nucleic acid molecule comprises a first target region and a second target region that is not adjacent to the first target region, wherein the first probe comprises a first probe sequence that is complementary to the first target region, (b) extending the first probe under conditions sufficient to produce an extended probe molecule comprising a sequence that is complementary to the second target region, (c) providing, in one of a plurality of partitions, an extended probe molecule, a second probe, a barcode molecule, and a probe-binding molecule under conditions sufficient to produce a barcode-tagged molecule, wherein the second probe comprises a second probe sequence corresponding to the second target region, wherein the first probe or the second probe comprises a probe capture sequence, wherein the barcode molecule comprises (i) a probe-capture sequence and (ii) a barcode sequence that is complementary to the probe-capture sequence, wherein the probe-binding molecule comprises a probe binding sequence that is complementary to the probe sequence and (ii) a sequence that is complementary to the target sequence, and a sequence corresponding to the first barcode region.

In some embodiments, (c) comprises ligating a second probe to the barcode molecule. In some embodiments, the linkage comprises a chemical linkage or an enzymatic linkage. In some embodiments, the particles are beads. In some embodiments, the beads are gel beads. In some embodiments, the cell or nucleus is permeabilized. In some embodiments, the cell or nucleus is immobilized. In some embodiments, the first probe comprises a probe capture sequence. In some embodiments, the method further comprises releasing the extended probe molecule from the nucleic acid molecule after (b). In some embodiments, releasing comprises releasing ribonucleic acid (RNA) strands using a ribonuclease. In some embodiments, the release comprises a thermal cycle. In some embodiments, (c) comprises (i) hybridizing a second probe sequence of a second probe to a complementary sequence of a second target region, and (ii) extending the second probe. In some embodiments, in (c), the barcode molecule and the probe binding molecule are provided as a pre-annealed complex, wherein the barcode capture sequence anneals to the barcode binding sequence in the pre-annealed complex. In some embodiments, the first probe comprises a probe capture sequence, wherein (c) comprises (i) annealing the probe binding sequence and the barcode binding sequence to the probe capture sequence and the barcode capture sequence, respectively, (ii) ligating the barcode molecule and the extended probe molecule to produce a first barcoded molecule, and (iii) annealing the second probe to the first barcoded molecule and initiating an extension reaction to produce the barcoded molecule. In some embodiments, (ii) and (iii) are performed outside the partition. In some embodiments, the second probe comprises a probe capture sequence, wherein (c) comprises (i) annealing the second probe to the extended probe molecule and initiating an extension reaction to produce an extended molecule, (ii) annealing the probe binding sequence and the barcode binding sequence to the probe capture sequence and the barcode capture sequence, and (iii) ligating the barcode molecule and the extended molecule.

In another aspect, provided herein is a method of analyzing a sample, comprising (a) providing (i) a characteristic binding group that binds to at least a portion of the sample, wherein the characteristic binding group comprises a reporter oligonucleotide, wherein the reporter oligonucleotide comprises a reporter barcode sequence, a first target region, and a second target region, wherein the first target region and the second target region are disposed on the same strand of the reporter oligonucleotide, (ii) a first probe comprising a first probe sequence, wherein the first probe sequence of the first probe is complementary to the first target region of the reporter oligonucleotide, and (iii) a second probe comprising a second probe sequence, wherein the second probe sequence of the second probe is complementary to the second target region of the reporter oligonucleotide, (b) subjecting the sample to conditions sufficient to (i) hybridize the first probe sequence of the first probe to the first target region of the reporter oligonucleotide and (ii) hybridize the second probe sequence of the second probe to the second target region of the reporter oligonucleotide to produce a probe-oligonucleotide complex, and (c) subjecting the probe to the complex to conditions sufficient to produce a nucleic acid-linked probe comprising the second probe.

In some embodiments, the method further comprises (d) attaching a barcode molecule to the probe-linked nucleic acid molecule. In some embodiments, (d) occurs in one partition. In some embodiments, the partitions are droplets or wells. In some embodiments, the sample comprises a nucleic acid molecule, and wherein (d) further comprises attaching an additional barcode molecule to the nucleic acid molecule or derivative thereof. In some embodiments, the characteristic binding group is an antibody. In some embodiments, the first probe or the second probe comprises an additional probe sequence. In some embodiments, the method further comprises attaching the barcode sequence to an additional probe sequence. In some embodiments, the method further comprises (d) providing a barcode molecule and a probe binding molecule comprising (i) a first sequence complementary to the additional probe sequence and (ii) a second sequence complementary to a capture sequence of the barcode molecule. In some embodiments, the method further comprises providing conditions sufficient to hybridize the first sequence to the additional probe sequence and the second sequence to the capture sequence of the barcode molecule, thereby producing a barcoded, probe-associated complex. In some embodiments, (d) occurs in one of the plurality of partitions. In some embodiments, the method further comprises combining the barcoded, probe-associated complexes from the partition with other barcoded, probe-associated complexes from other partitions of the plurality of partitions to produce a combined set of barcoded, probe-associated complexes. In some embodiments, the method further comprises (i) partitioning the combined set of barcoded, probe-associated complexes into a plurality of additional partitions, wherein one additional partition of the plurality of additional partitions comprises a barcoded, probe-associated complex and additional barcode molecules having additional barcode sequences, and (ii) attaching additional barcode molecules to the barcoded, probe-associated complexes. In some embodiments, additional barcode molecules are coupled to the beads. In some embodiments, the beads are gel beads. In some embodiments, the additional barcode molecule is releasably coupled to the bead. In some embodiments, (d) occurs before (c). In some embodiments, (d) occurs after (c). In some embodiments, at least the portion of the sample comprises the feature. In some embodiments, the feature is a protein. In some embodiments, the protein is a cell surface receptor or an intracellular protein. In some embodiments, the sample comprises cells or cell beads. In some embodiments, the cells are formalin-fixed, paraffin-embedded cells. In some embodiments, (c) occurs in one partition. In some embodiments, the partition is one of a plurality of partitions. In some embodiments, (c) comprises an enzymatic or chemical ligation. In some embodiments, the ligation is performed without adenosine triphosphate. In some embodiments, the first probe or the second probe comprises an adenylated terminus, a phosphorylated terminus, a ribonucleotide, a dideoxynucleotide, or a fin (flap) sequence. In some embodiments, the first target region and the second target region are separated by a gap region disposed between the first target region and the second target region. In some embodiments, (c) comprises performing an extension reaction to fill the gap region, thereby producing a probe-linked nucleic acid molecule. In some embodiments, (c) comprises providing a third probe comprising a third probe sequence complementary to the gap region, hybridizing the third probe sequence to the gap region, and providing conditions sufficient to produce a probe-linked nucleic acid molecule comprising a first probe linked to a second probe via the third probe.

In another aspect, the present disclosure provides a method for analyzing an embedded immobilized tissue, the method comprising (a) providing an embedded immobilized tissue, wherein (i) the embedded immobilized tissue comprises a plurality of cells, (ii) one cell of the plurality of cells comprises a nucleic acid molecule, and (iii) the embedded immobilized tissue is embedded in a solid medium, (b) removing at least a portion of the solid medium from the embedded immobilized tissue, thereby obtaining an immobilized tissue comprising the plurality of cells, (c) dissociating the immobilized tissue into cells of the plurality of cells, wherein the cells comprise cells comprising the nucleic acid molecule, and (d) using a barcode sequence in the cells to produce a barcode-comprising nucleic acid molecule comprising the barcode sequence and a sequence corresponding to the nucleic acid molecule.

In some embodiments, the fixed tissue is fixed at least about 1 year prior to (a). In some embodiments, the fixed tissue is fixed at least about 5 years prior to (a). In some embodiments, the method further comprises adding a first solvent to the embedded fixed tissue in (b) to dissolve the solid medium, thereby obtaining the fixed tissue. In some embodiments, the first solvent is a non-polar solvent. In some embodiments, the non-polar solvent comprises xylene. In some embodiments, the non-polar solvent comprisesXylene substitutes. In some embodiments, the method further comprises removing the first solvent from the fixed tissue. In some embodiments, the method further comprises adding a second solvent to the fixed tissue. In some embodiments, the second solvent is a polar solvent. In some embodiments, the polar solvent comprises ethanol. In some embodiments, the method further comprises removing the second solvent from the fixed tissue. In some embodiments, the method further comprises adding a rehydrating agent to the fixed tissue. In some embodiments, the rehydrating agent comprises water. In some embodiments, the method further comprises removing the rehydration agent from the tissue. In some embodiments, the method further comprises adding a buffer to the fixed tissue. In some embodiments, the method further comprises removing the buffer from the fixed tissue. In some embodiments, the method further comprises dissociating the tissue in (c). In some embodiments, the method further comprises resuspending the tissue in a supernatant. In some embodiments, the method further comprises filtering the tissue. In some embodiments, the method further comprises washing the tissue. In some embodiments, the method further comprises resuspending the tissue. In some embodiments, dissociating the tissue is performed using an automated dissociator. In some embodiments, the automated dissociator is GENTLEMACS ^TM Octo dissociator. In some embodiments, dissociating the tissue is performed using a manual morcellator. In some embodiments, (b) and (c) provide a cell yield of at least about 1x 10 ⁵ cells per two 25 micron sections of the fixed tissue. In some embodiments, the fixed tissue comprises fixed human tissue. In some embodiments, the fixed tissue comprises fixed connective tissue, fixed epithelial tissue, fixed organ tissue, fixed muscle tissue, fixed ligament, fixed tendon, fixed skin tissue, fixed breast tissue, fixed bladder, fixed kidney tissue, fixed liver tissue, fixed colon tissue, fixed thyroid tissue, fixed cervical tissue, fixed prostate tissue, fixed lung tissue, fixed heart tissue, fixed muscle tissue, fixed pancreatic tissue, fixed anal tissue, fixed bile duct tissue, fixed bone marrow, fixed uterine tissue, fixed, Fixed ovarian tissue, fixed endometrial tissue, fixed vaginal tissue, fixed vulvar tissue, fixed stomach tissue, fixed ocular tissue, fixed nasal tissue, fixed sinus tissue, fixed penile tissue, fixed salivary gland tissue, fixed intestinal tissue, fixed gall bladder tissue, fixed gastrointestinal tissue, fixed bladder tissue, fixed brain tissue, fixed spinal tissue, fixed neurons, fixed cells representing the blood brain barrier, fixed blood, fixed hair, fixed nails, fixed keratin, or fixed collagen. In some embodiments, (d) comprises hybridizing a barcode sequence to a nucleic acid molecule. In some embodiments, the method further comprises extending the barcode to form a barcoded nucleic acid molecule. In some embodiments, extending includes the use of enzymes. In some embodiments, extending includes ligating a barcode sequence to the probe. In some embodiments, the probe comprises a moiety that is at least partially complementary to the nucleic acid molecule. In some embodiments, the method further comprises partitioning the nucleic acid molecule into partitions prior to (d). In some embodiments, the partition is one of a plurality of wells. In some embodiments, the partition is one of a plurality of droplets. In some embodiments, the plurality of droplets comprises oil droplets. In some embodiments, the partition comprises a support. In some embodiments, the support comprises beads. In some embodiments, the barcode sequence is attached to a support. In some embodiments, the solid medium comprises paraffin. In some embodiments, the fixed tissue comprises formalin fixed tissue embedded in paraffin. In some embodiments, the nucleic acid molecule comprises a ribonucleic acid (RNA) molecule. In some embodiments, the nucleic acid molecule comprises a messenger RNA molecule.

In one aspect, the present disclosure provides a method for analyzing an embedded, fixed tissue, the method comprising (a) providing an embedded, fixed tissue, wherein (i) the embedded, fixed tissue comprises a plurality of cells, (ii) one cell of the plurality of cells comprises a nucleic acid molecule, and (iii) the embedded, fixed tissue is embedded in a solid medium, (b) removing at least a portion of the solid medium from the embedded, fixed tissue by adding a first solvent to the embedded, fixed tissue, thereby dissolving the solid medium, thereby obtaining a fixed tissue comprising the plurality of cells, (c) removing the first solvent from the fixed tissue, (d) adding a second solvent to the fixed tissue, (e) removing the second solvent from the fixed tissue, (f) adding a rehydrating agent to the fixed tissue, (g) removing the rehydrating agent from the fixed tissue, (h) adding a buffer to the fixed tissue, (i) removing the buffer from the fixed tissue, (j) dissociating the fixed tissue from the fixed tissue, wherein the plurality of cells comprise a nucleic acid sequence comprising the nucleic acid molecule and the nucleic acid sequence to the nucleic acid sequence.

In some embodiments, the solid medium comprises paraffin. In some embodiments, the first solvent comprises xylene. In some embodiments, the second solvent comprises ethanol. In some embodiments, the rehydrating agent comprises water. In some embodiments, the buffer comprises phosphate buffered saline. In some embodiments, dissociating tissue is performed using an automatic dissociator or a manual morcellator. In some embodiments, the filter has a pore size of about 70 microns.

Another aspect of the present disclosure provides a non-transitory computer-readable medium containing machine-executable code that, when executed by one or more computer processors, implements any of the methods above or elsewhere herein.

Another aspect of the present disclosure provides a system comprising one or more computer processors and computer memory coupled thereto. The computer memory includes machine executable code that when executed by one or more computer processors implements any of the methods above or elsewhere herein.

Additional aspects and advantages of the present disclosure will become readily apparent to those skilled in the art from the following detailed description, wherein only illustrative embodiments of the present disclosure are shown and described. As will be realized, the present disclosure is capable of other and different embodiments and its several details are capable of modification in various obvious respects, all without departing from the present disclosure. Accordingly, the drawings and description are to be regarded as illustrative in nature, and not as restrictive.

Incorporated by reference

All publications, patents, and patent applications mentioned in this specification are herein incorporated by reference to the same extent as if each individual publication, patent, or patent application was specifically and individually indicated to be incorporated by reference. In the event that publications and patents or patent applications incorporated by reference contradict the disclosure contained in this specification, this specification is intended to supersede and/or take precedence over any such contradictory material.

Drawings

The novel features of the invention are set forth with particularity in the appended claims. A better understanding of the features and advantages of the present invention will be obtained by reference to the following detailed description that sets forth illustrative embodiments, in which the principles of the invention are utilized, and the accompanying drawings (also referred to herein as "figures") of which:

Fig. 1 shows an example of a microfluidic channel structure for separating individual biological particles.

Fig. 2 shows an example of a microfluidic channel structure for controlled separation of beads into discrete droplets.

Fig. 3 shows an example of a bead carrying a bar code.

Fig. 4 shows another example of a bead carrying a bar code.

Fig. 5 schematically illustrates an example microwell array.

FIG. 6 schematically shows an example workflow for processing nucleic acid molecules.

FIG. 7 schematically illustrates another example workflow for processing nucleic acid molecules.

FIG. 8 schematically shows another example workflow for processing nucleic acid molecules.

FIG. 9 schematically illustrates another example workflow for processing nucleic acid molecules.

Fig. 10 schematically shows an example workflow for analyzing cells, nuclei or cell beads.

FIG. 11 schematically illustrates an example labeling agent with nucleic acid molecules attached.

Fig. 12A schematically shows an example of the marking agent. FIG. 12B schematically illustrates another example workflow for processing nucleic acid molecules. FIG. 12C schematically illustrates another example workflow for processing nucleic acid molecules.

Fig. 13 schematically shows another example of a bead carrying a bar code.

FIG. 14 illustrates a computer system programmed or otherwise configured to implement the methods provided herein.

FIG. 15 shows an example treated nucleic acid molecule described herein.

FIG. 16A illustrates an example workflow for processing multiple analytes in one partition.

FIG. 16B illustrates another example workflow for processing multiple analytes in one partition.

Fig. 17 schematically illustrates a feature binding group described herein.

FIG. 18 illustrates example data from a workflow described herein.

Fig. 19 shows additional example data from the workflow described herein.

FIG. 20 illustrates additional example data from the workflow described herein.

FIG. 21A shows exemplary data comparing fixed cells to non-fixed cells. Fig. 21B shows additional example data comparing fixed cells and non-fixed cells. Fig. 21C shows additional example data comparing fixed cells and non-fixed cells.

FIG. 22 schematically illustrates an example workflow for determining two different analyte types.

FIG. 23 illustrates example data for the bar code adding method described herein.

FIG. 24 shows example data for different analyte types using the barcoding methods described herein.

FIG. 25 schematically illustrates an example method for processing nucleic acid molecules.

FIG. 26 illustrates another example method for processing nucleic acid molecules.

FIG. 27 shows an example workflow for generating probe-linked nucleic acid molecules.

FIG. 28 shows another example workflow for generating probe-linked nucleic acid molecules.

FIG. 29 illustrates an example workflow for processing cells according to the methods described herein.

FIG. 30A shows exemplary protein expression data obtained for multiple analytes using different sample preparation parameters. FIG. 30B shows additional protein expression data obtained using different sample preparation parameters for multiple analytes with barcodes.

FIG. 31 shows exemplary gene expression data obtained using different sample preparation parameters for multiple analytes with barcodes.

Fig. 32A-C show example data for multi-analyte detection for a negative control group. Fig. 32A shows example data showing different immune cell clusters. Fig. 32B shows example data of gene expression of GZMB gene. Fig. 32C shows exemplary data for protein expression resulting from antibody staining.

Fig. 33A-C show example data for multi-analyte detection for an experimental set. Fig. 33A shows example data showing different immune cell clusters. Fig. 33B shows example data of gene expression of GZMB gene. Fig. 33C shows exemplary data for protein expression resulting from antibody staining.

Fig. 34A-C show example data for multi-analyte detection for an experimental set. Fig. 34A shows example data showing different immune cell clusters. Fig. 34B shows example data of gene expression of GZMB gene. Fig. 34C shows exemplary data for protein expression resulting from antibody staining.

Fig. 35A-C show example data for multi-analyte detection for an experimental set. Fig. 35A shows example data showing different immune cell clusters. Fig. 35B shows example data of gene expression of GZMB gene. Fig. 35C shows exemplary data for protein expression resulting from antibody staining.

Fig. 36A-C show example data for multi-analyte detection for an experimental set. Fig. 35A shows example data showing different immune cell clusters. Fig. 36B shows example data of gene expression of GZMB gene. Fig. 36C shows exemplary data for protein expression resulting from antibody staining.

Fig. 37A-C show example data for multi-analyte detection for an experimental set. Fig. 37A shows example data showing different immune cell clusters. Fig. 37B shows example data of gene expression of GZMB gene. Fig. 37C shows exemplary data for protein expression resulting from antibody staining.

FIG. 38 illustrates another example workflow for determining two different analyte types.

Fig. 39A to 39E are examples of time-series data of fixed samples.

FIG. 40 shows a flow chart of a method for analyzing embedded immobilized tissue nucleic acid molecules according to some embodiments.

FIG. 41 illustrates an example of a nucleic acid profiling workflow according to some embodiments.

FIG. 42 shows an example of a nucleic acid profiling workflow according to some embodiments.

FIG. 43 illustrates an example of a plurality of probe molecules according to some embodiments.

Fig. 44-46 illustrate examples of methods of preparing a fixed sample according to some embodiments.

Fig. 47 provides a table of the pass percentages and other characteristics of the various procedures of the present example, according to some embodiments.

FIG. 48 illustrates an example of various organizations processed by the workflows of FIGS. 44-46, according to some embodiments.

FIGS. 49A-49D illustrate example cell type clusters for single cell cancer fixed tissue.

Fig. 50A and 50B illustrate an example tissue dissociation using a pestle dissociation.

FIG. 51 shows an example GENTLEMACS ^TM procedure for tissue dissociation.

Fig. 52A shows an example of representative cells after dissociation. FIG. 52B shows an example of representative cells after post-hybridization washes.

FIG. 53 shows example cell yields from fixed tissue sections.

Detailed Description

While various embodiments of the present invention have been shown and described herein, it will be obvious to those skilled in the art that such embodiments are provided by way of example only. Numerous variations, changes, and substitutions will now occur to those skilled in the art without departing from the invention. It should be understood that various alternatives to the embodiments of the invention described herein may be employed.

Where values are described as ranges, it is understood that such disclosure includes disclosure of all possible sub-ranges within such ranges, as well as specific values falling within such ranges, whether or not the specific values or sub-ranges are explicitly stated.

The terms "a," "an," and "the" as used herein generally refer to both singular and plural referents unless the context clearly dictates otherwise.

Whenever the term "at least", "greater than" or "greater than or equal to" precedes the first value in a series of two or more values, the term "at least", "greater than" or "greater than or equal to" applies to each value in the series. For example, 1,2, or 3 or more corresponds to 1 or more, 2 or 3 or more.

Whenever the term "no more," "less than," or "less than or equal to" precedes the first value in a series of two or more values, the term "no more," "less than," or "less than or equal to" applies to each value in the series. For example, less than or equal to 3, 2, or 1 corresponds to less than or equal to 3, less than or equal to 2, or less than or equal to 1.

As used herein, the terms "about" and "approximately" when preceded by a numerical value indicate the value plus or minus the range of 10%. For example, about 10 may be reasonably understood to mean 9, 10, or 11, or a numerical range from 9 to 11. The term "about" or "approximately" applies to each value in a series of two or more values whenever "about" or "approximately" precedes the first value in the series.

As used herein, the term "barcode" generally refers to a label or identifier that conveys or is capable of conveying information about an analyte. The barcode may be part of the analyte. The barcode may be independent of the analyte. The barcode may be a tag attached to an analyte (e.g., a nucleic acid molecule) or a combination of the tag plus an inherent property of the analyte (e.g., the size of the analyte or terminal sequence). Bar codes may be unique. Bar codes can take a number of different forms. For example, barcodes may include polynucleotide barcodes, random nucleic acid and/or amino acid sequences, and synthetic nucleic acid and/or amino acid sequences. The barcode can be attached to the analyte in a reversible or irreversible manner. The barcode may be added to a fragment of, for example, a deoxyribonucleic acid (DNA) or ribonucleic acid (RNA) sample before, during, and/or after sequencing of the sample. Bar codes may allow for identification and/or quantification of individual sequencing reads.

As used herein, the term "real-time" may refer to a response time of less than about 1 second, one tenth second, one hundredth second, one millisecond or less. The response time may be greater than 1 second. In some cases, real-time may refer to simultaneous or substantially simultaneous processing, detection, or identification.

As used herein, the term "subject" generally refers to an animal such as a mammal (e.g., human, mouse, rat) or an avian (e.g., bird), or other organism such as a plant. For example, the subject may be a vertebrate, such as a mammal, rodent (e.g., mouse), primate, ape, or human. Animals may include, but are not limited to, farm animals, sports animals, and pets. The subject may be a healthy or asymptomatic individual, an individual who has or is suspected of having a disease (e.g., cancer) or is susceptible to the disease, and/or an individual in need of treatment or suspected of being in need of treatment. The subject may be a patient. The subject may be a microorganism or microbe (e.g., bacteria, fungi, archaea, viruses).

As used herein, the term "genome" generally refers to genomic information from a subject, which may be, for example, at least a portion or all of the genetic information of the subject. The genome may be encoded in DNA or RNA. The genome may comprise coding regions (e.g., encoding a protein) as well as non-coding regions. The genome may comprise sequences of all chromosomes together in an organism. For example, the human genome typically has a total of 46 chromosomes. The sequence of all these chromosomes together may constitute the human genome.

The terms "adapter", "adapter" and "tag" may be used synonymously. The adaptors or tags may be coupled to the polynucleotide sequences to be "tagged" by any method, including ligation, hybridization or other methods.

As used herein, the term "sequencing" generally refers to methods and techniques for determining the sequence of nucleotide bases in one or more polynucleotides. These polynucleotides may be, for example, nucleic acid molecules such as deoxyribonucleic acid (DNA) or ribonucleic acid (RNA), including variants or derivatives thereof (e.g., single stranded DNA). Sequencing may be performed by various systems currently available, such as, but not limited toPacific BiosciencesOxfordOr Life Technologies (Ion)) A sequencing system produced. Alternatively or in addition, sequencing may be performed using nucleic acid amplification, polymerase Chain Reaction (PCR) (e.g., digital PCR, quantitative PCR, or real-time PCR), or isothermal amplification. Such systems can provide a plurality of raw genetic data corresponding to genetic information of a subject (e.g., a human) as generated by the systems from a sample provided by the subject. In some examples, such systems provide sequencing reads (also referred to herein as "reads"). Reads may include a sequence of nucleobases corresponding to the sequence of a nucleic acid molecule that has been sequenced. In some cases, the systems and methods provided herein may be used with proteome information.

As used herein, the term "bead" generally refers to a particle. The beads may be solid or semi-solid particles. The beads may be gel beads. The gel beads may include a polymer matrix (e.g., a matrix formed by polymerization or cross-linking). The polymer matrix may include one or more polymers (e.g., polymers having different functional groups or repeating units). The polymers in the polymer matrix may be randomly arranged, for example in a random copolymer, and/or have an ordered structure, for example in a block copolymer. Crosslinking may be via covalent, ionic or induced interactions or physical entanglement. The beads may be macromolecules. Beads may be formed from nucleic acid molecules that are bound together. Beads may be formed via covalent or non-covalent assembly of molecules (e.g., macromolecules) such as monomers or polymers. Such polymers or monomers may be natural or synthetic. Such polymers or monomers may be or include, for example, nucleic acid molecules (e.g., DNA or RNA). The beads may be formed of a polymeric material. The beads may be magnetic or non-magnetic. The beads may be rigid. The beads may be flexible and/or compressible. The beads may be destructible or dissolvable. The beads may be solid particles (e.g., metal-based particles including, but not limited to, iron oxide, gold, or silver) covered with a coating comprising one or more polymers. Such coatings may be destructible or dissolvable.

As used herein, the term "barcoded nucleic acid molecule" generally refers to a nucleic acid molecule resulting from, for example, processing a nucleic acid barcode molecule with a nucleic acid sequence (e.g., a nucleic acid sequence that is complementary to a nucleic acid primer sequence comprised by the nucleic acid barcode molecule). The nucleic acid sequence may be a targeting sequence or a non-targeting sequence. For example, in the methods and systems described herein, a nucleic acid molecule (e.g., a messenger RNA (mRNA) molecule) of a cell or cell nucleus is hybridized and reverse transcribed with a nucleic acid barcode molecule (e.g., a nucleic acid barcode molecule containing a barcode sequence and a nucleic acid primer sequence complementary to the nucleic acid sequence of the mRNA molecule) to produce a barcoded nucleic acid molecule having a sequence corresponding to the nucleic acid sequence of the mRNA and the barcode sequence (or the reverse complement thereof). The barcoded nucleic acid molecules can serve as templates, such as template polynucleotides, which can be further processed (e.g., amplified) and sequenced to obtain target nucleic acid sequences. For example, in the methods and systems described herein, the barcoded nucleic acid molecules can be further processed (e.g., amplified) and sequenced to obtain the nucleic acid sequence of the mRNA.

As used herein, the term "sample" generally refers to a biological sample of a subject. The biological sample may comprise any number of macromolecules, such as cellular macromolecules. The sample may be a cell sample. The sample may be a cell line or a cell culture sample. The sample may comprise one or more cells or nuclei. The sample may comprise one or more microorganisms. The biological sample may be a nucleic acid sample or a protein sample. The biological sample may also be a carbohydrate sample or a lipid sample. The biological sample may be derived from another sample. The sample may be a tissue sample, such as a biopsy sample, core needle biopsy sample, needle aspirate, or fine needle aspirate. The tissue sample may be a fresh tissue sample, a frozen tissue sample (e.g., flash frozen, freeze-dried, frozen sections, etc.), or a fixed tissue sample (e.g., a formalin-fixed paraffin-embedded tissue sample). The sample may be a fluid sample, such as a blood sample, a urine sample, or a saliva sample. The sample may be a skin sample. The sample may be a cheek swab. The sample may be a plasma or serum sample. The sample may be a cell-free sample or a cell-free sample. The cell-free sample may comprise extracellular polynucleotides. The extracellular polynucleotides may be isolated from a body sample, which may be selected from the group consisting of blood, plasma, serum, urine, saliva, mucosal secretions, sputum, stool, and tears.

As used herein, the term "biological particle" generally refers to a discrete biological system derived from a biological sample. The biological particles may be macromolecules. The biological particles may be small molecules. The biological particle may be a virus. The biological particles may be cells or derivatives of cells. The biological particles may be organelles. Examples of organelles from cells include, but are not limited to, nuclei, ribosomes, golgi bodies, endoplasmic reticulum, chloroplasts, endocytic vesicles, exocytosis vesicles, vacuoles, and lysosomes. The biological particles may be rare cells from a population of cells. The biological particles can be any type of cell including, but not limited to, prokaryotic cells, eukaryotic cells, bacteria, fungi, plants, mammalian or other animal cell types, mycoplasma, normal tissue cells, tumor cells, or any other cell type whether derived from a single-cell organism or a multicellular organism. The biological particles may be a component of a cell. The biological particles may be or may include DNA, RNA, organelles, proteins, or any combination thereof. The biological particles may be or include a matrix (e.g., a gel or polymer matrix) comprising cells or one or more components from cells (e.g., cell beads), such as DNA, RNA, organelles, proteins, or any combination thereof from cells. The biological particles can be obtained from tissue of a subject (e.g., human, mouse, rat, or other mammal). The biological particles may be hardened cells. Such hardened cells may or may not include cell walls or cell membranes. The biological particles may include one or more components of the cell, but may not include other components of the cell. Examples of such components are nuclei or organelles. The cells may be living cells. Living cells may be capable of being cultured, for example, when enclosed in a gel or polymer matrix, or when comprising a gel or polymer matrix.

As used herein, the term "macromolecular composition" generally refers to macromolecules contained within or derived from a biological particle. The macromolecular composition may comprise a nucleic acid. In some cases, the biological particles may be macromolecules. The macromolecular composition may comprise DNA. The macromolecular composition may comprise RNA. The RNA may be encoded or non-encoded. The RNA may be, for example, messenger RNA (mRNA), ribosomal RNA (rRNA), or transfer RNA (tRNA). The RNA may be a transcript. The RNA may be a small RNA less than 200 nucleobases in length, or a large RNA greater than 200 nucleobases in length. The micrornas can include 5.8S ribosomal RNAs (rrnas), 5S rrnas, transfer RNAs (trnas), micrornas (mirnas), small interfering RNAs (sirnas), small nucleolar RNAs (snornas), RNAs that interact with Piwi proteins (pirnas), tRNA-derived micrornas (tsrnas), and small rDNA-derived RNAs (srrnas). The RNA may be double-stranded RNA or single-stranded RNA. The RNA may be circular RNA. The macromolecular composition may comprise a protein. The macromolecular composition may comprise a peptide. The macromolecular composition may comprise a polypeptide.

As used herein, the term "molecular tag" generally refers to a molecule capable of binding to a macromolecular component. Molecular tags can bind to macromolecular components with high affinity. Molecular tags can bind to macromolecular components with high specificity. The molecular tag may comprise a nucleotide sequence. The molecular tag may comprise a nucleic acid sequence. The nucleic acid sequence may be at least a portion or all of a molecular tag. The molecular tag may be a nucleic acid molecule or may be part of a nucleic acid molecule. The molecular tag may be an oligonucleotide or a polypeptide. The molecular tag may comprise a DNA aptamer. The molecular tag may be or comprise a primer. The molecular tag may be or comprise a protein. The molecular tag may comprise a polypeptide. The molecular tag may be a barcode.

As used herein, the term "partition" generally refers to a space or volume that may be suitable for containing one or more species or carrying out one or more reactions. The partitions may be physical compartments such as droplets or holes. A partition may isolate a space or volume from another space or volume. The droplets may be a first phase (e.g., an aqueous phase) in a second phase (e.g., oil) that is immiscible with the first phase. The droplets may be a first phase in a second phase that is not phase separated from the first phase, such as capsules or liposomes in an aqueous phase. A partition may include one or more other (internal) partitions. In some cases, a partition may be a virtual compartment, which may be defined and identified by an index (e.g., an index library) that spans multiple and/or remote physical compartments. For example, the physical compartment may include a plurality of virtual compartments.

As used herein, the term "solid medium" generally refers to a non-gaseous or non-liquid material. The solid medium may include long chain hydrophobic compounds, glycerin, gels, hydrogels, epoxy resins, or any combination thereof. The long chain hydrophobic compound may include a wax, such as paraffin wax.

Provided herein are methods for sample processing and/or analysis. The methods of the present disclosure may include adding a barcode to one or more types of biomolecules (e.g., nucleic acid molecules, proteins, lipids, carbohydrates, or combinations thereof). The biomolecule may be, for example, a nucleic acid molecule (e.g., a ribonucleic acid (RNA) molecule) or a protein. Such methods may involve attaching one or more probes (e.g., nucleic acid probes) to a biomolecule, followed by attaching a nucleic acid barcode molecule comprising a barcode sequence to the one or more probes. For example, a nucleic acid barcode molecule may be attached to an overhang sequence of a probe or to the end of a probe. An extended nucleic acid molecule may be formed that extends from one end of the probe to one end of the nucleic acid barcode molecule, the extended nucleic acid molecule comprising a sequence complementary to the barcode sequence and a sequence complementary to a target region of the nucleic acid molecule. The extended nucleic acid molecule can then be denatured from the nucleic acid barcode molecule, and the nucleic acid molecule can be replicated. One or more of the processes of the method may be performed within a zone (such as a droplet or a well).

The present disclosure also provides a method of processing a sample (e.g., a cell sample or a tissue sample) that provides a barcoded nucleic acid molecule to which is attached a linked probe molecule. The method can include providing a sample comprising a nucleic acid molecule (e.g., an RNA molecule) having a first target region and a second target region, a first probe having (i) a first probe sequence complementary to the first target region and (ii) an additional probe sequence, and a second probe having a second probe sequence complementary to the second target region. In some cases, the first target region and the second target region are adjacent. The first and second probe sequences may also comprise a first reactive moiety and a second reactive moiety, respectively. The reactive moieties may be adjacent to each other when the first probe sequence of the first probe hybridizes to a first target region of the nucleic acid molecule and the second probe sequence of the second probe hybridizes to a second target region of the nucleic acid molecule. Subsequent reactions between adjacent reactive moieties under sufficient conditions can ligate the first probe and the second probe to produce a probe-ligated nucleic acid molecule. The probe-linked nucleic acid molecule (probe-linked nucleic acid molecule) may also be referred to as a probe-conjugated nucleic acid molecule (probe-ligated nucleic acid molecule). In other cases, the first target region and the second target region are not adjacent, and a nucleic acid reaction (e.g., a nucleic acid extension reaction, a gap filling reaction) can be performed to produce a probe-linked nucleic acid molecule.

The barcode sequence of the nucleic acid barcode molecule may be used to barcode the probe-linked nucleic acid molecule to provide a barcode-tagged, probe-linked nucleic acid molecule. Barcoding may be accomplished by hybridizing the binding sequence of a nucleic acid barcode molecule to an additional probe sequence of a first probe of the probe-linked nucleic acid molecule. The barcoded, probe-linked nucleic acid molecules can be subjected to an amplification reaction to produce an amplification product comprising the first and second target regions and a barcode sequence or sequences complementary to these sequences. Thus, the method can provide an amplification product without using reverse transcription. One or more processes may be performed within a zone (such as a droplet or a well).

The present disclosure also provides a method of generating a barcoded, probe-linked nucleic acid molecule. The method can include providing a sample comprising a nucleic acid molecule (e.g., an RNA molecule) having a first target region and a second target region, a first probe having a first probe sequence complementary to the first target region and optionally an additional probe sequence, and a second probe having a second probe sequence complementary to the second target region. The additional probe sequence of the first probe may comprise a probe capture sequence. Alternatively or in addition, the second probe may comprise a probe capture sequence. The first probe sequence of the first probe can hybridize to a first target region of a nucleic acid molecule, producing a probe-associated nucleic acid molecule, and a nucleic acid reaction (e.g., a nucleic acid extension reaction using a polymerase or reverse transcriptase) can be performed to produce an extended nucleic acid molecule comprising a sequence complementary to the second target region. The second probe may hybridize to the nucleic acid molecule (or the extended nucleic acid molecule or its complement) before, during, or after the nucleic acid extension reaction, and optionally, the nucleic acid extension reaction may be performed. The extended nucleic acid molecule may be barcoded, such as by (a) hybridization of a barcode binding sequence of the nucleic acid barcode molecule to a first probe (e.g., an additional probe sequence of the first probe) or a second probe (e.g., a probe capture sequence of the second probe), or (b) via a probe binding molecule (also referred to herein as a "splint molecule" or "splint oligonucleotide"), wherein the probe binding molecule comprises (i) a probe binding sequence that is complementary to an additional probe sequence of the first probe (which may comprise a probe capture sequence) and/or a capture sequence of the second probe, and (ii) a barcode binding sequence that is complementary to a sequence of the barcode molecule (e.g., a common sequence). In some cases, the barcoding may be performed before hybridization of the second probe to the second target region. In such cases, the barcoded nucleic acid molecule can be subjected to conditions sufficient to hybridize a second probe sequence of a second probe to a second target region of the nucleic acid molecule (or the barcoded nucleic acid molecule). A nucleic acid reaction (e.g., nucleic acid extension) can be performed to produce a barcoded, probe-linked nucleic acid molecule.

Another aspect of the present disclosure provides a method of barcoding a plurality of analytes, such as the probe-linked nucleic acid molecules described herein, as well as other types of biomolecules (e.g., proteins). The method can include providing (i) a sample comprising a nucleic acid molecule (e.g., an RNA molecule) having a first target region and a second target region, and (ii) a characteristic binding portion comprising a reporter oligonucleotide having a capture sequence, (iii) a first probe having a first probe sequence complementary to the first target region and an additional probe sequence, (iv) a second probe having a second probe sequence complementary to the second target region, and (v) a third probe having a third probe sequence complementary to the sequence of the reporter oligonucleotide. The first probe and the second probe may be subjected to conditions sufficient to hybridize to the first target region and the second target region, respectively, and to generate a probe-linked nucleic acid molecule. The third probe sequence of the third probe may be subjected to conditions sufficient to hybridize to the capture sequence of the reporter oligonucleotide, thereby producing a probe-binding moiety complex. The probe-linked nucleic acid molecule and probe-binding moiety complex may be subjected to conditions sufficient to barcode, thereby producing a barcoded, probe-linked nucleic acid molecule and barcoded probe-binding moiety complex. The barcoded, probe-linked molecules can be subjected to an amplification reaction to produce an amplification product comprising the first and second target regions and a barcode sequence or sequences complementary to these sequences. The barcoded probe-binding moiety complex can similarly be subjected to an amplification reaction to produce an amplification product comprising a fourth probe sequence and a barcode sequence. One or more processes may be performed within a cell bead and/or a partition (such as a droplet or well). Advantageously, the methods described herein can be used to index cells, nuclei, or cell beads into a partition, such index can be used in a partition occupied by more than one cell, and identify the cell, nucleus, cell bead, or partition from which the analyte originated.

Immobilized sample

The sample may be a fixed sample. For example, the sample may comprise a plurality of immobilized samples, such as a plurality of immobilized cells or immobilized nuclei. Alternatively or in addition, the sample may comprise fixed tissue. The fixation of a cell or cell component or tissue comprising a plurality of cells or nuclei may comprise the application of a chemical species or chemical stimulus. The term "immobilized" as used herein with respect to biological samples generally refers to a state that prevents decay and/or degradation. "immobilization" generally refers to a process of producing an immobilized sample, and in some cases may include contacting biomolecules within a biological sample with an immobilization agent (or immobilization reagent) for a certain amount of time, whereby the immobilization agent causes covalent bond interactions, such as cross-linking, between the biomolecules in the sample. "immobilized biological sample" may generally refer to a biological sample that has been contacted with an immobilization reagent or fixative. For example, formaldehyde-immobilized biological samples have been contacted with the immobilization reagent formaldehyde. "fixed cells", "fixed nuclei" or "fixed tissue" refers to cells/nuclei or tissue that have been contacted with a fixative under conditions sufficient to allow or cause intramolecular and intermolecular covalent crosslinks to form between biomolecules in a biological sample. In general, contact of a biological sample (e.g., a cell or nucleus) with an immobilizing reagent (e.g., paraformaldehyde or PFA) causes intramolecular and intermolecular covalent crosslinks to form between biomolecules in the biological sample. In some cases, the immobilization reagent formaldehyde can produce covalent aminal crosslinks within RNA, DNA, and/or protein molecules. For example, the widely used fixative reagents paraformaldehyde or PFA fix tissue samples by catalyzing the formation of crosslinks between basic amino acids (such as lysine and glutamine) in proteins. Intramolecular and intermolecular crosslinks can be formed in the protein. These crosslinks preserve the secondary structure of the protein and also eliminate enzymatic activity in the preserved tissue sample. Examples of fixative agents include, but are not limited to, aldehyde fixatives (e.g., formaldehyde, also commonly referred to as "paraformaldehyde," "PFA," and "formalin"; glutaraldehyde; etc.), imide esters, NHS (N-hydroxysuccinimide) esters, and the like.

In some embodiments, the fixative or fixative reagent useful for fixing the sample is formaldehyde. The term "formaldehyde" when used in the context of fixatives may also refer to "paraformaldehyde" (or "PFA") and "formalin," both of which have particular meanings in connection with formaldehyde compositions (e.g., formalin is a mixture of formaldehyde and methanol). Thus, formaldehyde-fixed biological samples may also be referred to as formalin-fixed or PFA-fixed. Protocols and methods for preparing immobilized biological samples using formaldehyde as an immobilization reagent are well known in the art and can be used in the methods and compositions of the present disclosure. For example, suitable ranges for formaldehyde concentrations for preparing the immobilized biological samples are 0.1% -10%, 1% -8%, 1% -4%, 1% -2%, 3% -5% or 3.5% -4.5%. In some embodiments of the present disclosure, a final concentration of 1%, 4% or 10% formaldehyde is used to immobilize the biological sample. Formaldehyde may be diluted from a more concentrated stock solution (e.g., 35%, 25%, 15%, 10%, 5% PFA stock solution).

Other examples of fixatives include, for example, organic solvents such as alcohols (e.g., methanol or ethanol), ketones (e.g., acetone), and aldehydes (e.g., paraformaldehyde, formaldehyde (e.g., formalin), or glutaraldehyde). As described herein, cross-linking agents may also be used for fixation, including but not limited to bis-succinimidyl suberate (DSS), dimethyl suberimide (DMS), formalin and dimethyl adipimide (DMA), dithio-bis (succinimidyl propionate) (DSP), disuccinimidyl tartrate (DST), and ethylene glycol bis (succinimidyl succinate) (EGS). In some cases, the crosslinking agent may be a cleavable crosslinking agent (e.g., thermally cleavable, photocleavable, etc.).

In some cases, when preparing immobilized biological samples, more than one immobilization reagent may be used in combination. For example, a first fixing agent (such as an organic solvent) may be used in combination with a second fixing agent (such as a crosslinking agent). The organic solvent may be an alcohol (e.g., ethanol or methanol), a ketone (e.g., acetone), or an aldehyde (e.g., paraformaldehyde, formaldehyde, or glutaraldehyde). The crosslinking agent may be selected from the group consisting of bis-succinimidyl suberate (DSS), dimethyl suberimide (DMS), formalin and dimethyl adipimide (DMA), dithio-bis (succinimidyl propionate) (DSP), disuccinimidyl tartrate (DST), and ethylene glycol bis (succinimidyl succinate) (EGS). In some cases, a first fixative may be provided to or in contact with the cell or cell nucleus to cause a change in a first property or group of properties of the cell/cell nucleus, and a fixative may be provided to or in contact with the cell or cell nucleus to cause a change in a second property or group of properties of the cell or cell nucleus. For example, a first fixative may be provided to or contacted with the cell or cell nucleus to cause a change in cell size (e.g., a reduction in cross-sectional diameter, see, e.g., U.S. patent publication No. 2020/0033237, which is incorporated herein by reference in its entirety), and a second fixative may be provided to or contacted with the cell or cell nucleus to cause a change in a second property or group of properties of the cell (e.g., formation of crosslinks within and/or around the cell or cell nucleus). The first fixative and the second fixative may be provided to the cell or cell nucleus or contacted with the cell or cell nucleus at the same or different times. Other suitable fixatives include those disclosed, for example, in International PCT application No. PCT/US2020/066705, which is incorporated herein by reference in its entirety.

In one example, a first fixative may be provided to the cells as an organic solvent to alter a first characteristic (e.g., cell size), and a second fixative may be provided to the cells as a cross-linking agent to alter a second characteristic (e.g., cell fluidity or rigidity). The first fixative may be provided to the cells prior to the second fixative.

In another embodiment, a biomolecule (e.g., a biological sample, such as a tissue specimen) is contacted with an immobilization reagent comprising formaldehyde and glutaraldehyde, so that the contacted biomolecule may include immobilization crosslinks resulting from formaldehyde-induced immobilization and glutaraldehyde-induced immobilization. Suitable concentrations of glutaraldehyde which may be used as the fixing reagent may be 0.1% to 1%. The fixing and washing reagents may also comprise commercially available products, e.gA fixation buffer (420801) and a permeabilization wash buffer (421002).

The change in a property or set of properties of the cell or cell component (e.g., caused upon interaction with one or more fixatives) may be at least partially reversible (e.g., via rehydration or de-crosslinking). Alternatively, a property or a change in a set of properties of a cell or cell component (e.g., caused upon interaction with one or more fixatives) may be substantially irreversible.

Samples (e.g., cell samples) may be subjected to a fixation process at any useful point in time. For example, the cells, nuclei, and/or cell/nucleus components of the sample may be subjected to a fixation process involving one or more fixatives (e.g., as described herein) prior to the initiation of any subsequent processing (such as for storage). The cells, nuclei, and/or cell/nuclear components subjected to the fixation process prior to storage (such as cells, nuclei, and/or cell/nuclear components of a tissue sample) may be stored in an aqueous solution, optionally in combination with one or more preservatives configured to preserve the morphology, size, or other characteristics of the cells and/or cell components. The immobilized cells, nuclei, and/or cell/nucleus components may be stored at below room temperature, such as in a refrigerator. Alternatively, the cells, nuclei and/or cell/nuclear components of the sample may be subjected to an immobilization process involving one or more fixatives after one or more other processes, such as filtration, centrifugation, agitation, selective precipitation, purification, permeabilization, separation, heating, etc. For example, a given type of cell, nucleus, and/or cell/nucleus component from a sample may be subjected to a fixation process after an isolation and/or enrichment procedure (e.g., as described herein). In one example, a sample comprising a plurality of cells (including a plurality of cells of a given type) may be subjected to a forward separation process to provide a sample enriched in the plurality of cells of the given type. The enriched sample may then be subjected to an immobilization process involving one or more fixatives (e.g., as described herein) to provide an enriched sample comprising a plurality of fixed cells. The immobilization process may be performed in a bulk solution. In some cases, the immobilized sample (e.g., immobilized cells, immobilized nuclei, and/or cell/nucleus components) can be separated in multiple partitions (e.g., droplets or wells) and subjected to a treatment as described elsewhere herein. In some cases, the immobilized sample may undergo additional processing prior to separation and any subsequent processing, such as partial or complete reversal of the immobilization process by, for example, rehydration or de-crosslinking. In some cases, the immobilized sample may undergo a partial or complete reversal of the immobilization process within multiple partitions (e.g., prior to or concurrent with additional processing described elsewhere herein).

In some cases, tissue specimens comprising multiple cells, nuclei, and/or cell/nucleus components may be processed to provide formalin-fixed paraffin embedded (FFPE) tissue. The tissue specimen may be contacted with (e.g., saturated with) formalin and then embedded in a solid medium. In one example, the solid medium may be paraffin. FFPE processing may facilitate preservation of tissue samples (e.g., prior to subsequent processing and analysis). Tissue samples, including FFPE tissue samples, may additionally or alternatively be stored in a cryogenic refrigerator. The cells, nuclei, and/or cell/nucleus components can be dissociated from the tissue sample (e.g., FFPE tissue sample) prior to undergoing subsequent processing. In some cases, individual cells, nuclei, and/or cell/nucleus components of a tissue sample (such as an FFPE tissue sample) may be optically detected, labeled, or otherwise processed prior to any such dissociation. Such detection, labeling or other processing may be performed according to a two-or three-dimensional array and optionally according to a predetermined pattern. In some cases, tissue specimens may be embedded in other materials, such as Optimal Cutting Temperature (OCT) compounds, cross-linked-based supports (e.g., polymers), and the like.

The tissue specimen may have been fixed for at least about 1 day (d), 2d, 3d, 4d, 5d, 6d, 1 week (wk), 2wk, 3wk, 1 month (m), 2m, 3m, 4m, 5m, 6m, 7m, 8m, 9m, 10m, 11m, 1 year (y), 2y, 3y, 4y, 5y, 6y, 7y, 8y, 9y, 10y, 15y, 20y, 25y, 30y, 35y, 40y, 45y, 50y, or more, and then used in the methods and systems described elsewhere herein.

In some cases, preparation of the immobilized sample may include immobilization and sectioning (e.g., via sectioning, ultra-thin sectioning, etc.) of at least a portion of the immobilized sample. The fixed sample may be sliced into one or more (e.g., multiple) rolls. The thickness of the fixed tissue (e.g., roll) may be at least about 1, 5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 100 microns or more. The slicing and/or fixing may be performed at ambient temperature. The slicing and/or fixing may be performed at a temperature higher or lower than the ambient temperature. The fixed tissue (e.g., roll) may have a mass of at least about 100 micrograms (μg)、150μg,200μg 250μg,300μg,350μg,400μg,450μg,500μg,550μg,600μg,650μg,700μg,750μg,800μg,850μg,900μg,950μg,1milligram(mg),2mg,3mg,4mg,5mg,6mg,7mg,8mg,9mg,10mg,15mg,20mg,25mg,30mg,35mg,40mg,45mg,50mg,55mg,60mg,65mg,70mg,75mg,80mg,85mg,90mg,95mg,100mg,150mg,200mg,250mg,300mg,350mg,400mg,450mg,500mg,550mg,600mg,650mg,700mg,750mg,800mg,850mg,900mg,950mg,1gram(g),1.25g,1.5g,1.75g,2g,2.25g,2.5g,2.75g,3g,3.25g,3.5g,3.75g,4g,4.25g,4.5g,4.75g,5g,10g,15g,20g,25g,30g,35g,40g,45g,50g or more. The fixed tissue (e.g., roll) may have a mass of up to about 50g,45g,40g,35g,30g,25g,20g,15g,10g,5g,4.75g,4.5g,4.25g,4g,3.75g,3.5g,3.25g,3g,2.75g,2.5g,2.25g,2g,1.75g,1.5g,1.25g,1g,950mg,900mg,850mg,800mg,750mg,700mg,650mg,600mg,550mg,500mg,450mg,400mg,350mg,300mg,250mg,200mg,150mg,100mg,95mg,90mg,85mg,80mg,75mg,70mg,65mg,60mg,55mg,50mg,45mg,40mg,35mg,30mg,25mg,20mg,15mg,10mg,9mg,8mg,7mg,6mg,5mg,4mg,3mg,2mg,1mg,950μg,900μg,850μg,800μg,750μg,700μg,650μg,600μg,550μg,500μg,450μg,400μg,350μg,300μg,250μg,200μg,150μg,100μg or less. The fixed tissue (e.g., roll) may have a mass within a range defined by any two of the foregoing values. For example, the fixed tissue or roll may have a mass of about 100 micrograms to about 5 grams.

In some cases, the rolls may be mechanically and/or enzymatically dissociated. Mechanical dissociation may include sonication (e.g., sonication at a sub-ambient temperature). The sonication may comprise sonication at a power of at least about 5,10,15,20,25,30,35,40,45,50,55,60,65,70,65,80,85,90,95 or 100% power. Sonication may include sonication at a power of up to about 100,95,90,85,80,75,70,65,60,55,50,45,40,35,30,25,20,15,10,5 or less percent power. The sonication can last for at least about 1,2,3,4,5,6,7,8,9,10,15,30,45,60 minutes or more. Sonication can be continued for up to about 60,45,30,15,10,9,8,7,6,5,4,3,2,1 or less minutes. Mechanical dissociation may include the use of a rocker plate. For example, the sample may be placed in a sample tube, and the sample tube may be shaken on a rocker plate. Mechanical dissociation may include agitating the sample.

In some optional cases, the immobilized sample can be treated to remove one or more fixatives and/or supports (e.g., solid media). For example, the fixed sample may be dewaxed. In some cases, the immobilized sample may not be treated to remove one or more fixatives and/or supports (e.g., solid media). For example, the immobilized sample may be used in the form of a slice (e.g., a roll). Examples of treatments include the use of one or more non-polar solvents (e.g., linear alkanes, cycloalkanes, benzene, xylenes, neo-clear, orange oil, other substituted or unsubstituted alkanes, etc., or any combination thereof). In one example, the nonpolar solvent is xylene. The removal of one or more fixatives and/or supports (e.g., solid media) may be repeated (e.g., to more completely remove one or more fixatives and/or supports). In some optional cases, the sample may be rehydrated (e.g., via addition of water, ethanol rehydration, gas rehydration, etc., or any combination thereof). For example, an ethanol solution of water with an increased concentration of water may be used to rehydrate the sample. In some cases, the sample may be analyzed without rehydration. The sample may be washed with a polar (e.g., aqueous) solution to remove additional impurities. For example, an aqueous solution of phosphate buffered saline may be used to remove impurities from a sample.

One or more dissociation solutions may be added to treat the sample, including, for example, a release enzyme with low pyrolysis (TL) (liberase), a release enzyme with medium pyrolysis (TM), a release enzyme with high pyrolysis (TH), collagenase, and the like, or any combination thereof. The dissociated sample may be added at ambient temperature (e.g., room temperature). The dissociation solution is heated prior to addition. The dissociation solution may be cooled prior to addition. When added, the dissociated sample may be at a temperature of at least about 20,21,22,23,24,25,26,27,28,29,30,31,32,33,34,34,36,37,38,39,40 degrees celsius or higher. When added, the dissociation solution may be at a temperature of up to about 40,39,38,37,36,35,34,33,32,31,30,29,28,27,26,25,24,23,22,21 degrees celsius or less. The dissociation solution may be added at a temperature within the range defined by any two in-progress values. The sample may be titrated at least about 5,10,15,20,25,30,35,40,45,50 times or more to form a cell suspension. In some cases, impurities (e.g., paraffin wax) may be removed by allowing the suspension to stand, and the impurities may precipitate from the solution, and the purified solution may be removed and further processed.

The release enzyme may include one or more enzymes configured to degrade at least a portion of the biomolecule. For example, the release enzyme may comprise one or more collagenases. Collagenases can include one or more isoforms of collagenase, collagenase I, collagenase II, etc., or any combination thereof. The collagenase isoforms may be present in a ratio between the various isoforms of at least about 1:20,1:19,1:18,1:17,1:16,1:15,1:14,1:13,1:12,1:11,1:10,1:9,1:8,1:7,1:6,1:5,1:4,1:3,1:2,1:1,2:1,3:1,4:1,5:1,6:1,7:1,8:1,9:1,10:1,11:1,12:1,13:1,14:1,15:1,16:1,17:1,18:1,19:1,20:1 or higher. In some cases, a composition comprising one or more enzymes may comprise a dispersed enzyme. The dispase may be a protease (e.g., neutral protease, etc.). The dispase may be a non-clostridium dispase. The composition comprising one or more enzymes may comprise a thermolysin. The thermolysin may be a protease (e.g., a neutral protease, etc.). The thermolysin may be a non-clostridium thermolysin. The compositions comprising one or more enzymes may comprise a variety of additional components (e.g., dispase, thermolysin, etc.) as described elsewhere herein.

The sample may be filtered one or more times (e.g., 1,2,3, 4, 5,6,7, 8, 9, 10, or more times). Filtration may include the use of one or more filters of different sizes (e.g., pore sizes). The filter may include a pore size of up to about 500,400,300,200,100,90,80,70,60,50,40,30,20,10,5,1 microns or less. For example, the first filtration may be performed with a 70 micron pore size filter. In this example, a second filtration can be performed with 30 micron filtration, which can reduce debris (e.g., undissolved paraffin or other support) without reducing cell recovery in the sample. The filtrate and cell suspension may be combined and subsequently centrifuged. Centrifugation may occur at values of at least about 100,200,300,400,500,600,700,850,900,950,1,000,1,100,1,200,1,300,1,400,1,500,1,600,1,700,1,800,1,900,2,000,2,100,2,200,2,300,2,400,2,500,3,000, or greater reciprocating centrifugal force (rcf). Centrifugation may occur at values up to about 3,000,2,500,2,400,2,300,2,200,2,100,2,000,1,900,1,800,1,700,1,600,1,500,1,400,1,300,1,200,1,100,1,000,950,900,850,800,700,600,500,400,300,200,100 or less rcf. Centrifugation may be performed at values within a range defined by any two ongoing values. For example, centrifugation may be performed at a value of about 850 to about 2,000 rcf.

In some cases, the solution may be removed from the centrifuged pellet (e.g., without disturbing the pellet). In some cases, the pellet may be resuspended into solution. For example, the pellet may be resuspended in a buffer solution.

The resuspended solution can then be analyzed (e.g., to determine cell concentration). Examples of cell concentration determination systems include, but are not limited to, countess II FL automatic cell counter, cellaca MX high throughput automatic cell counter, etc. using fluorescent dyes (e.g., ethidium homodimer-1, etc.) or AO/PI staining solutions, etc. The resuspended solution can then be used as a sample for the methods and systems described elsewhere herein (e.g., RNA profiling, etc.).

In some cases, a fixed sample (e.g., FFPE sample) using the methods and systems described elsewhere herein may provide different information than using a fresh sample. For example, the immobilization process may capture transient states of cells in the sample and/or transient types of cells in the sample that may not be captured in a fresh sample. Thus, by using a fixed sample, the cellular process can be studied in different ways. In some cases, the use of the methods and systems described elsewhere herein on a fixed sample may provide unexpected improvements to the analysis of the fixed sample, such as improved sensitivity to the aforementioned analysis of different transient conditions/types within the sample relative to other analysis methods. In some cases, the use of a fixed sample may allow time series analysis of the sample (e.g., the sample may be fixed at different times, and analyzed later). Such time series analysis may provide information related to the state within the sample and/or the evolution of the cell type.

Nucleic acid analysis method

In one aspect, the present disclosure provides a method of barcoding a nucleic acid molecule. The method may generally include contacting a nucleic acid molecule with a pair of probes and a barcode molecule to produce a barcoded molecule (e.g., a barcoded, probe-linked molecule). The nucleic acid molecule may comprise a sequence corresponding to a target sequence or a template sequence. One or more nucleic acid reactions (e.g., ligation, nucleic acid extension reactions, amplification, etc.) can be performed to produce a barcoded molecule. In some aspects, the method includes contacting a nucleic acid molecule with a first probe to produce a probe-associated nucleic acid molecule, wherein the nucleic acid molecule comprises a first target region and a second target region, wherein the first probe comprises a first probe sequence complementary to the first target region, performing a nucleic acid reaction (e.g., a nucleic acid extension reaction, such as by using a polymerase or reverse transcriptase, etc.) to produce an extended probe molecule comprising a sequence complementary to the second target region, providing (i) a second probe comprising a second probe sequence corresponding to or complementary to the second target region, and (ii) a nucleic acid barcode molecule, and subjecting the extended probe molecule or derivative thereof to conditions sufficient to produce the barcode-tagged molecule. The first target region and the second target region may be disposed adjacent to each other, or may be separated from each other (e.g., disposed at opposite ends of the gap region). In some cases, barcoding may be facilitated by providing a probe binding molecule (also referred to herein as a "splint molecule" or in some cases as a "splint oligonucleotide"). For example, the first probe and/or the second probe may comprise a probe capture sequence, and the probe-binding molecule may comprise a probe-binding sequence complementary to the probe capture sequence. Additionally or alternatively, the nucleic acid barcode molecule may comprise a barcode sequence and a barcode capture sequence, and the probe binding molecule may comprise a barcode binding sequence complementary to the barcode capture sequence. In some cases, the probe-binding molecules can be pre-annealed to the nucleic acid barcode molecules. Barcoding may include hybridization of the probe binding molecule to the probe capture sequence of the first probe and/or the second probe (or its complement) and to the barcode capture sequence of the nucleic acid barcode molecule. Thus, a barcoded molecule may comprise a sequence corresponding to a first target region, a sequence corresponding to a second target region, a sequence corresponding to a probe capture sequence, and a sequence corresponding to a barcode sequence. One or more operations may be performed within a partition (e.g., a droplet or a well).

The methods described herein can facilitate gene expression profiling with single cell, single cell nuclear, or single cell bead resolution using, for example, nucleic acid extension reactions, probe hybridization, chemical ligation or enzymatic ligation, barcoding, amplification, and sequencing. The methods described herein may allow for gene expression analysis while avoiding the use of specialized imaging equipment, and in some cases, reverse transcription, which may be very error-prone and inefficient. In some cases, the method can be used to analyze a predetermined set of target genes in a single cell, nucleus, or population of cell beads in a sensitive and accurate manner. The methods described herein can also be used to detect or characterize genetic variants, for example, where the sequence of regions disposed between target regions (e.g., gap regions) is unknown. In some cases, the methods described herein can be used to analyze Single Nucleotide Polymorphisms (SNPs), alternative splicing, insertions, mutations, deletions, gene rearrangements (e.g., V (D) J rearrangements), transposons, or other genetic elements or variants. In some cases, the nucleic acid molecules analyzed by the methods described herein can comprise a fusion gene (e.g., a hybrid gene produced by translocation, intermediate deletion, or chromosomal inversion). In some cases, the methods described herein can be used to analyze genomic, transcriptomic, exome, and/or proteomic elements in cells, nuclei, cell beads, tissue samples, spatial arrays of cells, nuclei, or tissues, and the like.

The nucleic acid molecules analyzed by the methods described herein can be single-stranded or double-stranded nucleic acid molecules. The double-stranded nucleic acid molecule may be fully or partially denatured to access a target region (e.g., target sequence) of one strand of the nucleic acid molecule. Denaturation can be achieved, for example, by adjusting the temperature or pH of the solution containing the nucleic acid molecules, using a chemical agent such as formamide, guanidine, sodium salicylate, dimethyl sulfoxide, propylene glycol, urea, or an alkaline agent (e.g., naOH), or using mechanical agitation (e.g., centrifugation or vortexing of the solution containing the nucleic acid molecules).

The nucleic acid molecule can be a target nucleic acid molecule. The target nucleic acid molecule can be an RNA molecule. The RNA molecule can be, for example, a transfer RNA (tRNA) molecule, a ribosomal RNA (rRNA) molecule, a mitochondrial RNA (mtRNA) molecule, a messenger RNA (mRNA) molecule, a non-coding RNA molecule, a synthetic RNA molecule, or another type of RNA molecule. For example, the RNA molecule may be an mRNA molecule. In some cases, the nucleic acid molecule may be a viral or pathogenic RNA. In some cases, the nucleic acid molecule may be a synthetic nucleic acid molecule that was previously introduced into or onto the cell. For example, a nucleic acid molecule may comprise a plurality of barcode sequences, and two or more barcode sequences may be target regions of the nucleic acid molecule. In some cases, the nucleic acid molecule is a guide RNA (gRNA) that can be exogenously introduced into a cell or cell bead. In some cases, the nucleic acid molecule is an RNA molecule derived from an exogenously introduced nucleic acid molecule, e.g., RNA derived from a plasmid, an integrated DNA sequence (e.g., using viral transduction in a cell), gRNA from a CRISPR genetic element, and the like.

The nucleic acid molecule (e.g., RNA molecule) can comprise one or more features selected from the group consisting of a 5' cap structure, an untranslated region (UTR), a 5' triphosphate moiety, a 5' hydroxyl moiety, a Kozak sequence, a Shine-Dalgarno sequence, a coding sequence, a codon, an intron, an exon, an open reading frame, a regulatory sequence, an enhancer sequence, a silencer sequence, a promoter sequence, and a poly (a) sequence (e.g., a poly (a) tail). For example, the nucleic acid molecule may comprise one or more features selected from the group consisting of a 5' cap structure, an untranslated region (UTR), a Kozak sequence, a Shine-Dalgarno sequence, a coding sequence, and a poly (A) sequence (e.g., a poly (A) tail).

The characteristics of the nucleic acid molecule may have any useful characteristics. The 5' cap structure may comprise one or more nucleoside moieties linked by a linker, such as a triphosphate (ppp) linker. The 5' cap structure can comprise naturally occurring nucleosides and/or non-naturally occurring (e.g., modified) nucleosides. For example, the 5' cap structure may comprise a guanine moiety or a modified (e.g., alkylated, reduced or oxidized) guanine moiety, such as a 7-methylguanylate (m ⁷ G) cap. Examples of 5' cap structures include, but are not limited to m⁷GpppG、m⁷Gpppm⁷G、m⁷GpppA、m⁷GpppC、GpppG、m^2,7GpppG、m^2,2,7GpppG and anti-reverse cap analogs such as m ^7,2'OmeGpppG、m^7,2'dGpppG、m^7,3'Ome GpppG and m ^7,3'd GpppG. The untranslated region (UTR) may be a 5'UTR or a 3' UTR. UTR may comprise any number of nucleotides. For example, a UTR may comprise at least 3,5,7,10,20,30,40,50,60,70,80,90,100 or more nucleotides. In some cases, the UTR may comprise less than 20 nucleotides. In other cases, the UTR may comprise at least 100 nucleotides, such as more than 200,300,400,500,600,700,800,900 or 1000 nucleotides. Similarly, the coding sequence may comprise any number of nucleotides, such as at least 3,5,10,20,30,40,50,60,70,80,90,100 or more nucleotides. The UTR, coding sequence, or other sequence of a nucleic acid molecule may have any nucleotide or base content or arrangement. For example, the sequence of a nucleic acid molecule may comprise any number or concentration of guanine, cytosine, uracil, and adenine bases. Nucleic acid molecules can also comprise non-naturally occurring (e.g., modified) nucleosides. The modified nucleoside can include one or more modifications (e.g., alkylation, hydroxylation, oxidation, or other modifications) in its nucleobase and/or sugar moiety.

The nucleic acid molecule may comprise one or more target regions. In some cases, the target region may correspond to a gene or a portion thereof. Each region may have the same or a different sequence. For example, a nucleic acid molecule may comprise two target regions of identical sequence, located at different positions along one strand of the nucleic acid molecule. Alternatively, the nucleic acid molecule may comprise two or more target regions having different sequences. Different target areas can be probed by different probes. The target regions may be adjacent to each other or may be spatially separated along one strand of the nucleic acid molecule. The target regions may be located on the same strand or on different strands. As used herein with respect to two entities, "adjacent" may mean that the entities are immediately next to (e.g., contiguous with) each other or in close proximity to each other. For example, the first target region may be immediately adjacent to the second target region (e.g., no other entity disposed between the first target region and the second target region) or proximate to the second target region (e.g., with intervening sequences or molecules between the first target region and the second target region). In some cases, the double-stranded nucleic acid molecule may comprise target regions in each strand that may be the same or different. For nucleic acid molecules comprising multiple target regions, the methods described herein can be performed on one or more target regions at a time. For example, a single target region of a plurality of target regions may be analyzed (e.g., as described herein), or two or more target regions may be analyzed simultaneously. Analyzing the two or more target regions may involve providing two or more probes, wherein a first probe has a sequence complementary to a first target region, a second probe has a sequence complementary to a second target region, and so on.

Each probe (e.g., first probe and second probe) may also contain one or more additional sequences (e.g., additional probe sequences, unique Molecular Identifiers (UMIs), barcode sequences, primer sequences, capture sequences, or other functional sequences). For example, in some cases, the first probe and/or the second probe may comprise the same or different barcode sequences. In some examples, the first probe and the second probe may be configured to hybridize to one or more nucleic acid barcode molecules. For example, the first probe and/or the second probe may comprise a probe capture sequence that may be configured to hybridize to a nucleic acid barcode molecule or a probe binding molecule (e.g., a splint oligonucleotide) configured to hybridize to a nucleic acid barcode molecule (e.g., via a barcode binding sequence that is complementary to the capture sequence of the nucleic acid barcode molecule). The probe capture sequence can be of any useful length, for example, the probe capture sequence can be about 1, about 2, about 3, about 4, about 5, about 6, about 7, about 8, about 9, about 10, about 20, about 30, about 40, about 50, about 60, about 70, about 80, about 90, about 100 or more nucleotides in length. The length of the probe capture sequence may be at least 1, at least 2, at least 3, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, at least 10, at least 20, at least 30, at least 40, at least 50, at least 60, at least 70, at least 80, at least 90, at least 100 or more nucleotides. The length of the probe capture sequence may be at most 100, at most 90, at most 80, at most 70, at most 60, at most 50, at most 40, at most 30, at most 20, at most 10, at most 9, at most 8, at most 7, at most 6, at most 5, at most 4, at most 3, at most 2, or at most 1 nucleotide. The probe capture sequence may range in length, such as from about 8 to about 50 nucleotides in length, and the like. In some cases, the probe capture sequence length may vary based on any useful application and characteristics, such as melting temperature, annealing strength (e.g., GC content), hybridization stringency, and the like.

Similarly, the probe binding molecules and nucleic acid barcode molecules may also comprise one or more additional sequences (e.g., unique Molecular Identifiers (UMIs), barcode sequences, primer sequences, capture sequences, or other functional sequences). For example, in some cases, the probe binding molecule or barcode molecule can comprise a functional sequence, a primer sequence (e.g., a sequencing primer sequence or a partial sequencing primer sequence), UMI, or the like. The probe binding molecule and the nucleic acid barcode molecule may be of any useful length, for example, either or both may be about 1, about 2, about 3, about 4, about 5, about 6, about 7, about 8, about 9, about 10, about 20, about 30, about 40, about 50, about 60, about 70, about 80, about 90, about 100 or more nucleotides in length. The length of the probe binding molecule or barcode molecule may be at least 1, at least 2, at least 3, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, at least 10, at least 20, at least 30, at least 40, at least 50, at least 60, at least 70, at least 80, at least 90, at least 100 or more nucleotides. The probe capture binding molecule or barcode molecule may be up to 100, up to 90, up to 80, up to 70, up to 60, up to 50, up to 40, up to 30, up to 20, up to 10, up to 9, up to 8, up to 7, up to 6, up to 5, up to 4, up to 3, up to 2, or up to 1 nucleotide in length. A range of probe binding molecules or barcode molecules may be used, such as from about 16 to about 100 nucleotides in length, etc. In some cases, the length of the probe-binding molecule or barcode molecule may vary based on any useful application and characteristics, such as melting temperature, annealing temperature, and the like. In some cases, the first target region and the second target region of the nucleic acid molecule are not adjacent. For example, the first target region and the second target region may be separated by one or more gap regions disposed between the first target region and the second target region. The gap region can comprise, for example, at least one nucleotide base, at least about 2, at least about 3, at least about 4, at least about 5, at least about 6, at least about 7, at least about 8, at least about 9, at least about 10, at least about 20, at least about 30, at least about 40, at least about 50, at least about 60, at least about 70, at least about 80, at least about 90, at least about 100, at least about 200, at least about 300, at least about 400, at least about 500 or more bases. The gap region may comprise up to about 1000, up to about 500, up to about 400, up to about 300, up to about 200, up to about 100, up to about 90, up to about 80, up to about 70, up to about 60, up to about 50, up to about 40, up to about 30, up to about 20, up to about 10, or up to about 5 bases. The gap region may comprise a range of base numbers, such as about 1 to 30 bases.

The target region of the nucleic acid molecule may have one or more useful properties. For example, the target region can have any useful length, base content, sequence, melting point, or other property. The target region may comprise, for example, at least 10 bases, such as at least about 20,25,30,35,40,45,50,60,65,70,75,80,85,90,95,100,110,120,130,140,150,160,170,180,190,200,250,300,350,400,450,500 or more bases. The target region can have any useful base content and any useful base sequence and combination. For example, the target region can comprise one or more adenine, thymine, uracil, cytosine, and/or guanine bases (e.g., natural or canonical bases). The target region may also comprise one or more derivatives or modified forms of natural or canonical bases, such as oxidized, alkylated (e.g., methylated), hydroxylated, or other modified bases. Similarly, the target region may comprise a ribose or deoxyribose moiety and a phosphate moiety or derivative or modified form thereof.

The target region of a nucleic acid molecule may comprise one or more sequences or features of the nucleic acid molecule or a portion thereof. For example, the target region may comprise all or part of a UTR (e.g., a 3'UTR or a 5' UTR) of a nucleic acid molecule, a Kozak sequence, a Shine-Dalgarno sequence, a coding sequence, a poly-a sequence, a cap structure, an intron, an exon, or any other sequence or feature.

Nucleic acid molecules (e.g., RNA molecules, such as mRNA molecules) of a sample may be contained within cells, nuclei, or cell beads. For example, a sample may comprise a cell or nucleus having a nucleic acid molecule. The cell, nucleus or cell bead may comprise additional nucleic acid molecules which may be the same or different from the nucleic acid molecule of interest. In some cases, the sample may comprise a plurality of cells, and each cell may comprise one or more nucleic acid molecules. The cells may be, for example, human cells, animal cells or plant cells. In some cases, the cells may be derived from tissue or fluid, as described herein. The cells may be prokaryotic or eukaryotic. The cells may be lymphocytes, such as B cells or T cells. Cells may be contained within beads such as those disclosed in U.S. patent No. 10,428,326, incorporated herein by reference in its entirety. In some cases, the cells are contained within a tissue sample and may be immobilized to a substrate. For example, as described above, the cells may be cells of a Formalin Fixed Paraffin Embedded (FFPE) sample. In such cases, the method may include additional operations for preparing the cells or nucleic acid molecules contained therein, such as dewaxing, staining (e.g., using an immunizing agent) or decolorizing, decrosslinking, washing, enzymatic treatment, and the like. Additional examples of processing FFPE samples before and after probe hybridization are included in PCT/US2020/066720, which is incorporated herein by reference in its entirety.

Nucleic acid molecules contained in cells, nuclei or cell beads may be accessed by lysing or permeabilizing the cells or nuclei. Lysing cells, nuclei, or cell beads may release the nucleic acid molecules contained therein from the cells, nuclei, or cell beads. The cells or nuclei may be lysed using a lysing agent, such as a bioactive agent. Bioactive agents that can be used to lyse cells or nuclei can be, for example, enzymes (e.g., as described herein). Enzymes used to lyse cells or nuclei may or may not be capable of performing additional functions such as degrading, extending, reverse transcribing, or otherwise altering nucleic acid molecules. Alternatively, ionic or nonionic surfactants such as TritonX-100, tween20, sodium dodecyl sarcosinate or sodium dodecyl sulfate may be used to lyse cells or nuclei. Cell/nucleus lysis may also be achieved using cell disruption methods such as electroporation or thermal, acoustic or mechanical disruption methods. Alternatively, the cell or nucleus may be permeabilized to access the nucleic acid molecules contained therein. Permeabilization may involve partial or complete dissolution or disruption of the cell membrane/nuclear membrane or a portion thereof. Permeabilization can be achieved, for example, by contacting the cell membrane with an organic solvent (e.g., methanol) or detergent (such as Triton X-100 or NP-40). As described elsewhere herein, the cells, nuclei, or cell beads may be immobilized.

In some cases, the cells may be lysed within the cell beads, and a subset of the intracellular content may be associated with the beads. In some cases, the cell beads can comprise a thioacrylamide (thioacrydite) -modified nucleic acid molecule that can hybridize to nucleic acid from the cells. For example, the poly-T nucleic acid sequence may be thioacrylamide modified and bound to a cell bead matrix. After lysis of the cells or nuclei, the cellular nucleic acids (e.g., mRNA) may hybridize to the poly-T sequence. The retained intracellular/nuclear content may be released, for example, by addition of a reducing agent (e.g., DTT, TCEP, etc.). The release may occur at any convenient step, such as before or after the separation.

The nucleic acid molecule or probe-associated nucleic acid molecule may be subjected to conditions sufficient to produce a probe-linked molecule. For example, the first target region may be adjacent to the second target region, and the first probe and the second probe may hybridize to the first target region and the second target region, respectively. The first probe may comprise a first reactive moiety and the second probe may comprise a second reactive moiety. In some cases, the first reactive moiety of the first probe is adjacent to the second reactive moiety of the second probe. The reactive moiety may then be subjected to conditions sufficient to react them to produce a probe-linked nucleic acid molecule comprising a first probe linked to a second probe. For example, the reactive moieties may be linked together via click chemistry or enzymatic linkages, such as those disclosed in U.S. patent publication No. 2020/0239874, international publication No. WO 2019/165318, and international patent publication No. WO2021/237087, each of which is incorporated herein by reference in its entirety. In some examples, the first probe or the second probe may comprise an adenylated oligonucleotide or moiety (e.g., an adenylated phosphate group) that may be used to reduce non-specific ligation reactions. In some cases, ligation of the probes (e.g., via a ligation reaction) can be performed under substantially ATP-free conditions, optionally with enzymes (e.g., ligases) that do not require ATP (e.g., truncated T4 RNA ligases) or that are preactivated (e.g., preactivated T4 DNA ligases). Additional examples of such connection schemes can be found in PCT/US2020/066720 and International patent application No. PCT/US2021/33649 filed on 21/5/2021, which are incorporated herein by reference in their entirety.

In some cases, a first target region of a nucleic acid molecule (e.g., an RNA molecule) may not be adjacent to a second target region. In such cases, the nucleic acid molecule can be subjected to conditions sufficient to hybridize a first probe sequence of a first probe to a first target region to produce a probe-associated nucleic acid molecule. The probe-associated nucleic acid molecule can undergo a nucleic acid reaction (e.g., a nucleic acid extension reaction, reverse transcription, etc.) to produce an extended probe molecule comprising a sequence complementary to the second target region. A second probe comprising a second probe sequence can hybridize to an extended probe molecule (or its complement) and be subjected to conditions (e.g., nucleic acid extension, amplification, hybridization of additional probe molecules, ligation, etc.) sufficient to produce a probe-ligated molecule comprising a sequence corresponding to the first target region and a sequence corresponding to the second target region. Alternatively or in addition, the first and second probes may be provided simultaneously, and after hybridization of the first and second probe sequences with the first and second target regions, respectively, to produce a dual-probe associated nucleic acid molecule, the gap (e.g., a region disposed between the first and second target regions) may be filled (e.g., via a nucleic acid extension or gap filling reaction and/or hybridization of additional probe molecules hybridized to at least a portion of the gap region). In some cases, one or both probes may comprise an overhang sequence or a winged sequence (e.g., at the 5' end) that can be recognized or cleaved by an enzyme (e.g., an endonuclease, such as a FEN1 endonuclease). For example, the second probe may comprise a 5' winged sequence that would be cleaved by a FEN1 endonuclease if at least one specific portion of the second probe hybridized to a nucleic acid molecule (e.g., a target molecule). After hybridization of the second probe to the second target sequence of the nucleic acid molecule, the fin sequence may be cleaved using an endonuclease (e.g., FEN 1), leaving the ligatable end (e.g., phosphorylated end) of the second probe. In the case where the first target region is not adjacent to the second target region, the gap region may be filled and then the fin sequence cleaved. In some cases, the first probe or the second probe and the gap-fill region may be linked (e.g., chemically linked or enzymatically linked). Additional examples of systems and methods for generating probe-linked nucleic acid molecules and gap-filling reactions can be found, for example, in U.S. patent publication No. 2020/0239874, international publication No. WO 2019/165318, and international patent publication No. WO2021/237087, each of which is incorporated herein by reference in its entirety.

The probe-linked nucleic acid molecules may be barcoded to provide barcoded, probe-linked nucleic acid molecules, or the barcoding may occur prior to the generation of the probe-linked nucleic acid molecules. The bar coding may be performed using a variety of techniques. For example, the first probe or the second probe may comprise a probe capture sequence. The nucleic acid barcode molecule may comprise a barcode capture sequence capable of hybridizing to a probe capture sequence. Alternatively, the barcoding may be mediated by a probe binding molecule (e.g., a splint oligonucleotide) comprising (i) a probe binding sequence that may be complementary to a probe capture sequence of the first probe or the second probe, and (ii) a barcode binding sequence that may be complementary to a barcode capture sequence of the nucleic acid barcode molecule. In some cases, the addition of the barcode may be followed by a ligation, such as a chemical or enzyme-mediated ligation, to covalently attach the nucleic acid barcode molecule to the probe (or to the probe binding sequence, and the probe binding sequence may be attached to the probe). Examples of chemical ligation of nucleic acid molecules may include "click chemistry" methods, such as reaction of azide and alkyne moieties, as described in U.S. patent publication No. 2020/0239874, which is incorporated herein by reference in its entirety.

For example, the first probe can comprise a first probe sequence and a probe capture sequence, and the first probe can be subjected to conditions sufficient to hybridize the first probe sequence to the first target region, thereby producing a probe-associated nucleic acid molecule. In some cases, the probe-associated nucleic acid molecules may be subjected to washing or other conditions to remove unannealed probes from the mixture. The probe-associated nucleic acid molecule may extend from one end of the first probe to one end of the nucleic acid molecule hybridized thereto (toward one end proximal to the second target region) to provide an extended nucleic acid molecule. The extended nucleic acid barcode molecule may comprise a sequence complementary to the first probe sequence and the second target region. In some cases, the extended nucleic acid molecule may be barcoded, for example, by hybridizing a barcode capture sequence of the nucleic acid barcode molecule to a probe capture sequence, or (i) hybridizing a probe binding molecule comprising a probe binding sequence and a barcode binding sequence to a probe capture sequence, and (ii) hybridizing a barcode capture sequence of the nucleic acid barcode molecule to a barcode binding sequence of the probe binding molecule. In some cases, probe binding molecules that are pre-annealed to nucleic acid barcode molecules may be provided. Subsequently, a second probe comprising a second probe sequence may be provided. The barcoded, extended nucleic acid molecule may be subjected to conditions sufficient to hybridize a second probe sequence to a second target region or its complement. A nucleic acid extension reaction can be performed to produce a barcoded molecule (e.g., a barcoded, probe-linked molecule) comprising a sequence corresponding to a first target region, a sequence corresponding to a second target region, a sequence corresponding to a probe capture sequence, and a sequence corresponding to a barcode sequence.

FIG. 7 schematically illustrates a method for producing a barcoded nucleic acid molecule as described herein. A nucleic acid molecule (e.g., RNA molecule) 700 comprising a first target region 702 and a second target region 704 may be provided. The nucleic acid molecule 700 can be contacted with a first probe 706 comprising a first probe sequence 708 and optionally a functional sequence 710, thereby producing a probe-associated nucleic acid molecule. First probe sequence 708 can be complementary to first target region 702. Functional sequence 710 may comprise, for example, a probe capture sequence for downstream barcode addition, or it may comprise a different functional sequence, such as a primer sequence, a partial primer sequence, a barcode sequence, a sequencing primer sequence, and the like.

In operation 701, the probe-associated nucleic acid molecule can be subjected to conditions sufficient to extend the first probe 706, thereby producing an extended probe molecule 712 comprising a sequence complementary to the second target region 704. In some cases, the extended probe molecules 712 can be released from the nucleic acid molecule 700, e.g., by denaturing and/or degrading the nucleic acid molecule 700 (e.g., using an rnase, elevated temperature or thermal cycling, pH, etc.). In operation 703, a nucleic acid barcode molecule may be provided. In some cases, the nucleic acid barcode molecule may be partially double-stranded and may comprise a first strand 720 having a barcode sequence, and a second strand 722 having a sequence 724 at least partially complementary to the barcode sequence and a probe binding sequence 726, which may be at least partially complementary to the functional sequence (e.g., probe capture sequence) 710 of the first probe 706. In some cases, the nucleic acid barcode molecule is single-stranded and comprises only a first strand 720 having a barcode sequence and a barcode capture sequence. A probe binding molecule (e.g., a splint oligonucleotide) 722 may be provided that comprises a barcode binding sequence 724 and a probe binding sequence 726 that are at least partially complementary to the barcode capture sequence. In some cases, the probe binding molecules and nucleic acid barcode molecules may be provided as pre-annealed complexes. The nucleic acid barcode molecule (or pre-annealed complex) may be coupled to a bead, such as a gel bead as described herein, and may contain additional functional sequences including, but not limited to, unique Molecular Identifiers (UMI), capture sequences, primer sequences (e.g., R1/R2 sequences).

In operation 705, extended probe molecules may be barcoded by hybridizing probe binding sequences 726 to functional sequences (e.g., probe capture sequences 710). In some cases, the nucleic acid barcode molecule may be covalently linked to the extended probe molecule (e.g., via a probe capture sequence), for example, enzymatically (e.g., using a ligase) or chemically (e.g., using click chemistry). In operation 707, a second probe molecule 716 may be provided. In some cases, operation 707 may also include denaturation of the double stranded molecules. The second probe molecule 716 may comprise a second probe sequence 714 corresponding to the second target region 704 and optionally a functional sequence 718, which may comprise a probe capture sequence, a barcode sequence, a primer sequence, a sequencing primer sequence, and the like. In operation 709, a nucleic acid extension reaction may be performed, for example, using a polymerase, to extend the second probe 716 along the extended probe molecule, thereby generating a barcoded molecule comprising a sequence corresponding to the first target region 702, the second target region 704, a sequence corresponding to the probe capture sequence 710, and a sequence corresponding to the barcode sequence 720.

In another example, the first probe and the second probe may be ligated (e.g., by chemical ligation or enzymatic extension and/or ligation) prior to barcode addition. In such examples, the first probe can hybridize to a nucleic acid molecule (e.g., via hybridization of the first probe sequence to the first target region) to produce a probe-associated nucleic acid molecule. The probe-associated nucleic acid molecule may extend from one end of the first probe to one end of the nucleic acid molecule to which it hybridizes to provide an extended nucleic acid molecule. The extended molecule may be subjected to conditions sufficient to hybridize (e.g., via hybridization of a second probe sequence to a second target region or its complement) to a second probe sequence. Additional nucleic acid extension reactions can be performed to produce extended molecules, and the resulting extension products can be barcoded to produce barcoded molecules. The barcoded molecule may comprise a sequence corresponding to a first target region, a sequence corresponding to a second target region, a sequence corresponding to a probe capture sequence, and a sequence corresponding to a barcode sequence. In some cases, the nucleic acid barcode molecule (or probe binding molecule) may be chemically linked to the first probe or the second probe, such as by a ligation reaction or click chemistry. For example, the nucleic acid barcode molecule may comprise a first reactive moiety and the first probe or the second probe may comprise a second reactive moiety, the first reactive moiety may be configured to react with the second reactive moiety to produce a covalent bond. The barcoded nucleic acid molecules or derivatives thereof may then optionally be further processed and analyzed by any suitable technique, including nucleic acid sequencing, such as Illumina sequencing.

FIG. 8 schematically illustrates another method for producing a barcoded nucleic acid molecule as described herein. A nucleic acid molecule (e.g., RNA molecule) 800 comprising a first target region 802 and a second target region 804 can be provided. Nucleic acid molecule 800 can be contacted with a first probe 806 comprising a first probe sequence 808 and optionally a functional sequence 810, thereby producing a probe-associated nucleic acid molecule. First probe sequence 808 may be complementary to first target region 802. Functional sequence 810 may comprise a probe capture sequence, for example for downstream barcode addition, or it may comprise a different functional sequence, such as a primer sequence, a partial primer sequence, a barcode sequence, a sequencing primer sequence, or the like.

In operation 801, the probe-associated nucleic acid molecule can be subjected to conditions sufficient to extend the first probe 806, thereby producing an extended probe molecule 812 comprising a sequence complementary to the second target region 804. In some cases, the extended probe molecules 812 can be released from the nucleic acid molecule 800, e.g., by denaturing and/or degrading the nucleic acid molecule 800 (e.g., using an rnase, elevated temperature or thermal cycling, pH, etc.). In operation 803, a nucleic acid barcode molecule and a second probe 816 may be provided. The second probe 816 can comprise a second probe sequence 814 corresponding to the second target region 804 and an optional functional sequence 818, which can comprise a probe capture sequence. In some cases, the nucleic acid barcode molecule may be partially double-stranded and may comprise a first strand 820 having a barcode sequence, and a second strand 822 having a sequence 824 complementary to the barcode sequence and a probe binding sequence 826, which may be complementary to a functional sequence (e.g., probe capture sequence) 818 of the second probe 816. In some cases, the nucleic acid barcode molecule is single-stranded and comprises only a first strand 820 having a barcode sequence and a barcode capture sequence. A probe binding molecule (e.g., a splint oligonucleotide) 822 may be provided that includes a barcode binding sequence 824 and a probe binding sequence 826 that are complementary to the barcode capture sequence. In some cases, the probe binding molecules and nucleic acid barcode molecules may be provided as pre-annealed complexes. The nucleic acid barcode molecule (or pre-annealed complex) may be coupled to a bead, such as a gel bead as described herein, and may contain additional functional sequences including, but not limited to, unique Molecular Identifiers (UMI), capture sequences, primer sequences (e.g., R1/R2 sequences). In operation 803, a second probe 816 can be hybridized to the extended probe molecule 812 (e.g., hybridization to the second target region 804 or its complement via the second probe sequence 814), and a nucleic acid barcode molecule can be attached or coupled to the second probe 816 (e.g., hybridization to the probe capture sequence 818 via the probe binding sequence 826). In some cases, a nucleic acid barcode molecule or probe binding molecule may be attached to the second probe 816 (e.g., using a ligase or via chemical ligation, such as click chemistry).

In operation 805, a nucleic acid extension reaction can be performed, for example, using a polymerase (e.g., a DNA polymerase, a hot start polymerase, etc.), to extend a nucleic acid barcode molecule and a second probe 816 along the extended probe molecule, thereby producing a barcode-added molecule comprising a sequence corresponding to the first target region 802, the second target region 804, a sequence corresponding to the probe capture sequence 818, and a sequence corresponding to the barcode sequence 820. The barcoded nucleic acid molecules or derivatives thereof may then optionally be further processed and analyzed by any suitable technique, including nucleic acid sequencing, such as Illumina sequencing.

FIG. 9 schematically illustrates another method for producing a barcoded nucleic acid molecule in a manner similar to that shown in FIG. 8. A nucleic acid molecule (e.g., RNA molecule) 900 comprising a first target region 902 and a second target region 904 can be provided. The nucleic acid molecule 900 can be contacted with a first probe 906 comprising a first probe sequence 908 and optionally a functional sequence 910, thereby producing a probe-associated nucleic acid molecule. The first probe sequence 908 can be complementary to the first target region 902. Functional sequence 910 may comprise, for example, a probe capture sequence, or it may comprise a different functional sequence, such as a primer sequence, a partial primer sequence, a barcode sequence, a sequencing primer sequence, or the like.

In operation 901, the probe-associated nucleic acid molecule can be subjected to conditions sufficient to extend the first probe 906, thereby producing an extended probe molecule 912 comprising a sequence complementary to the second target region 906. In some cases, the extended probe molecules 912 may be released from the nucleic acid molecule 900, for example, by denaturing and/or degrading the nucleic acid molecule 900 (e.g., using an rnase, elevated temperature or thermal cycling, pH, etc.). In operation 903, a second probe 916 may be provided. The second probe 916 may comprise a second probe sequence 914 corresponding to the second target region 904 and an optional functional sequence 918, which may comprise a probe capture sequence. In operation 905, a nucleic acid extension reaction can be performed, for example, using a polymerase, to extend the nucleic acid barcode molecule and the second probe 916 along the extended probe molecule, thereby producing a probe-linked molecule comprising sequences corresponding to the first target region 902 and the second target region 904.

In operation 905, a second probe may also be provided for the nucleic acid barcode molecule. In some cases, the nucleic acid barcode molecule may be partially double-stranded and may comprise a first strand 920 having a barcode sequence, and a second strand 922 having a sequence 924 complementary to the barcode sequence and a probe binding sequence 926 that may be complementary to a functional sequence (e.g., probe capture sequence) 918 of a second probe 916. In some cases, the nucleic acid barcode molecule is single-stranded and comprises only a first strand 920 having a barcode sequence and a barcode capture sequence. A probe binding molecule (e.g., a splint oligonucleotide) 922 may be provided that includes a barcode binding sequence 924 and a probe binding sequence 926 that are complementary to the barcode capture sequence. In some cases, the probe binding molecules and nucleic acid barcode molecules may be provided as pre-annealed complexes. The nucleic acid barcode molecule (or pre-annealed complex) may be coupled to a bead, such as a gel bead as described herein, and may contain additional functional sequences including, but not limited to, unique Molecular Identifiers (UMI), capture sequences, primer sequences (e.g., R1/R2 sequences). In operation 907, the nucleic acid barcode molecule may be attached or coupled to a second probe 916, e.g., via hybridization of a probe binding sequence 926 to a probe capture sequence 918. The resulting barcoded product can comprise sequences corresponding to the first target region 902, the second target region 904, the probe capture sequence 918, and the barcode sequence 920. In some cases, the nucleic acid barcode molecule may be covalently linked to the extended probe molecule (e.g., via probe capture sequence 918), for example, enzymatically (e.g., using a ligase) or chemically (e.g., using click chemistry). The barcoded nucleic acid molecules or derivatives thereof may then optionally be further processed and analyzed by any suitable technique, including nucleic acid sequencing, such as Illumina sequencing.

In additional examples, the methods of the present disclosure can include generating a probe-associated nucleic acid molecule, and barcoding the probe-associated nucleic acid molecule, and optionally performing a ligation operation (e.g., before or after barcoding the probe-associated nucleic acid molecule). For example, a nucleic acid molecule (e.g., an RNA molecule) comprising a first target region and a second target region can be provided. The nucleic acid molecule can be contacted with (i) a first probe comprising a first probe sequence complementary to a first target region and (ii) a second probe comprising a second probe sequence complementary to a second target region, thereby producing a probe-associated nucleic acid molecule. In some cases, the probe-associated nucleic acid molecules can be subjected to conditions (e.g., enzymatic ligation, such as with a polymerase, reverse transcriptase, and/or ligase, or chemical ligation) sufficient to ligate the first probe to the second probe, thereby producing a probe-ligated nucleic acid molecule. The probe-associated nucleic acid molecules or probe-linked molecules can then be barcoded (e.g., in a partition) to produce barcoded nucleic acid molecules.

For example, FIG. 25 schematically illustrates an example method for producing a probe-linked nucleic acid molecule that can then be barcoded, for example, in a partition, to produce a barcoded nucleic acid molecule. A nucleic acid molecule (e.g., RNA molecule) 2500 comprising a first target region 2502 and a second target region 2504 can be provided. In some cases, the first target region is adjacent to the second target region. In operation 2501, the nucleic acid molecule 2500 may be contacted with a first probe 2506 comprising a first probe sequence 2508 complementary to a first target region 2502 and a second probe 2516 comprising a second probe sequence 2514 complementary to a second target region 2504, thereby producing a probe-associated nucleic acid molecule. The first probe 2506 and/or the second probe 2516 can comprise functional sequences, such as a probe capture sequence, a primer sequence, a partial primer sequence, a barcode sequence, a sequencing primer sequence, and the like.

In some cases, one of the probes (e.g., second probe 2516) comprises a winged sequence or an overhang sequence 2530 that can be recognized by an endonuclease (e.g., FEN 1) when second probe sequence 2514 anneals to second target region 2504. For example, second probe 2516 can comprise a 5' winged sequence 2530, and after first probe 2506 and second probe 2516 anneal to nucleic acid molecule 2500, the winged sequence can be adjacent to one end (e.g., the 3' end) of the first probe and one end (e.g., the 5' end) of the second probe. In operation 2503, an endonuclease (e.g., FEN 1) may be used to remove the fin sequence 2530, leaving the ligatable end (e.g., 5' phosphorylated end) of the second probe 2516. In operation 2507, a ligation reaction (e.g., using a ligase) may be performed to ligate the first probe to the second probe, thereby producing a probe-ligated nucleic acid molecule. The probe-linked nucleic acid molecules may then be barcoded, for example, in a partition, as described elsewhere herein. In some cases, the probe-associated nucleic acid molecules may be barcoded and ligated (e.g., in a partition).

FIG. 26 shows another example workflow similar to the workflow shown in FIG. 25, wherein target regions of nucleic acid molecules are not adjacent. Such a workflow may include additional gap-filling reactions to produce probe-associated molecules. In one such example, the first target region 2602 of the nucleic acid molecule 2600 may not be adjacent to the second target region 2604. For example, a gap region may be disposed between the first target region and the second target region. In operation 2601, a first probe 2606 can be annealed to a first target region 2602 and a second probe 2616 can be annealed to a second target region 2604. In operation 2603, an extension reaction (e.g., using a polymerase, reverse transcriptase, etc.) can be performed to fill a gap region between the first probe 2606 and the second probe 2616, producing a gap-filled nucleic acid molecule. In some cases, the second probe 2616 includes a winged sequence 2630. In such a case, in operation 2605, an endonuclease (e.g., FEN 1) can be used to remove the winged sequence 2630, leaving the ligatable end (e.g., the 5' phosphorylated end) of the second probe 2616. In operation 2607, a ligation reaction (e.g., using a ligase) can be performed to ligate the first probe to the second probe, thereby producing a probe-ligated nucleic acid molecule. The probe-linked nucleic acid molecules or alternatively the unligated molecules may be barcoded, for example in one partition.

FIG. 27 shows an additional scheme for generating probe-linked nucleic acid molecules by gap-filling reactions using a third probe. In fig. 27, panel a, first probe 2706 and second probe 2716 anneal (e.g., via first probe sequence and second probe sequence, respectively) to first target region 2702 and second target region 2704 of nucleic acid molecule 2700 to produce a probe-associated nucleic acid molecule. A gap sequence may be disposed between the first target region 2702 and the second target region 2704. A third probe molecule 2770 (shown as two different probe molecules that can be used for SNP detection) can be provided that can anneal to the gap sequence (fig. 27 panel B). In FIG. 27, panel C, the first probe, the third probe, and the second probe can be ligated (e.g., using a ligase) to generate a probe-ligated nucleic acid molecule. The probe-linked nucleic acid molecules or alternatively probe-associated nucleic acid molecules may be barcoded, for example, in one partition.

FIG. 28 shows an example of a ligation protocol for generating probe-ligated nucleic acid molecules. In such examples, the probe molecule may hybridize to a nucleic acid molecule. The first probe may be attached to the second probe using an enzyme, optionally through a gap filling operation as described above. In some cases, the enzyme may be a pre-activated enzyme, such as a pre-activated T4 DNA ligase, and the ligation may occur under ATP-reducing or ATP-removing conditions, such as using apyrase.

Additional examples of methods and systems for generating and barcoding probe-associated nucleic acid molecules can be found, for example, in U.S. patent publication nos. 2020/0239874, WO 2019/165318, PCT/US2020/066720, and international patent application No. PCT/US2021/33649 filed on month 21 of 2021, each of which is incorporated herein by reference in its entirety.

It will be appreciated that, for example, referring to fig. 7-9 and 25-28, a nucleic acid barcode molecule may be attached (e.g., via hybridization) to a first probe and/or a second probe (e.g., via a probe capture sequence contained in the first probe or the second probe). Similarly, the first probe and the second probe may comprise any useful functional sequence, such as a primer sequence, a barcode sequence, a Unique Molecular Identifier (UMI) sequence, a flow cell attachment sequence, a primer binding sequence, a capture sequence, and the like. The first probe may hybridize to the left (e.g., 3 'end) or right (e.g., 5' end) of a nucleic acid molecule (e.g., 700, 800, or 900). Similarly, the second probe may hybridize to the left or right side of the nucleic acid molecule.

As described herein, one or more extension reactions may be performed on the nucleic acid molecules to which the probes hybridize. For example, a probe may extend from one end of the probe to one end of the nucleic acid barcode molecule, or a second probe may extend from one end of the second probe to one end of the first probe of the probe-associated nucleic acid molecule. Extension may include the addition of one or more nucleotides at the end of the probe using enzymes (e.g., polymerase, reverse transcriptase). The extension may provide an extended nucleic acid molecule comprising a sequence complementary to a target region of the nucleic acid molecule of interest, a barcode sequence, and optionally one or more additional sequences of the nucleic acid barcode molecule, such as one or more binding sequences. In some cases, appropriate conditions and/or chemicals (e.g., as described herein) can be applied to denature the extended nucleic acid molecules from the nucleic acid barcode molecules and the target nucleic acid molecules. In some cases, one or more of the processes may involve the use of a thermal sensitizer. For example, in some cases, the probe may anneal or hybridize under a set of temperature conditions, while the extension may occur under a different set of temperature conditions. In some cases, a warm-start or hot-start polymerase may be used. In some cases, hybridization of a nucleic acid barcode molecule to one or more probes (e.g., direct hybridization or via a probe binding molecule such as a splint oligonucleotide) may precede hybridization of the probes to a target region of the nucleic acid molecule. After the barcode, the barcoded nucleic acid molecule may be replicated or amplified by, for example, one or more amplification reactions. The amplification reaction may comprise a Polymerase Chain Reaction (PCR) and may involve the use of one or more primers or polymerases. The extension, denaturation and/or amplification process can be performed in a single partition or in bulk. In some cases, the extended nucleic acid molecule or derivative thereof (e.g., a barcoded molecule) can be replicated or amplified within a partition to provide an amplified product. The barcoded products or their complementary sequences (e.g., amplified products) can be detected via sequencing (e.g., as described herein).

The nucleic acid molecule or derivative thereof (e.g., a probe-linked nucleic acid molecule, a nucleic acid molecule having one or more probes hybridized thereto, a barcoded, probe-linked nucleic acid molecule or an extended nucleic acid molecule or a complement thereof) or a cell or cell bead comprising the nucleic acid molecule or derivative thereof may be provided within a partition (such as a well or droplet), e.g., as described herein. The one or more reagents may be co-partitioned with the nucleic acid molecule or derivative thereof or a cell comprising the nucleic acid molecule or derivative thereof. For example, a nucleic acid molecule or derivative thereof or a cell comprising the nucleic acid molecule or derivative thereof may be co-partitioned with one or more reagents selected from the group consisting of a lysing agent or buffer, a permeabilizing agent, an enzyme (e.g., an enzyme capable of digesting one or more nucleic acid molecules, reverse transcribing an RNA molecule, permeabilizing or lysing a cell or performing other functions), a fluorophore, an oligonucleotide, a primer, a probe, a barcode, a nucleic acid barcode molecule (e.g., a nucleic acid barcode molecule comprising one or more barcode sequences), a buffer, a deoxynucleotide triphosphate, a detergent, a reducing agent, a chelating agent, an oxidizing agent, a nanoparticle, a bead, and an antibody. In some cases, the nucleic acid molecule or derivative thereof, or a cell (e.g., a cell bead) comprising the nucleic acid molecule or derivative thereof, may be co-partitioned with one or more reagents selected from the group consisting of temperature sensitive enzymes, pH sensitive enzymes, photosensitive enzymes, reverse transcriptases, proteases, ligases, polymerases, restriction enzymes, nucleases, protease inhibitors, exonucleases, and nuclease inhibitors. For example, a nucleic acid molecule or derivative thereof or a cell comprising the nucleic acid molecule or derivative thereof may be co-partitioned with a polymerase and a nucleotide molecule. Separating a nucleic acid molecule or derivative thereof or a cell comprising the nucleic acid molecule or derivative thereof and one or more reagents may comprise flowing a first phase comprising an aqueous fluid, the cell and the one or more reagents, and a second phase comprising a fluid that is immiscible with the aqueous fluid to the junction. Upon interaction of the first phase and the second phase, discrete droplets of the first phase comprising the nucleic acid molecule or derivative thereof or cells (e.g., cell beads) comprising the nucleic acid molecule or derivative thereof and the one or more reagents may be formed. In some cases, a partition may comprise a single cell. Cells can be lysed or permeabilized within a partition (e.g., droplet) to access nucleic acid molecules of the cell.

One or more processes may be performed within a zone (e.g., drop, well, etc.). For example, a nucleic acid molecule or a cell or cell bead comprising a nucleic acid molecule can be co-partitioned with one or more reagents (e.g., as described herein) at any useful stage of the method. For example, a probe-associated nucleic acid molecule (e.g., a nucleic acid molecule that hybridizes to a first probe) can be generated in bulk (e.g., in a population of cells that can be living or immobilized and/or permeabilized, in a tissue sample, etc.) and subjected to conditions sufficient to generate an extended probe molecule. The extended probe molecules may then be partitioned in one of a plurality of partitions. The partition may include a second probe and a nucleic acid barcode molecule, and optionally a probe binding molecule. As described herein, the second probe can hybridize (e.g., via the second probe sequence) to a second target region of a molecule with which the probe is associated, or to a complement thereof. The partitions may include additional reagents for performing a nucleic acid reaction (e.g., digestion, ligation, extension, amplification). For example, the probe-associated nucleic acid molecule may comprise or hybridize to a nucleic acid molecule, and the partition may include a degrading enzyme (e.g., an rnase) that may be used to digest or remove template strands (e.g., a nucleic acid molecule, such as an RNA molecule) from the extended probe molecule. The partition may include a polymerase that may be used to extend the second probe hybridized to the extended probe molecule. In some cases, the partition includes a ligase (linking enzyme) (e.g., ligase) that may be used to ligate the nucleic acid barcode molecule to the first probe or the second probe (e.g., via a probe capture sequence). In some cases, a ligase may be used to ligate the probe binding molecule to the probe capture sequence of the first probe or the second probe. In some cases, the probe binding molecule, probe capture sequence, and/or barcode capture sequence comprise one or more reactive moieties that can be used to chemically or enzymatically link the nucleic acid barcode molecule to the probe capture sequence or its complement. The resulting barcoded product may comprise a sequence corresponding to the first target region, a sequence corresponding to the second target region, a sequence corresponding to the probe capture sequence, and a sequence corresponding to the barcode sequence.

For example, referring again to fig. 7, operation 701 may be performed in an ontology (e.g., outside a partition), while operations 703, 705 may be performed in one partition. Operations 707 and 709 may be performed in the ontology or within the partition. Similarly, referring to FIG. 8, operation 801 may be performed in an ontology, while operation 803 may be performed in a partition. Operation 805 may be performed in an ontology or in a partition. Referring to fig. 9, operation 901 may be performed in an ontology, while operations 903, 905, and 907 may be performed in one partition. It should be appreciated that any operations may be performed in an ontology or in a partition at any convenient step, and that the order of the operations may be altered for suitable or useful purposes.

Similarly, a nucleic acid molecule or a cell or cell bead comprising a nucleic acid molecule or derivative thereof (e.g., a probe-associated molecule, an extended molecule, a barcoded molecule, etc.) may be released from a partition at any useful stage of the method. For example, the extended probe molecule can hybridize to the second probe and be released from the partition after the barcode capture sequence of the nucleic acid barcode molecule hybridizes to the first probe, the second probe, or the probe binding molecule. Alternatively, the extended probe molecule may be released from the partition after (i) hybridization of the second probe to the nucleic acid barcode molecule and (ii) extension of the second probe to produce a barcoded molecule comprising a sequence corresponding to the first target region, a sequence corresponding to the second target region, a sequence corresponding to the probe capture sequence, and a sequence corresponding to the barcode sequence. Replication and/or amplification of the extended nucleic acid molecule may be performed in a partition or in bulk (e.g., in solution). In some cases, the solution may contain additional extended nucleic acid molecules produced by the same process performed in different partitions. Each extended nucleic acid molecule may comprise a different barcode sequence, and the barcode sequence may be used to identify the partition or cell from which the extended nucleic acid molecule is derived. In such cases, the solution may comprise a combined mixture comprising the contents of two or more partitions (e.g., droplets).

Additional processes or manipulations may be performed within the partition, including, but not limited to, cleavage, permeabilization, denaturation, hybridization, extension, replication, and amplification of one or more components of the sample. In some cases, multiple processes are performed within one partition.

Hybridization of the probe sequence to the target region of the nucleic acid molecule may be performed within a partition or outside a partition. In some cases, the double stranded nucleic acid molecule may be denatured prior to hybridization to provide single stranded nucleic acid molecules, or the cells may be lysed or permeabilized. In some cases, hybridization may occur in cell beads comprising cells. The probe sequence complementary to the target region may be located at one end of the probe. Alternatively, the sequence may be disposed between other sequences such that when the probe sequence hybridizes to the target region, the additional probe sequence extends beyond the hybridized sequence in one or more directions. The probe sequence that hybridizes to the target region of the nucleic acid molecule may have the same or different length as the target region. For example, the probe sequence may be shorter than the target region and may hybridize to only a portion of the target region. Alternatively, the probe sequence may be longer than the target region, and may hybridize to the entire target region and extend beyond the target region in one or more directions. In addition to probe sequences complementary to the target region of the nucleic acid molecule, the probe may also comprise one or more additional probe sequences. For example, the probe may comprise a probe sequence complementary to the target region and a second probe sequence. The second probe sequence may have any useful length and other properties.

The probe (e.g., first probe or second probe) may comprise one or more additional sequences or portions, such as one or more barcode sequences or Unique Molecular Identifier (UMI) sequences, adapter sequences, functional sequences (e.g., primer sequences, sequencing primer sequences, etc.). In some cases, one or more probe sequences of the probe may comprise a detectable moiety, such as a fluorophore or fluorescent moiety. In some cases, the first probe or the second probe may comprise a reactive moiety, as described elsewhere herein. For example, the first probe or the second probe may comprise an azide moiety, an alkyne moiety, a phosphorothioate moiety, an iodide moiety, an amine moiety, a phosphate moiety, or a combination thereof. The first probe may comprise a first reactive moiety and the second probe may comprise a second reactive moiety, and the reaction of the first reactive moiety and the second reactive moiety may be sufficient to produce a probe-linked molecule comprising the first probe linked to the second probe. In some cases, the first reactive moiety and the second reactive moiety are linked via a ligation reaction. Thus, the first probe or the second probe may comprise one or more moieties or modified nucleotides to facilitate a ligation reaction, e.g., one or more ribonucleotides or dideoxynucleotides (ddntps), that may be ligated to the phosphorylated end of the second probe using a ligase (e.g., T4 DNA ligase, splintR ligase). In some cases, a probe (e.g., a first probe or a second probe) may comprise an overhang sequence or a winged sequence that can be recognized or cleaved by an endonuclease (e.g., a FEN1 endonuclease). Other suitable enzymes may be used, such as ligases, e.g., the enzymes and ligases disclosed in U.S. provisional application No. 63/171,031, filed on 5, 4, 2021, which is incorporated herein by reference in its entirety.

As described herein, the probe sequence of the probe may be capable of hybridizing to the sequence of a nucleic acid barcode molecule or a probe binding molecule (e.g., a splint oligonucleotide). The nucleic acid barcode molecule may comprise a first binding sequence (e.g., a barcode capture sequence) that is complementary to a probe sequence (e.g., a probe capture sequence) of a probe. The nucleic acid barcode molecule may comprise one or more additional functional sequences, such as primer sequences, primer annealing sequences, and immobilization sequences. The binding sequence may have any useful length and other properties. In some cases, the binding sequence (e.g., a barcode capture sequence) complementary to the probe sequence of the probe may be the same length as the probe sequence. Alternatively, the binding sequences may be probe sequences of different lengths. For example, the binding sequence may be shorter than the probe sequence and may hybridize to only a portion of the probe sequence. Alternatively, the binding sequence may be longer than the probe sequence and may hybridize to the entire probe sequence and extend beyond the probe sequence in one or more directions. Similarly, where a probe binding molecule is used, the binding sequence (e.g., the barcode capture sequence) of the nucleic acid barcode molecule may be the same length as the barcode binding sequence of the probe binding molecule, or the binding sequence may be longer or shorter than the barcode binding sequence.

One or more of the processes described herein may be performed in a cell, nucleus, or cell bead. For example, in some embodiments, a plurality of cells, nuclei, or cell beads may comprise a plurality of nucleic acid molecules. The cells, nuclei or cell beads may be living or immobilized and/or permeabilized. In some cases, the first probe may be provided to the cell, nucleus, or cell bead, such as in a bulk solution. Optionally, the cells, nuclei, or cell beads may be washed to remove unbound first probes, and a nucleic acid extension reaction as described herein may be performed. Subsequently, the cells, nuclei, or cell beads comprising the plurality of nucleic acid molecules (or extended probe nucleic acid molecules) may be partitioned into a plurality of separate partitions, wherein at least a subset of the plurality of separate partitions comprises a single cell, a single nucleus, or a single cell bead. The target nucleic acid molecules within the cells, nuclei, or cell beads contained in one partition can be accessed by lysing or permeabilizing the nuclei or cells (e.g., as described herein), which can be performed prior to or during partitioning. Alternatively, access to target nucleic acid molecules contained within cells, nuclei, or cell beads can be performed in bulk solution using the methods described herein. The addition of the barcode to the target nucleic acid can be performed intracellularly, within a cell bead, or within a partition containing the cell or cell bead. In one example, the target nucleic acid molecule can be in a partition and a barcode added to the cell, nucleus or cell bead. Additional probe hybridization (e.g., providing a second probe) and/or bar coding may be performed within a separate partition. As described herein, barcoding can include using a nucleic acid barcode molecule to attach or hybridize to a target nucleic acid molecule or derivative thereof (e.g., an extended probe molecule or a complementary sequence thereof). Nucleic acid barcode molecules provided within each of the plurality of separate partitions may be provided attached to the beads. In some cases, the nucleic acid barcode molecules may be releasably attached to the beads (e.g., via labile bonds), as described elsewhere herein. Each partition (or subset of partitions) of the plurality of individual partitions may comprise a bead comprising a plurality of nucleic acid barcode molecules attached thereto (e.g., as described herein). The plurality of nucleic acid barcode molecules attached to each bead may comprise a unique barcode sequence such that each of the plurality of separate partitions comprises a different barcode sequence. Upon release of the component from a plurality of different partitions of the plurality of individual partitions (e.g., after barcoding), the barcoded molecules derived from a single cell, a single cell nucleus, or a single cell bead may have the same barcode sequence (e.g., a common barcode sequence) such that each barcoded nucleic acid molecule may be traced back to a given partition and/or in some cases, a given cell, cell nucleus, or cell bead.

The methods described herein may include additional barcoding operations that may be used, for example, to index nucleic acid molecules to cells, nuclei, cell beads, samples, a partition, or multiple partitions. Such indexing may be used when a single partition is occupied by multiple cells, nuclei, or beads of cells. In some cases, it may be advantageous to overload the partitions such that one partition includes more than a single cell, a single nucleus, or a single cell bead, for example, it may be useful in some cases to overload the partition, e.g., to overcome poisson load statistics in the partition and/or to prevent reagent waste (e.g., caused by an unoccupied partition). Thus, such an index can be used to attribute cells, nuclei, or nucleic acid molecules in a multi-occupied partition to the starting cells, nuclei, cell beads, partitions, samples, and the like.

In one example, a barcoded molecule, such as produced using the methods described herein, may be provided (e.g., in fig. 7-9, 25-28, and the barcoded, probe-linked nucleic acid molecules described in U.S. patent publication nos. 2020/0239874 and WO 2019/165318, each of which is incorporated herein by reference). As described herein, a barcoded molecule may comprise a sequence corresponding to a first target region, a sequence corresponding to a second target region, a sequence corresponding to a probe capture sequence (which may be disposed on either the first probe or the second probe), and a sequence corresponding to a barcode sequence of a nucleic acid barcode molecule. Such barcode sequences may be specific for the partition and may be different from other barcode sequences of other partitions, and thus may be used to identify the partition from which the nucleic acid molecule (or derivative thereof) originated. In some cases, some partitions may include single cells, single nuclei, or single cell beads, and thus nucleic acid barcode molecules or barcode sequences may be used to identify cells, nuclei, or cell beads from which the nucleic acid molecules (or derivatives thereof) were derived.

In some cases, the barcoded molecules may be subjected to additional barcoding operations (e.g., in a partition or in an ontology). For example, the barcoded molecules may be re-partitioned in one of a plurality of partitions that include a plurality of additional nucleic acid barcode molecules. The plurality of additional nucleic acid barcode molecules may comprise additional barcode sequences that differ between partitions. The barcoded molecules may be subjected to conditions sufficient to barcode the barcoded molecules to produce a combined barcoded molecule comprising two barcode sequences. Since each barcode sequence belongs to a unique partition, combinations of barcodes can be used to create more diverse barcoded molecules, as well as to identify the starting partition of the combined barcoded molecule.

In some cases, the combined assembly of barcode fragments may be performed using, for example, a split-pool method. For example, in some embodiments, the probe-linked nucleic acid molecules may be subjected to combinatorial barcoding using a split-cell method. In one such example, a plurality of permeabilized cells (or permeabilized nuclei or cell beads) comprising, for example, a probe-linked nucleic acid molecule (e.g., the product following operation 709 of FIG. 7, operation 805 of FIG. 8, or operation 905 or 907 of FIG. 9), optionally with a barcode, can be separated into a plurality of partitions (e.g., a plurality of wells), wherein each partition in the plurality of partitions comprises a different (e.g., unique) barcode sequence fragment. Alternatively, a plurality of permeabilized cells (or permeabilized nuclei or cell beads) can be separated, and then different barcode sequence fragments delivered to the corresponding partition comprising the cells, nuclei, and/or cell beads. After the barcode sequence fragments are added, cells (or nuclei or cell beads) can be collected from the plurality of partitions, pooled and partitioned into an additional plurality of partitions (e.g., a plurality of wells), wherein each partition of the additional plurality of partitions includes a different (e.g., unique) second barcode sequence fragment. Repeating the split-cell process allows the generation of barcodes or barcoded molecules comprising any suitable amount of barcode sequence fragments. A combined plus bar code as described herein may include at least 1,2, 3, 4, 5, 6, 7, 8, or more operations (e.g., split-cell cycles). A combined plus barcode comprising multiple operations can be used, for example, to create greater barcode diversity, as well as to synthesize unique barcode sequences on nucleic acid molecules derived from each single cell, nucleus, or cell bead in a plurality of cells, nuclei, or cell beads. For example, a combined plus barcode that includes three operations, each of which includes attaching a unique nucleic acid sequence in each of 96 partitions, will result in up to 884,736 unique barcode combinations. typically, where there are M partitions and N split pool iterations are performed, up to M ^N unique bar code combinations can be generated. The cells or nuclei or cell beads may be separated such that there is at least one cell (or nucleus or cell bead) in each of the plurality of partitions. The cells, nuclei or cell beads may be separated such that at least 1;2;3;4;5;10;20;50;100;500;1,000;5,000;10,000;100,000;1,000,000; or more cells, nuclei or cell beads are present in a single partition. Cells, nuclei or cell beads may be separated such that up to 1,000,000;100,000;10,000;5,000;1,000;500;100;50;20;10;5;4;3;2; or 1 cell (or nucleus or cell bead) is present in a single partition. Cells, nuclei and/or cell beads may be separated in a random arrangement.

In some cases, additional bar code adding operations may be performed prior to some of the operations described herein. For example, it may be advantageous to combine the first probes in a bulk solution with a barcode, e.g., before or after the creation of the extended probe molecules or probe-linked molecules. In such cases, the nucleic acid molecule may be contacted with the first probe, e.g., in bulk, to produce a probe-associated molecule. The probe-associated molecules can optionally be extended, for example, using the methods described herein, to produce extended probe molecules. The probe-associated molecules or extended probe molecules may then be subjected to combined barcoding, for example, in a partition as described above, to produce combined barcoded molecules. The combined barcoded molecule can then be separated from the second probe and the nucleic acid barcode molecule, which can be attached to the first probe (or combined barcoded probe), the second probe, or both, as described herein. Since each partition of the combined barcoding process includes a different barcode sequence fragment, multiple combined barcoded molecules can trace back to the individual partition from which they originated. In addition, combinatorial addition of barcodes can be used to create greater probe diversity.

Advantageously, the combined addition of a barcode to the first probe may be particularly useful when combined with a second probe and a nucleic acid barcode molecule, which may contain a barcode sequence specific to the partition. For example, the presence of a probe-specific barcode and a partition-specific barcode sequence may allow indexing of individual cells (or nuclei or cell beads) within a partition. For example, a partition comprising a cell/nucleus/cell bead multiplex (e.g., cell duplex, triplet, etc.) may be computationally deconvolved into a single cell/nucleus/cell bead. Thus, in some cases, the cells, nuclei, or cell beads may be "overloaded" into partitions using conditions such that there is a higher probability of forming cell/nucleus/cell bead complexes (2, 3,4, 5+ cells, nuclei, or cell beads per partition), where the target library of these cell complexes may be computationally deconvolved into a single cell, nucleus, or cell bead.

FIG. 10 schematically shows an example workflow for barcode of nucleic acid molecules in a partition containing cell/nucleus/cell bead multiplex. In operation 1010, one or more populations of cells/nuclei/cell beads (or nucleic acid molecules contained therein) may be subjected to barcoding, as described herein (e.g., using the processes shown and described in fig. 7-9 and 15-16). For example, a first population 1002 (comprising a first plurality of nucleic acid molecules) of cells (or nuclei or cell beads) may be subjected to a barcode in a first subset of the first plurality of partitions, resulting in a first plurality of barcode-tagged nucleic acid molecules comprising a first barcode sequence. A second population 1004 of cells (or nuclei or cell beads) can be barcoded in a second subset of the first plurality of partitions, resulting in a second plurality of barcoded nucleic acid molecules comprising a second barcode sequence. The first barcode sequence may be different from the second barcode sequence. In operation 1020, a first population of cells (or nuclei or cell beads) 1002 and a second population of cells (or nuclei or cell beads) 1004 may be combined to produce a mixture of cells. In operation 1030, the mixture of cells (or nuclei or cell beads) may be partitioned into a second plurality of partitions. In some cases, the mixture of cells/nuclei/cell beads may be partitioned into a second plurality of partitions such that some of the second plurality of partitions include more than one cell (e.g., a cell multi-body partition). For example, the partitions 1035 in the second plurality of partitions may include cells, nuclei, or cell beads ("cell a") from the first population of cells 1002 and cells, nuclei, or cell beads ("cell B") from the second population of cells 1004. The partition 1035 may include additional barcode sequences that may be unique to the partition. The cells/nuclei/cell beads in each partition may be subjected to additional barcoding operations to supplement the barcoded nucleic acid molecules with additional barcode sequences. In operation 1040, the barcoded nucleic acid molecules may be deconvolved using different barcode sequences (e.g., a first barcode sequence, a second barcode sequence, and additional barcode sequences) to identify starting cells/nuclei/cell beads. For example, a barcoded nucleic acid molecule comprising an additional barcode sequence from partition 1035 and a first barcode sequence from first population of cells (or nuclei or cell beads) 1002 may be used to identify the barcoded nucleic acid molecule as originating from cell a. Similarly, a barcoded nucleic acid molecule comprising an additional barcode sequence from partition 1035 and a second barcode sequence from a second population of cells (or nuclei or cell beads) 1004 can be used to identify the barcoded nucleic acid molecule as originating from cell B.

After partition-based barcoding, the contents of the partitions may be combined, and the barcoded molecules (e.g., the barcoded, probe-linked nucleic acid molecules) may be replicated or amplified by, for example, one or more amplification reactions, which may be isothermal in some cases. The amplification reaction may comprise a Polymerase Chain Reaction (PCR) and may involve the use of one or more primers or polymerases. The one or more primers can comprise one or more functional sequences (e.g., primer sequence/primer binding sequence, sequencing primer sequence (e.g., R1 or R2), partial sequencing primer sequence (e.g., partial R1 or partial R2), sequence configured to be attached to a flow cell of a sequencer (e.g., P5 or P7, or partial sequences thereof), etc.) and can facilitate addition of the one or more functional sequences to the extended nucleic acid molecule. The barcoded molecules or derivatives thereof can be detected via nucleic acid sequencing (e.g., as described herein).

In some aspects, provided herein are systems useful for barcoding nucleic acid molecules. The system may include any of the components described herein, e.g., a plurality of partitions (e.g., droplets, wells), which may be provided in any useful form, e.g., a microfluidic device, a multi-well array, or plate, etc. The system can include a nucleic acid barcode molecule, optionally coupled to a support (e.g., a particle, bead, gel bead, etc.). In some cases, the system can include any of the probes described herein, such as a first probe or first plurality of probes, a second probe or second plurality of probes, and any useful reaction components (e.g., for performing a nucleic acid reaction, e.g., extension, ligation, amplification, etc.). In non-limiting examples, such useful reaction components can include enzymes (e.g., ligases, polymerases, reverse transcriptases, restriction enzymes, etc.), nucleotide bases, and the like.

Also provided herein are compositions useful for systems and methods for barcoding nucleic acid molecules. The composition may comprise any of the probes described herein. For example, the composition may comprise a plurality of first probes, a plurality of second probes, and/or a plurality of first probes and a plurality of second probes. A probe or set of probes may be designed to target a specific sequence or set of specific sequences. Such probes may be designed to have the same or different sequences within different partitions. For example, the first composition may comprise a first probe and a second probe designed to target two regions of a first gene, and the second composition may comprise a first probe and a second probe designed to target two regions of a second gene, the second gene being different from the first gene. The composition may comprise a nucleic acid barcode molecule and/or a probe binding molecule, which may optionally be provided coupled to a support (e.g., particle, bead). The composition may be part of or comprise a reaction mixture that may comprise reaction components or reagents, such as enzymes, nucleotide bases, catalysts, buffers, and the like.

Multiplex analysis of nucleic acids and proteins

In another aspect, the present disclosure provides a method for performing a multiplex assay. Such multiplex assays may include assaying or analyzing one or more biomolecules (e.g., nucleic acid molecules, proteins, lipids, carbohydrates, etc.). The method may include barcoding a nucleic acid molecule of a cell/cell nucleus/cell bead using one or more probes and nucleic acid barcode molecules to produce a first barcoded nucleic acid molecule, attaching or coupling a feature binding group to a feature of the cell/cell nucleus/cell bead, wherein the feature binding group comprises a reporter oligonucleotide comprising a reporter sequence that recognizes the feature binding group, barcoding the reporter sequence using an additional nucleic acid barcode molecule and optionally an additional probe to produce a second barcoded nucleic acid molecule, and optionally barcoding the first and second barcoded nucleic acid molecules to produce a third and fourth barcoded nucleic acid molecule. One or more operations may be performed within a partition (e.g., a droplet or a well).

The methods described herein can facilitate profiling one or more biomolecules at single cell/single cell nuclear/single cell bead resolution using, for example, probe hybridization, characteristic binding groups (e.g., antibodies, antibody fragments, epitope binding groups, etc.), barcoding, amplification, and sequencing. The method can be used to provide genomic, transcriptomic, proteomic, exome, or other "histologic" information from individual cells/nuclei/cell beads. As described herein, the method can be used to analyze a set of predetermined target genes and a set of predetermined target features (e.g., proteins, peptides, or other biomolecules) in a sensitive and accurate manner. Alternatively or in addition, the method may be used to analyze characteristics of whole genomes, whole transcriptomes, whole exons, etc. of cells.

In some aspects, the method comprises contacting the cell/nucleus/cell bead with the first probe, the second probe, and the third probe under conditions sufficient to produce a first probe-associated molecule and a second probe-associated molecule. The cell/nucleus/cell bead may comprise (i) a nucleic acid molecule (e.g., a target nucleic acid molecule such as RNA or DNA) comprising a first target region and a second target region, and (ii) a feature (e.g., a protein, peptide, or other biomolecule) coupled to a feature binding group. The feature binding group may comprise or be coupled to (i) a reporter oligonucleotide comprising a reporter sequence, which may be associated with a feature or may be used to identify a feature, and (ii) a feature probe binding sequence. The first probe may comprise a first probe sequence complementary to a first target region of the nucleic acid molecule, and optionally additional probe sequences, such as a probe capture sequence or other functional sequences. The second probe may comprise a second probe sequence complementary to the second target region, and optionally a probe capture sequence or functional sequence. The third probe may comprise (i) a third probe sequence complementary to the characteristic probe binding sequence, and (ii) a probe capture sequence or functional sequence, which may be the same sequence as the probe capture sequence of the first probe and/or the second probe.

In some cases, the first probe-associated molecule can comprise a nucleic acid molecule, a first probe, a second probe, or a combination or complement thereof. The second probe-associated molecule may comprise a reporter oligonucleotide (which comprises a reporter sequence) and a third probe or a complement thereof.

In some aspects, the method includes providing a first probe-associated molecule and a second probe-associated molecule, and barcoding the first probe-associated molecule and the second probe-associated molecule. Such a bar code addition operation may occur in a first set of partitions (e.g., droplets or holes). Such example methods can include contacting the first probe-associated molecule and the second probe-associated molecule with a probe-binding molecule (e.g., a splint oligonucleotide) and a barcode molecule (e.g., a nucleic acid barcode molecule) under conditions sufficient to produce a first barcoded nucleic acid molecule and a second barcoded nucleic acid molecule. The barcode molecule may comprise (i) a barcode capture sequence, such as a common sequence common to a plurality of barcode molecules, and (ii) a first barcode sequence. In the case of using partitions, the first barcode sequence may be unique to a first partition of the first set of partitions, and the barcode molecules within the first partition may share the same first barcode sequence. The probe binding molecules can comprise (i) a probe binding sequence that is complementary to a probe capture sequence (of the first probe, the second probe, and/or the third probe), and (ii) a barcode binding sequence that is complementary to a barcode capture sequence (e.g., a common sequence) of a plurality of barcode molecules. Thus, the barcoding of the first probe-associated molecule and the second probe-associated molecule may comprise hybridization of the probe-binding molecule to (i) the probe capture sequences of the first probe, the second probe, and/or the third probe (or their complements), and (ii) the barcode capture sequences (or the common sequences) of the nucleic acid barcode molecules. In some examples, the first barcoded nucleic acid molecule comprises a sequence corresponding to the first probe sequence, a sequence corresponding to the second probe sequence, and a sequence corresponding to the first barcode sequence. Similarly, the second barcoded nucleic acid molecule can comprise a sequence corresponding to the reporter sequence, a sequence corresponding to the third probe sequence, and a sequence corresponding to the first barcode sequence.

The method may further comprise providing a second set of partitions, and in the second partition of the second set of partitions, (i) contacting the first barcoded nucleic acid molecule or derivative thereof (e.g., a complement thereof, an amplicon, an extension product) with a first capture molecule of the plurality of capture molecules under conditions sufficient to produce a third barcoded nucleic acid molecule, and (ii) contacting the second barcoded nucleic acid molecule or derivative thereof with a second capture molecule of the plurality of capture molecules under conditions sufficient to produce a fourth barcoded nucleic acid molecule. The plurality of capture molecules may each comprise a second barcode sequence, which may be the same or different from the first barcode sequence from the first set of partitions. The second barcode sequence may be specific to the partition (e.g., different between partitions). The third and fourth barcoded nucleic acid molecules may each comprise a sequence corresponding to the first barcode sequence and a sequence corresponding to the second barcode sequence. For example, the third barcoded nucleic acid molecule can comprise a sequence corresponding to the first target region, a sequence corresponding to the second target region, a sequence corresponding to the probe capture sequence, a first barcode sequence, and a second barcode sequence. The fourth barcoded nucleic acid molecule can comprise a sequence corresponding to a reporter sequence, a sequence corresponding to a characteristic probe binding sequence, a sequence corresponding to a third probe, a first barcode sequence, and a second barcode sequence.

The characteristic binding group may comprise a labelling agent, as described elsewhere herein. Thus, in some examples, the characteristic binding group may comprise an antibody or antibody fragment, an epitope-binding portion, a protein, a peptide, a lipophilic portion (such as cholesterol), a cell surface receptor binding molecule, a receptor ligand, a small molecule, a bispecific antibody, a bispecific T cell adapter, a T cell receptor adapter, a B cell receptor adapter, an antibody prodrug, an aptamer, a monoclonal antibody, affimer, darpin, and a protein scaffold, or any combination thereof.

The probe capture sequence of the first probe (or the second probe) may be common to a plurality of first probes (or the second probe), a plurality of partitions, and/or a plurality of cells/nuclei/cell beads. For example, the first set of partitions can include one or more additional partitions that include additional probe-associated nucleic acid molecules. The additional probe-associated nucleic acid molecule may comprise the same sequence as the first partitioned probe-associated nucleic acid molecule (e.g., first probe sequence, second probe sequence), or the additional partitioned additional probe-associated nucleic acid molecule may comprise a different sequence (e.g., a different probe sequence) than the first partitioned probe-associated nucleic acid molecule. In some cases, each of the one or more additional probe-associated nucleic acid molecules comprises a probe capture sequence, which may be the same or different in the first set of partitions.

The probe-associated molecules may be probe-linked molecules. For example, the probe-associated molecules may be probe-associated molecules or bar-coded molecules described herein (e.g., in fig. 7-9), or probe-linked molecules, such as those described in U.S. patent publication No. 2020/0239874 and international publication No. WO 2019/165318, each of which is incorporated herein by reference in its entirety. In some examples, two sets of probe-associated molecules can be generated, wherein (i) the first probe-associated molecule comprises a nucleic acid molecule with which the first probe and the second probe hybridize (e.g., via hybridization of the first probe sequence to a first target region and hybridization of the second probe sequence to a second target region), and (ii) the second probe-associated molecule comprises a reporter oligonucleotide (comprising a reporter sequence) with which the third probe hybridizes.

The first probe, the second probe, and/or the third probe may comprise a probe capture sequence. The probe capture sequence on the first probe may be the same as or different from the probe capture sequence of the second probe or the third probe. Similarly, the probe capture sequence of the second probe may be the same as or different from the probe capture sequence of the third probe. Thus, the barcoding operations described herein may occur on the first probe, the second probe, the third probe, or any combination thereof. For example, for a molecule associated with a probe comprising a nucleic acid molecule and a first probe ("probe 1") and a second probe ("probe 2") hybridized thereto, a first barcode molecule comprising a first barcode sequence ("BC 1") may be hybridized to the first probe (e.g., directly or via a probe binding molecule) to produce a first barcoded nucleic acid molecule, and then a capture molecule comprising a second barcode sequence ("BC 2") may be annealed to a region of the first barcode molecule to produce a molecule comprising the sequence or complementary sequence of BC2-BC 1-probe 2. Alternatively or in addition, a first barcode molecule comprising a first barcode sequence ("BC 1") may be hybridized (e.g., directly or via a probe binding molecule) with a second probe to produce a first barcoded nucleic acid molecule, and then a capture molecule comprising a second barcode sequence ("BC 2") may be annealed to a region of the first barcode molecule to produce a molecule comprising a probe 1-probe 2-BC1-BC2 sequence. Alternatively or in addition, the barcode molecule and the capture molecule may anneal to different probes. For example, a first barcode molecule comprising a first barcode sequence ("BC 1") may be hybridized (e.g., directly or via a probe binding molecule) with a first probe to produce a first barcoded nucleic acid molecule, and then a capture molecule comprising a second barcode sequence ("BC 2") may be annealed to a second probe to produce a molecule comprising a BC 1-probe 2-BC2 sequence. Alternatively or in addition, a first barcode molecule comprising a first barcode sequence ("BC 1") may be hybridized (e.g., directly or via a probe binding molecule) with a second probe to produce a first barcoded nucleic acid molecule, and then a capture molecule comprising a second barcode sequence ("BC 2") may be annealed to the first probe to produce a molecule comprising a BC 2-probe 1-probe 2-BC1 sequence. It should be understood that while several examples of barcoding schemes are described herein, additional combinations and locations of barcode sequences are possible, for example, combined barcoding may be used to generate greater barcode diversity, as described herein, and such barcoding may occur on any probe molecule (or molecule to which a barcode has been added).

In some cases, the barcode molecule may comprise a capture binding sequence that is complementary to the capture sequences of the plurality of capture molecules. For example, the first probe may comprise a probe capture sequence that can hybridize to a probe binding molecule that can mediate hybridization of the barcode molecule (e.g., via hybridization of the barcode binding sequence of the probe binding molecule to the barcode capture sequence (e.g., a common sequence) of the barcode molecule). The barcode molecule may additionally comprise a capture binding sequence, which may allow the capture sequence of the capture molecule to hybridize to the barcode molecule.

FIG. 15 schematically illustrates an example barcoded nucleic acid molecule as described herein. Referring to fig. a, a nucleic acid molecule (e.g., RNA molecule) 1500 comprising a first target region 1502 and a second target region 1504 can be provided. The nucleic acid molecule 1500 can be contacted with a first probe 1506 comprising a first probe sequence 1508 and optionally a first probe capture sequence 1510. The first probe sequence 1508 may be complementary to the first target region 1502. In addition, in some cases, the first probe capture sequence 1510 can comprise functional sequences, such as primer sequences, partial primer sequences, barcode sequences, sequencing primer sequences, and the like. The nucleic acid molecule 1500 can also be contacted with a second probe 1516 comprising a second probe sequence 1514 and optionally a second probe capture sequence 1518. The second probe sequence 1514 may be complementary to the second target region 1504. The second probe capture sequence 1518 may additionally comprise a functional sequence. Hybridization of the first probe 1506 and the second probe 1516 to the nucleic acid molecule 1500 may produce a probe-associated molecule.

As described herein, the probe-associated molecules may be subjected to one or more barcode addition operations. Such barcoding operations may occur in one or more partitions (e.g., a first set of partitions), and may include hybridizing probe-binding molecules 1517 and barcode molecules 1519 comprising a barcode capture sequence (e.g., a common sequence) to molecules associated with the probes. In some cases, the probe-binding molecules 1517 and the barcode molecules 1519 may be provided as pre-annealed complexes, or they may be provided as separate molecules. The barcode capture sequence (e.g., the common sequence) may be a sequence common to a plurality of barcode molecules in the first set of partitions, or the common sequence may be unique to only barcode molecules in a single first partition (e.g., the common sequence differs between partitions of the first set of partitions). The probe binding molecule 1517 can comprise a probe binding sequence complementary to the probe capture sequence 1518 of the second probe 1516 and a barcode binding sequence complementary to the sequence of the barcode molecule 1519. The probe-associated molecules can be subjected to conditions sufficient to produce a first barcoded nucleic acid molecule, which can include annealing of the probe-binding molecule 1517 to (i) a probe capture sequence 1518 and (ii) a barcode capture sequence (e.g., a common sequence) of the barcode molecule 1519. The barcoding process may include additional operations, such as ligation, which may be performed chemically or enzymatically, as described elsewhere herein.

The first barcoded nucleic acid molecule or derivative thereof (e.g., complementary sequence, amplicon, extension product, combined barcoded nucleic acid molecule, as described elsewhere herein) can be subjected to a second barcoding operation. Such a second bar code addition operation may occur in a second set of partitions. For example, a first barcoded nucleic acid molecule can be removed from a first set of partitions, pooled (e.g., pooled with other barcoded nucleic acid molecules from other first partitions of the first set of partitions), and partitioned in a second partition of the second set of partitions. The second partition may include capture molecules 1520. The capture molecule 1520 may comprise a second barcode sequence and a sequence complementary to the probe capture sequence 1510 of the first probe 1506. The second barcode sequence may be a sequence common to a plurality of capture molecules in the second set of partitions, or the barcode sequence may be unique to only capture molecules in the second partition (e.g., different from partition to partition). The capture molecules 1520 can hybridize to the probe capture sequences 1510 to produce additional barcoded molecules (also referred to herein as "third barcoded nucleic acid molecules"). Additional barcoded molecules may include a sequence corresponding to a first barcode sequence (of the barcode molecule 1519) and a sequence corresponding to a second barcode sequence (of the capture molecule 1520).

Fig. 15, panel B, schematically illustrates another example barcoded molecule, in which a capture molecule 1520 hybridizes to a barcode molecule 1519. Similar to panel a, in panel B, a nucleic acid molecule (e.g., RNA molecule) 1500 comprising a first target region 1502 and a second target region 1504 can be provided. The nucleic acid molecule 1500 can be contacted with a first probe 1506 comprising a first probe sequence 1508 and a probe capture sequence 1510. The first probe sequence 1508 may be complementary to the first target region 1502. In addition, the probe capture sequence 1510 can comprise functional sequences, such as primer sequences, partial primer sequences, barcode sequences, sequencing primer sequences, and the like. The nucleic acid molecule 1500 can also be contacted with a second probe 1516 comprising a second probe sequence 1514 and optionally an additional sequence 1518. The second probe sequence 1514 may be complementary to the second target region 1504. The additional sequence 1518 may comprise, for example, a probe capture sequence or a functional sequence (e.g., primer binding site, sequencing primer sequence, etc.). Hybridization of the first probe 1506 and the second probe 1516 to the nucleic acid molecule 1500 may produce a probe-associated molecule.

The probe-associated molecules may be contacted with one or more barcode molecules. Such barcode addition operations may occur in multiple partitions (e.g., a first partition of a first set of partitions and/or a second partition of a second set of partitions), as described herein. The probe-associated molecules can be contacted with a probe-binding molecule 1517 and a barcode molecule 1519, which can comprise a first barcode capture sequence (e.g., a common sequence) and a second barcode capture sequence 1521 (also referred to herein as a "capture binding sequence"). In some cases, the probe-binding molecules 1517 and the barcode molecules 1519 may be provided as pre-annealed complexes or provided as separate molecules. The first barcode capture sequence (e.g., a common sequence) may be a sequence common to a plurality of barcode molecules in the first set of partitions, or the common sequence may be unique to only the barcode molecules in the first partition (e.g., different between partitions). The probe binding molecule 1517 can comprise a probe binding sequence that is complementary to the probe capture sequence 1510, and a barcode binding sequence that is complementary to a first barcode capture sequence (e.g., a common sequence) of the barcode molecule 1519. The probe-associated molecules can be subjected to conditions sufficient to produce a first barcoded nucleic acid molecule, which can include annealing of the probe-binding molecule 1517 to (i) the probe capture sequence 1510 and (ii) a first barcode capture sequence (e.g., a common sequence) of the barcode molecule 1519. The barcoding process may include additional operations, such as ligation, which may be performed chemically or enzymatically, as described elsewhere herein.

The first barcoded nucleic acid molecule or derivative thereof may be subjected to a second barcoding operation. Such a second bar code addition operation may occur in a second set of partitions. For example, a first barcoded nucleic acid molecule can be removed from a first partition and separated in a second partition of a second set of partitions (e.g., droplets). The second partition may include capture molecules 1520. The capture molecule 1520 may comprise a second barcode sequence and a sequence complementary to the second barcode capture sequence 1521 of the barcode molecule 1519. The second barcode sequence may be a sequence common to a plurality of capture molecules in the second set of partitions, or the barcode sequence may be unique to only capture molecules in the second partition (e.g., different from partition to partition). The capture molecule can hybridize to the second barcode capture sequence 1521 to produce an additional barcoded molecule (also referred to herein as a "third barcoded nucleic acid molecule"). Additional barcoded molecules may include a sequence corresponding to a first barcode sequence (of the barcode molecule 1519) and a sequence corresponding to a second barcode sequence (of the capture molecule 1520).

FIG. 15, panel C, shows another example barcoded nucleic acid molecule. A nucleic acid molecule (e.g., RNA molecule) 1500 comprising a first target region 1502 and a second target region 1504 can be provided. The nucleic acid molecule 1500 can be contacted with a first probe 1506 comprising a first probe sequence 1508 and optionally a first probe capture sequence 1510. The first probe sequence 1508 may be complementary to the first target region 1502. In addition, in some cases, the first probe or first probe capture sequence 1510 can comprise functional sequences, such as primer sequences, partial primer sequences, barcode sequences, sequencing primer sequences, and the like. The nucleic acid molecule 1500 can also be contacted with a second probe 1516 comprising a second probe sequence 1514 and optionally a second probe capture sequence 1518. The second probe sequence 1514 may be complementary to the second target region 1504. The second probe capture sequence 1518 may additionally comprise a functional sequence. Hybridization of the first probe 1506 and the second probe 1516 to the nucleic acid molecule 1500 may produce a probe-associated molecule or complex.

As described herein, the probe-associated molecules may be subjected to one or more barcode addition operations. Such barcoding operations may occur in one or more partitions (e.g., a first set of partitions), and may include hybridizing probe-binding molecules 1517 and barcode molecules 1519 comprising a barcode capture sequence (e.g., a common sequence) to molecules or complexes associated with the probes. In some cases, the probe-binding molecules 1517 and the barcode molecules 1519 are provided as pre-annealed complexes (e.g., partially double-stranded molecules comprising the probe-binding molecules 1517 and the barcode molecules 1519), or they may be provided as separate molecules that may be annealed separately to the probe-associated molecules or complexes (e.g., the probe-binding molecules 1517 may hybridize to the probe-associated molecules or complexes, e.g., via the second probe capture sequences 1518, and the barcode molecules 1519 may hybridize to the probe-binding molecules 1517). The barcode capture sequence (e.g., the common sequence) may be a sequence common to a plurality of barcode molecules in the first set of partitions, or the common sequence may be unique to only barcode molecules in a single first partition (e.g., the common sequence differs between partitions of the first set of partitions). The probe binding molecule 1517 can comprise a probe binding sequence complementary to the probe capture sequence 1518 of the second probe 1516 and a barcode binding sequence complementary to the sequence of the barcode molecule 1519. In some cases, the probe binding molecule 1517 and/or the barcode molecule 1519 comprises additional sequences, such as an adapter sequence, a primer sequence (e.g., a sequencing primer sequence or a partial sequencing primer sequence), UMI, a sample index sequence, and the like. In some cases, the probe-binding molecule 1517 comprises the complete sequence of the barcode molecule 1519, thus leaving no overhangs. In some cases, the probe binding molecules 1517 and the barcode molecules 1519 comprise sample index sequences that can be used to identify the partition, cell, nucleus, or cell bead from which the target nucleic acid molecule 1500 originates. The probe-associated molecules can be subjected to conditions sufficient to produce a first barcoded nucleic acid molecule, which can include annealing of the probe-binding molecule 1517 to (i) a probe capture sequence 1518 and (ii) a barcode capture sequence (e.g., a common sequence) of the barcode molecule 1519. The barcoding process may include additional operations, such as ligation (e.g., ligation of the barcode molecule 1519 to the probe capture sequence 1518), which may be performed chemically or enzymatically, as described elsewhere herein.

The first barcoded nucleic acid molecule or derivative thereof (e.g., complementary sequence, amplicon, extension product, combined barcoded nucleic acid molecule, as described elsewhere herein) can be subjected to a second barcoding operation. Such a second bar code addition operation may occur in a second set of partitions. For example, a first barcoded nucleic acid molecule can be removed from a first set of partitions, pooled (e.g., pooled with other barcoded nucleic acid molecules from other first partitions of the first set of partitions), and partitioned in a second partition of the second set of partitions. The second partition may include capture molecules 1520. The capture molecule 1520 may comprise a second barcode sequence and a sequence complementary to the probe capture sequence 1510 of the first probe 1506 (and/or the second probe 1516). The second barcode sequence may be a sequence common to a plurality of capture molecules in the second set of partitions, or the barcode sequence may be unique to only capture molecules in the second partition (e.g., different from partition to partition). The capture molecules 1520 can hybridize to the probe capture sequences 1510 to produce additional barcoded molecules (also referred to herein as "third barcoded nucleic acid molecules"). Additional barcoded molecules may include a sequence corresponding to a first barcode sequence (of the barcode molecule 1519) and a sequence corresponding to a second barcode sequence (of the capture molecule 1520).

In addition to barcoding nucleic acid molecules, the present disclosure also provides methods of multiplex analysis (e.g., processing additional biomolecule types, such as proteins and peptides). The method can include providing a characteristic binding group (e.g., an antibody, a protein, a binding moiety, etc.), which can be coupled to or bind to a characteristic (e.g., a protein, a peptide) of a cell, a cell nucleus, or a cell bead. Such methods may include providing a cell, nucleus, or cell bead having a feature of interest (e.g., a protein), and contacting the cell, nucleus, or cell bead with a feature binding group. The feature binding group may be coupled to a feature of interest. The feature binding group may comprise a reporter oligonucleotide comprising a reporter sequence coupled thereto, which reporter oligonucleotide may be specific for a particular feature and thus may be used to identify that feature. For example, the characteristic binding group may be an antibody, and the reporter oligonucleotide may comprise a reporter sequence that identifies the antigen or binding portion (e.g., epitope fragment) to which the antibody is coupled or bound. Alternatively or in addition, the characteristic binding group may comprise a characteristic probe binding sequence, which may be used for downstream probe binding and/or barcode addition. After the cells (nuclei or cell beads) are contacted with the characteristic binding group, the cells/nuclei/cell beads may comprise a characteristic coupled to the characteristic binding group.

In some cases, the methods described herein can additionally include providing a cell, nucleus, or cell bead comprising (i) a nucleic acid molecule comprising a first target region and a second target region and (ii) a feature coupled to a feature binding group, and contacting the cell, nucleus, or cell bead with a plurality of probes. The cell/nucleus/cell beads may be contacted with the first probe, the second probe, and the third probe (e.g., in a first zone). As described herein, the first probe and the second probe can be associated with a first target region and a second target region of a nucleic acid molecule, thereby producing a first probe-associated molecule. Similarly, the third probe can be associated (e.g., via hybridization) with the characteristic binding group, thereby producing a second probe-associated molecule. In some cases, the third probe may comprise a third probe sequence that is complementary to the characteristic probe binding sequence, and in some cases, the third probe may additionally comprise a probe capture sequence. The first probe and/or the second probe may further comprise a probe capture sequence, which may be the same as or different from the probe capture sequence of the third probe.

In the first set of partitions, a barcode may be added to molecules associated with the first probe (e.g., a nucleic acid molecule having the first and second probes associated therewith) and molecules associated with the second probe (e.g., a characteristic binding group having the third probe associated therewith). Such barcoding operations may include, for example, providing a barcode molecule comprising a first barcode sequence and a barcode capture sequence (such as a common sequence), which may hybridize, for example, via a probe capture sequence, directly to a molecule associated with a first probe and a molecule associated with a second probe. Alternatively or in addition, a probe binding molecule may be provided for a barcode molecule comprising (i) a probe binding sequence complementary to a probe capture sequence of the first, second and/or third probe, and (ii) a barcode binding sequence that may be complementary to a common sequence of the barcode molecule. In some cases, the probe-binding molecules and the barcode molecules may be provided as pre-annealed complexes. The barcoding of the first probe-associated molecule and the second probe-associated molecule may include hybridization of a barcode molecule (e.g., a barcode capture sequence such as a common sequence) to a portion of the first probe-associated molecule and the second probe-associated molecule (e.g., a probe capture sequence), or the barcoding may include hybridization of a barcode molecule to a probe-binding molecule and hybridization of a probe-binding molecule to either the first probe-associated molecule or the second probe-associated molecule. Additional operations such as ligation (e.g., enzymatic ligation or chemical ligation) may be performed to produce a first barcoded molecule and a second barcoded molecule.

The first and second barcoded molecules may be subjected to additional barcoding operations, for example, in a second set of partitions. Such additional barcoding operations may include contacting a first barcoded nucleic acid molecule or derivative thereof with a first capture molecule of a plurality of capture molecules to produce a third barcoded nucleic acid molecule, and contacting a second barcoded nucleic acid molecule or derivative thereof with a second capture molecule of the plurality of capture molecules to produce a fourth barcoded nucleic acid molecule. The capture molecules within a partition may each comprise a second barcode sequence, which may be unique to the partition (e.g., different from partition to partition). Thus, both the third and fourth barcoded nucleic acid molecules may comprise a first barcode sequence (or its complement) and a second barcode sequence (or its complement).

FIG. 16A schematically illustrates an example workflow for bar coding multiple analytes for a cell, nucleus or cell bead. The cell, nucleus, or cell bead 1600 can comprise a nucleic acid molecule (e.g., an RNA molecule or other target nucleic acid molecule) 1601 that comprises a first target region 1602 and a second target region 1604. The cell, nucleus, or cell bead may further comprise a feature (e.g., a protein, such as a cell surface receptor (or nuclear membrane protein) or an intracellular/nuclear protein) 1650. In some cases, the cells, nuclei, or cell beads 1600 may be treated, e.g., immobilized, permeabilized, treated with a treatment method, etc. In some cases, such treatment may include providing one or more feature binding groups (e.g., antibodies, antibody fragments, etc.) 1652, which may be coupled to the feature 1650. The characteristic binding group 1652 may comprise or be coupled to a reporter oligonucleotide 1657, which may comprise a reporter sequence 1654. The reporter sequence 1654 can indicate a feature binding group 1652 or a feature 1650. For example, the reporting sequence 1654 may be pre-indexed or assigned to a particular antibody or other feature binding group such that the presence of the reporting sequence 1654 indicates the presence of a particular feature 1650 in the sample. The characteristic binding group 1652 or reporter oligonucleotide 1657 may also comprise or be coupled to a characteristic probe binding sequence 1656. In some cases, the cell, nucleus, or cell bead 1600 may be contacted and immobilized with the characteristic binding group 1652, e.g., in addition to or in lieu of the immobilization and permeabilization operations prior to contact.

In some cases, analysis of intracellular and/or nuclear proteins and membrane proteins of cells (or nuclei) may be performed. In one embodiment, permeabilized (and optionally immobilized) cells (or nuclei) can be contacted with (i) one or more feature binding groups (or labeling reagents) configured to couple to an intracellular protein (or an intracellular protein), and/or (ii) one or more feature binding groups (or labeling reagents) configured to couple to a cell membrane protein (or a nuclear membrane protein). As further described herein, permeabilization may involve partial or complete dissolution or disruption of the cell membrane (or nuclear membrane) or a portion thereof. Permeabilization can be achieved, for example, by contacting the cell membrane (or nuclear membrane) with an organic solvent (e.g., methanol) or detergent (such as Triton X-100 or NP-40). As described elsewhere herein, the cells, nuclei, or cell beads may be immobilized.

Referring again to fig. 16A, a second feature binding group (or labeling reagent) (not shown) similar to 1652 can be used to couple to an intracellular feature, such as an intracellular protein, and to include or couple to a second reporter oligonucleotide, which can include a second reporter sequence. The second reporter sequence may be indicative of a second feature binding group or an intracellular feature. For example, the second reporter sequence may be pre-indexed or assigned to a particular antibody or other feature binding group such that the presence of the second reporter sequence indicates the presence of a particular intracellular feature in the sample. The second characteristic binding moiety or second reporter oligonucleotide may also comprise or be coupled to a second characteristic probe binding sequence similar to 1656.

The cell, cell nucleus, or cell bead 1600 can be contacted with the first probe 1606, second probe 1616, and third probe 1658 under conditions sufficient to produce a first probe-associated molecule (or probe-associated complex) 1630 and a second probe-associated molecule (or probe-associated complex) 1665. As described elsewhere herein, the first probe-associated molecule 1630 can be or comprise a probe-linked molecule. For example, first probe-associated molecule 1630 (or probe-linked molecule) can be any probe-associated molecule or probe-linked molecule described herein (e.g., a molecule resulting from an extended probe, a barcoded extended probe, etc.). The first probe 1606 may comprise a first probe sequence 1608 and optionally a probe capture sequence 1610. The first probe sequence 1608 may be complementary to the first target region 1602. The second probe 1616 can comprise a second probe sequence 1615 and optionally a probe capture sequence 1618. The second probe sequence 1615 may be complementary to the second target region 1604. Third probe 1658 can comprise third probe sequence 1660 and probe capture sequence 1662. Third probe sequence 1660 can be complementary to feature probe binding sequence 1656. In some cases, the probe capture sequence 1662 is the same probe capture sequence as the probe capture sequences 1610, 1618 of the first and/or second probes, respectively.

In one embodiment, the cell, cell bead or cell nucleus 1600 may be further contacted with additional probes under conditions that produce additional probe-associated molecules or probe-associated complexes. As described elsewhere herein, the additional probe-associated molecule may be or comprise a probe-linked molecule. For example, the additional probe-associated or probe-linked molecules can be any of the probe-associated or probe-linked molecules described herein (e.g., molecules resulting from extended probes, bar-coded extended probes, etc.). In one embodiment, the cell (or cell bead or nucleus) 1600 may be further contacted with a fourth probe (not shown) similar to 1658 comprising (i) a fourth probe sequence similar to 1660 and (ii) a fourth probe capture sequence similar to 1662. The fourth probe sequence may be complementary to the second characteristic probe binding sequence, as further described herein. In some cases, the fourth probe capture sequence is the same probe capture sequence as the probe capture sequences 1610, 1618 of the first and/or second probes, respectively.

In one embodiment, cells, nuclei, or cell beads 1600 may be partitioned into a first of a first set of partitions prior to any of the above-described processing operations including, but not limited to, immobilization, permeabilization, contact with probes, and creation of probe-associated or probe-linked molecules. In another embodiment, the cells, nuclei, or cell beads 1600 may be immobilized and optionally permeabilized prior to separation in the first partition, and then subjected to a subsequent treatment in the first partition, e.g., contacting with a probe and producing a molecule.

In operation 1670, the cell, nucleus, or cell bead 1600 comprising the first probe-associated molecule 1630 and the second probe-associated molecule 1665 may be partitioned into or further processed in a first partition of the first set of partitions. In another embodiment, the cell, cell bead, or cell nucleus 1600 may also comprise additional probe-associated molecules or complexes. For example, referring to fig. 16a,1600 can comprise a complex (not shown) associated with a third probe similar to 1665, but comprising (i) a fourth probe having a fourth probe sequence complementary to the second characteristic probe binding sequence, and (ii) a reporter oligonucleotide (similar to 1657), as further described herein. the reporter oligonucleotide may be provided as part of or coupled to a second feature binding group, e.g., a feature binding group configured to be coupled to an intracellular protein. In some cases, the cells, nuclei, or cell beads 1600 may be subjected to a treatment, such as lysis, within the partition to release cell/nucleus components (e.g., first probe-associated molecules and second probe-associated molecules) within the partition. Alternatively, the cells, nuclei, or cell beads 1600 may remain intact. In one embodiment, the cell beads are treated to release cellular components while leaving the cell beads intact. Within the first partition, probe binding molecules 1617 and barcode molecules 1619 may be provided. First probe-associated molecule 1630 and second probe-associated molecule 1665 can be contacted with one or more probe-binding molecules 1617 and barcode molecules 1619. In some examples, the first partition further comprises one or more additional probe-associated molecules or complexes (not shown) similar to 1665. The additional probe-associated complex may comprise a third probe-associated complex as described above comprising a fourth probe and a reporter oligonucleotide for a second characteristic binding group, e.g., a characteristic binding group configured to couple to an intracellular protein. Additional probe-associated complexes (such as third probe-associated complexes) can be contacted with one or more probe-binding molecules 1617 and barcode molecules 1619. In one embodiment, the contacting of the cells, nuclei, or cell beads 1600 in the first partition with one or more probe-binding molecules may be performed simultaneously with the contacting of probes (e.g., 1606, 1616, 1658, and optionally fourth probes) as described above. The barcode molecule 1619 may include a barcode capture sequence or a common sequence common to a plurality of barcode molecules and a first barcode sequence common to a first partition of the first set of partitions. In some cases, the nucleic acid barcode molecule may be coupled to a bead, such as a gel bead or other support as described herein, and may include additional functional sequences including, but not limited to, unique Molecular Identifiers (UMIs), capture sequences, primer sequences (e.g., R1/R2 sequences), additional barcode sequence fragments, and the like. The probe binding molecule 1617 may comprise a probe binding sequence complementary to any one or combination of the probe capture sequences 1610, 1618, 1662, fourth probe capture sequences, and a barcode binding sequence complementary to a common sequence of the barcode molecule 1619. In some cases, the probe-binding molecules 1617 and the barcode molecules 1619 may be provided as pre-annealed complexes. The probe binding molecules 1617 and barcode molecules 1619 can hybridize (e.g., via hybridization of the probe binding molecules 1617 to the probe capture sequences 1610, 1618, 1662 and fourth probe capture sequences) to the first probe-associated molecules 1630 and the second probe-associated molecules 1665 and/or additional probe-associated complexes (such as third probe-associated complexes), thereby producing first and second and optionally additional barcode-added nucleic acid molecules. Additional processing may occur within the first partition, for example, the ligation of bar code molecules 1619 to probes (1606, 1616, 1658, or fourth probes). In an additional embodiment, additional probe-associated complexes (e.g., a third probe-associated complex (not shown)), probe-binding molecules 1617, and barcode molecules 1619 are used to generate additional barcoded nucleic acid molecules.

In operation 1680, the contents of each partition or a subset of the first set of partitions may be collected from the first set of partitions, e.g., from operation 1670, and re-partitioned into the second set of partitions. The contents of the first set of partitions may comprise cells, nuclei or cell beads 1600 and/or treated cells or nuclear components, such as a first barcoded nucleic acid molecule, a second barcoded nucleic acid molecule, and optionally additional barcoded nucleic acid molecules. The contents of each partition of the first set of partitions may be combined together and redistributed to the second set of partitions. Thus, the second partition of the second set of partitions may include cells, nuclei or cell beads 1600 and/or treated cell/nucleus components. In some cases, the cells, nuclei, or cell beads 1600 may be subjected to a treatment, such as lysis, within the second partition to release the cell/nucleus components (e.g., the first barcoded nucleic acid molecule, the second barcoded nucleic acid molecule, and optionally additional barcoded nucleic acid molecules) within the second partition. Alternatively, the cells, nuclei, or cell beads 1600 may remain intact. Within the second partition, a plurality of capture molecules 1620 may be provided. In some cases, multiple capture molecules 1620 may be coupled to a support (e.g., a particle, bead, gel bead, etc.). In some cases, the plurality of capture molecules 1620 may be releasably coupled to the support, and the plurality of capture molecules 1620 may be released in the second partition. The capture molecules 1620 may each comprise a second barcode sequence, which may be the same sequence as the first barcode sequence (of the barcode molecule 1619) or a different sequence. The second barcode sequence may be unique to the second partition and different from the second barcode sequences of other partitions of the second set of partitions. The first and second barcoded nucleic acid molecules may each be contacted with a capture molecule 1620. Capture molecule 1620 may comprise a second barcode capture sequence, which may be complementary to the sequence of barcode molecule 1619. Hybridization of capture molecule 1620 to the first and second barcoded nucleic acid molecules may be sufficient to produce a third and fourth barcoded nucleic acid molecules. In addition, hybridization of capture molecule 1620 to additional barcoded nucleic acid molecules (e.g., from additional reporter oligonucleotides 1657 on additional feature binding groups 1652) may be sufficient to produce fifth barcoded nucleic acid molecules. Alternatively, hybridization of capture molecule 1620 to the first and second barcoded molecules may be sufficient to couple the capture molecule (comprising the second barcode sequence) to both the first and second barcoded molecules. Furthermore, hybridization of capture molecule 1620 to additional barcoded nucleic acid molecules may be sufficient to couple capture molecules (comprising a second barcode sequence) to additional barcoded nucleic acid molecules. Optionally, further processing may be performed, for example, ligating capture molecules 1620 to the first and second barcoded nucleic acid molecules (and optionally additional barcoded nucleic acid molecules). After ligation, the first and second barcoded nucleic acid molecules may comprise capture molecules 1620. The third barcoded nucleic acid molecule, the fourth barcoded nucleic acid molecule, and the fifth barcoded nucleic acid molecule may each comprise a sequence corresponding to the first barcode sequence and a sequence corresponding to the second barcode sequence. In some cases, an extension reaction (e.g., from capture molecule 1620 toward reporter oligonucleotide sequence 1657) is performed to produce a fourth barcoded molecule and/or a fifth barcoded nucleic acid molecule. FIG. 16B schematically illustrates another example workflow for barcoding multiple analytes for a cell, nucleus, or cell bead. In such examples, the workflow for processing nucleic acid molecules (e.g., RNA molecules) may be substantially similar to the workflow depicted in fig. 16A, but the workflow for processing features (e.g., proteins) may be different. For example, the characteristic binding group 1652 or the reporter oligonucleotide 1657 may comprise a binding sequence capable of hybridizing to the probe binding molecule 1617 and/or the barcode molecule 1619.

As described herein, permeabilized (and optionally immobilized) cells or nuclei can be contacted with one or more feature binding groups 1652, which can (a) comprise a reporter oligonucleotide 1657 and (b) be configured to couple to (i) an intracellular protein (or an intracellular protein) or (ii) a cell membrane protein (or a nuclear membrane protein). In some embodiments, the one or more feature binding groups 1652 include (i) a first feature binding group comprising a reporter oligonucleotide 1657 and configured to be coupled to an intracellular protein (or an intracellular protein), and (ii) a second feature binding group comprising a reporter oligonucleotide 1657 and configured to be coupled to a cell membrane protein (or a nuclear membrane protein).

In operation 1670, the cell, nucleus, or cell bead 1600 comprising the first probe-associated molecule 1630 and the one or more feature-binding groups 1652 may be partitioned into or further processed in a first partition of the first set of partitions. Within the first partition, probe binding molecules 1617 and barcode molecules 1619 may be provided. The characteristic binding groups 1652 coupled to the reporter oligonucleotide 1657 (e.g., one or more characteristic binding groups configured to couple to an intracellular protein or an intracellular protein) can be contacted with one or more probe binding molecules 1617 and barcode molecules 1619. The barcode molecule 1619 may include a barcode capture sequence or a common sequence common to a plurality of barcode molecules and a first barcode sequence common to a first partition of the first set of partitions. In some cases, the nucleic acid barcode molecule may be coupled to a bead, such as a gel bead or other support as described herein, and may include additional functional sequences including, but not limited to, unique Molecular Identifiers (UMIs), capture sequences, primer sequences (e.g., R1/R2 sequences), additional barcode sequence fragments, and the like. The probe binding molecule 1617 may comprise a probe binding sequence that is complementary to the sequence of the reporter oligonucleotide 1657. In some cases, the probe-binding molecules 1617 and the barcode molecules 1619 may be provided as pre-annealed complexes. The probe-binding molecule 1617 and the barcode molecule 1619 can hybridize to the molecule 1630 (described above) associated with the first probe and the reporter oligonucleotide 1657 (e.g., to the sequence of the reporter oligonucleotide 1657 via the probe-binding molecule 1617), thereby producing a first barcode-bearing nucleic acid molecule and a second barcode-bearing nucleic acid molecule. Additional reporter oligonucleotides 1657 from additional feature binding groups 1652 (e.g., configured to couple to cell or nuclear membrane proteins and/or intracellular or nuclear proteins) can be used to generate additional barcoded nucleic acid molecules. Additional processing may occur within the first partition, for example, ligation of the barcode molecule 1619 to the probe (1606, 1616) or reporter oligonucleotide 1657.

In operation 1680, the contents of each partition or a subset of the first set of partitions may be collected from the first set of partitions, e.g., from operation 1670, and re-partitioned into the second set of partitions. The contents of the first set of partitions may comprise cells, nuclei or cell beads 1600 and/or treated cell/nucleus components, such as a first barcoded nucleic acid molecule, a second barcoded nucleic acid molecule, and optionally additional barcoded nucleic acid molecules. The contents of each partition of the first set of partitions may be combined together and redistributed to the second set of partitions. Thus, the second partition of the second set of partitions may include cells, nuclei, or cell beads 1600 and/or treated cell/nucleus components (e.g., bar coded products). In some cases, the cells, nuclei, or cell beads 1600 may be subjected to a treatment, such as lysis, within the second partition to release the cell/nucleus components (e.g., the first barcoded nucleic acid molecule, the second barcoded nucleic acid molecule, and optionally additional barcoded nucleic acid molecules) within the second partition. Alternatively, the cells, nuclei, or cell beads 1600 may remain intact. Within the second partition, a plurality of capture molecules 1620 may be provided. In some cases, multiple capture molecules 1620 may be coupled to a support (e.g., a particle, bead, gel bead, etc.). In some cases, the plurality of capture molecules 1620 may be releasably coupled to the support, and the plurality of capture molecules 1620 may be released in the second partition. the capture molecules 1620 may each comprise a second barcode sequence, which may be the same sequence as the first barcode sequence (of the barcode molecule 1619) or a different sequence. The second barcode sequence may be unique to the second partition and different from the second barcode sequences of other partitions of the second set of partitions. The first and second barcoded nucleic acid molecules may each be contacted with a capture molecule 1620. Capture molecule 1620 may comprise a second barcode capture sequence, which may be complementary to the sequence of barcode molecule 1619. Alternatively, capture molecules 1620 may comprise sequences that are complementary to additional probe-binding molecules (e.g., splint oligonucleotides, not shown), and probe-binding molecules may comprise sequences that are complementary to the sequences of barcode molecules 1619. Hybridization of capture molecule 1620 to the first and second barcoded molecules (or to additional probe-binding molecules that can hybridize to the first and second barcoded molecules) can be sufficient to produce a third and fourth barcoded nucleic acid molecules. Furthermore, hybridization of 1620 to additional barcoded nucleic acid molecules (e.g., from additional reporter oligonucleotides 1657 on additional feature binding groups 1652) may be sufficient to produce fifth barcoded nucleic acid molecules. Alternatively, hybridization of capture molecule 1620 to the first and second barcoded molecules may be sufficient to couple the capture molecule (comprising the second barcode sequence) to both the first and second barcoded molecules. Furthermore, hybridization of 1620 to an additional barcoded nucleic acid molecule can be sufficient to couple a capture molecule (comprising a second barcode sequence) to the additional barcoded nucleic acid molecule (e.g., generated by an additional reporter oligonucleotide 1657 on the additional feature binding group 1652). Optionally, further processing, such as an extension reaction, may be performed to attach capture molecule 1620 to the first barcoded nucleic acid molecule, the second barcoded nucleic acid molecule, and the optionally additional barcoded nucleic acid molecules. After ligation, the first and second barcoded nucleic acid molecules may comprise capture molecules 1620. The third barcoded nucleic acid molecule, the fourth barcoded nucleic acid molecule, and the fifth barcoded nucleic acid molecule may each comprise a sequence corresponding to the first barcode sequence and a sequence corresponding to the second barcode sequence. In some cases, an extension reaction (e.g., from capture molecule 1620 toward reporter oligonucleotide sequence 1657) is performed to produce a fourth barcoded molecule and/or a fifth barcoded nucleic acid molecule.

In some cases, a reporter oligonucleotide (comprising a reporter sequence) that features a binding group can be contacted with multiple probes. For example, it may be advantageous to contact a characteristic binding group with a pair of probes. In some cases, the reporter oligonucleotide comprises one or more characteristic probe binding sequences, which may comprise sequences complementary to the pair of probes. For example, referring to fig. 17, a cell, nucleus, or cell bead 1700 may comprise a feature (e.g., a protein, such as a cell/nuclear membrane protein or an intracellular/nuclear protein) 1750. Feature binding group 1752 can be coupled to feature 1750. The feature binding group 1752 can comprise or be coupled to an oligonucleotide comprising a reporter oligonucleotide (comprising a reporter sequence) 1754, and in some cases, an additional functional sequence, such as a primer sequence, sequencing primer sequence, UMI, and the like, as described elsewhere herein. Reporter oligonucleotide 1754 may comprise any number of target regions. For example, the reporter oligonucleotide 1754 may comprise two target regions to which the first probe 1757 and the second probe 1758 may hybridize. The two target regions may be contiguous or non-contiguous, and they may be disposed on the same strand of reporter oligonucleotide 1754. As described herein, the probes may comprise sequences complementary to the target region of reporter oligonucleotide 1754, and each probe may comprise other useful sequences. For example, a probe (e.g., first probe 1757 or second probe 1758) can comprise (i) a probe sequence (e.g., 1760) that is complementary to a target region of reporter oligonucleotide 1754, and (ii) a probe capture sequence 1762 that can be complementary to a sequence of probe binding molecule 1717 (also referred to as a splint or splint oligonucleotide). The probe binding molecules 1717 can also include sequences that are complementary to sequences (e.g., capture sequences) of the barcode molecules 1719. Such barcoding (e.g., hybridization of probe binding molecules 1717 and barcode molecules 1719 to probe capture sequences 1762) may occur in bulk or in a partition. In some embodiments, the barcoding may be performed in the absence of a probe binding molecule. For example, the barcode molecule 1719 may contain a sequence complementary to the probe capture sequence 1762 and anneal directly to the probe.

In some cases, after contacting the feature binding groups with probe molecules 1757 and 1758 (e.g., in bulk or in a partition), feature binding groups 1752 are subjected to conditions sufficient to hybridize the probe molecules to reporter oligonucleotide 1754, thereby producing a probe-associated reporter oligonucleotide complex. The coupling of the probe to the reporter oligonucleotide 1754 may occur in bulk or in a partition. In some cases, after the probes are coupled or hybridized to reporter oligonucleotide 1754, the probes may be ligated together (e.g., enzymatically or chemically) to create probe-ligated nucleic acid molecules (or complexes). For example, the first probes 1757 can comprise first reactive moieties and the second probes 1758 can comprise second reactive moieties. The reactive moieties may be positioned such that after hybridization of the first probe 1757 and the second probe 1758 to the reporter oligonucleotide 1754, the reactive moieties are adjacent. The reactive moiety may then be subjected to conditions sufficient to react them to produce a probe-linked nucleic acid molecule (or complex) comprising a first probe 1757 linked to a second probe 1758. In some cases, the probe comprises a "click chemistry" moiety. Alternatively or in addition, the first probe may be enzymatically linked (e.g., via a ligation reaction) to the second probe. In other cases, a gap region (not shown) may be provided between the first probe 1757 and the second probe 1758 after hybridization of the probes to the reporter oligonucleotide 1754. In such cases, the first probes 1757 can be connected to the second probes 1758 using gap filling methods (such as those described above).

The probe-linked nucleic acid molecules (or complexes) may then be subjected to barcoding (e.g., in contact with probe-binding molecules 1717 and barcode molecules 1719), which may occur in a partition. Alternatively, the bar coding may occur prior to probe ligation. For example, reporter oligonucleotide 1754 may be hybridized to a probe, separated, barcoded, and then the probe may be ligated. Alternatively, reporter oligonucleotide 1754 may be hybridized to a probe, ligated, partitioned, and then barcoded. In yet another example, reporter oligonucleotide 1754 may be hybridized to a probe, separated, ligated, and then barcoded. It should be appreciated that the operations described herein (e.g., hybridization, probe ligation, bar coding) may occur in any useful process or in any useful order. In some cases, multiple separation operations may be performed, for example, for combined addition of bar codes.

The reporter oligonucleotide can comprise the same target sequence (e.g., 702, 704, 802, 804, 902, 904, 1502, 1504, 1602, 1604, etc.) as a nucleic acid molecule (e.g., an RNA molecule). For example, referring to fig. 17, a first probe can have a first sequence that is complementary to both a first target sequence (e.g., 702, 802, 902, 1502, 1602) of a nucleic acid molecule and a first sequence of a reporter oligonucleotide 1754, and a second probe can have a second sequence that is complementary to both a second target sequence (e.g., 704, 804, 904, 1504, 1604) of a nucleic acid molecule and a second sequence of a reporter oligonucleotide 1754. In such cases, providing cells, nuclei, or cell beads with only two probe types (e.g., a first probe and a second probe) may be sufficient to produce a first barcoded molecule (e.g., produced from a nucleic acid molecule, e.g., an RNA molecule), a second barcoded molecule (e.g., produced from a reporter oligonucleotide that features a binding group (such as a group configured to couple to a cell/nuclear membrane protein), and an additional barcoded molecule (e.g., produced from a reporter oligonucleotide that additional features a binding group (such as a group configured to couple to an intracellular/nuclear protein). As described herein, each probe (e.g., first probe and second probe) may be capable of or configured to hybridize with a barcode molecule (e.g., in a first partition) and/or a capture molecule. As also described elsewhere herein, each probe may be multiplexed or combined with a barcode such that multiple partitions (e.g., partitions containing more than one cell, nucleus, or cell bead) may be deconvolved, e.g., to determine an initial partition or sample for each cell, nucleus, or cell bead within a partition (see, e.g., fig. 10). Similarly, the barcoded molecules can be used to determine the source of different analyte types (e.g., proteins, nucleic acid molecules, etc.), for example, two analyte types can be assigned to the same starting cell, nucleus, cell bead, sample, or partition.

In some cases, the reporter oligonucleotide comprises two or more target sequences that are different from the target sequences of the nucleic acid molecule (e.g., RNA molecule). Thus, four probe types may be provided for multiplex assays, a first probe and a second probe may hybridize to a first target region and a second target region of a nucleic acid molecule, and a third probe and a fourth probe may hybridize to a target region of a reporter oligonucleotide (e.g., a reporter oligonucleotide from a characteristic binding group, such as a characteristic binding group configured to couple to a cell/nuclear membrane protein). Additional probe types, such as a fifth probe and a sixth probe, may be provided that hybridize to a target region of an additional reporter oligonucleotide (e.g., a reporter oligonucleotide from a characteristic binding group, such as a characteristic binding group configured to couple to an intracellular/nuclear protein). Each probe or combination of probes may comprise a probe capture sequence, which may be used for subsequent barcoding. For example, each probe (e.g., first probe, second probe, third probe, fourth probe, fifth probe, sixth probe, or a combination thereof) may be capable of or configured to hybridize with a barcode molecule (e.g., in a first partition) and/or a capture molecule (e.g., in a second partition). As described elsewhere herein, each probe may be multiplexed or combined with a barcode such that multiple partitions (e.g., partitions containing more than one cell, nucleus, or cell bead) may be deconvolved, for example, to determine the starting partition or sample for each cell, nucleus, or cell bead within a partition (see, e.g., fig. 10). Similarly, the barcoded molecules can be used to determine the source of different analyte types (e.g., proteins, nucleic acid molecules, etc.), for example, two analyte types can be assigned to the same starting cell, nucleus, cell bead, sample, or partition.

As described elsewhere herein, a nucleic acid molecule (e.g., from a cell, a cell nucleus, or a cell bead, or a reporter oligonucleotide) may comprise one or more target regions. The one or more target regions may correspond to a gene or a portion thereof, or another known sequence. The target regions may have the same or different sequences and may be located within the same strand or on different strands. The target regions may be adjacent to each other or may be spatially separated along one strand of the nucleic acid molecule. The target regions may be located on the same strand or on different strands. Analyzing the two or more target regions may involve providing two or more probes, wherein a first probe has a sequence complementary to a first target region, a second probe has a sequence complementary to a second target region, and so on. As described elsewhere herein, a nucleic acid molecule can be a target nucleic acid molecule and can comprise any number of nucleic acid features or nucleotides.

As also described elsewhere herein, any probe (e.g., first probe, second probe, third probe, etc.), reporter oligonucleotide or barcode or capture molecule may comprise any number of additional adaptors or functional sequences, such as additional probe sequences, unique molecular identifiers, barcode sequences, primer sequences, capture sequences, sequencing primer sequences, and the like.

As described herein, one or more operations may be performed within a partition (such as a droplet or a well). For example, a nucleic acid molecule (e.g., an RNA molecule) and a feature (e.g., a protein) or a cell, nucleus, or cell bead comprising the nucleic acid molecule and the feature can be co-partitioned with one or more reagents (e.g., as described herein) at any useful stage of the method. For example, probe-linked or probe-associated nucleic acid molecules optionally contained in or on cells, nuclei, or cell beads may be produced in bulk solution or in a partition. Similarly, cells, nuclei, or cell beads may be contacted with the characteristic binding group in bulk solution or in a zone. The provision of probes (e.g., first probe, second probe, and third probe) may occur in bulk solution or in separate partitions. Where partitions are used, one partition (e.g., a first partition of a first set of partitions) may include a first probe, a second probe, a third probe, or a combination thereof. Different partitions within the first set of partitions may include the same or different probes (e.g., for different target sequences or different reporter sequences). Alternatively or in addition, the probe binding molecules and the nucleic acid barcode molecules may be provided in one partition. For example, a cell, nucleus or cell bead comprising the feature and nucleic acid molecule may be contacted with the probe in bulk and separated into a first set of partitions. The first set of partitions may include probe-binding molecules and nucleic acid barcode molecules comprising a common sequence. For example, additional partitions of the first set of partitions may include a plurality of barcode molecules, each having a barcode sequence that is unique to the partition (e.g., different between partitions). The partitions may include additional reagents for performing a nucleic acid reaction (e.g., digestion, ligation, extension, amplification). For example, a partition can include a ligase (linking enzyme) (e.g., ligase) that can be used to ligate a nucleic acid barcode molecule to a first probe, a second probe, or a third probe (e.g., via a probe capture sequence of each probe). In some cases, the probe binding molecule, probe capture sequence, and/or barcode capture sequence (e.g., a common sequence) comprises one or more reactive moieties that can be used to chemically link the nucleic acid barcode molecule to the probe capture sequence. The resulting barcoded products can include a first barcoded product comprising a sequence corresponding to a first target region, a sequence corresponding to a second target region, a sequence corresponding to a probe capture sequence of the first probe or the second probe, and a sequence corresponding to a barcode sequence, and a second barcoded product comprising a sequence corresponding to a reporter sequence, a probe capture sequence of the third probe (which can be the same or different from the probe capture sequence of the first probe or the second probe), and a barcode sequence.

As described herein, one or more of the processes described herein can be performed in a cell (e.g., a cell in solution, or a cell contained within a tissue sample), a nucleus, or a cell bead. For example, a plurality of cells, nuclei, or cell beads may comprise a plurality of nucleic acid molecules and features. The cells, nuclei or cell beads may be living or immobilized and/or permeabilized. In some cases, the cell, nucleus, or cell bead may be contacted with a characteristic binding group comprising a reporter sequence. The first probe, the second probe, and the third probe may also be provided to the cell, the nucleus, or the cell bead in a bulk solution or a partition to produce a first probe-associated molecule and a second probe-associated molecule. Optionally, the cells, nuclei or cell beads may be washed to remove unbound probes. Subsequently, the cells, nuclei, or cell beads comprising the probe-associated molecules can be partitioned into a plurality of separate partitions, wherein at least a subset of the plurality of separate partitions comprises a single cell, a single nucleus, or a single cell bead. The bar coding may be performed in a separate partition. As described herein, barcoding may include attaching or hybridizing a nucleic acid barcode molecule to a first probe-associated molecule and a second probe-associated molecule. Nucleic acid barcode molecules provided within each of the plurality of separate partitions may be provided attached to the beads. In some cases, the nucleic acid barcode molecules may be releasably attached to the beads (e.g., via labile bonds), as described elsewhere herein. Each partition (or subset of partitions) of the plurality of individual partitions may comprise a bead comprising a plurality of nucleic acid barcode molecules attached thereto (e.g., as described herein). The plurality of nucleic acid barcode molecules attached to each bead may comprise a unique barcode sequence such that each of the plurality of separate partitions comprises a different barcode sequence. Upon release of the component from a plurality of different partitions of the plurality of individual partitions (e.g., after barcoding), the barcoded molecules derived from the single cells, single cell nuclei, or single cell beads may have the same barcode sequence (e.g., a common barcode sequence) such that each barcoded nucleic acid molecule may be traced back to a given partition and/or in some cases, a single cell, single cell nucleus, or single cell bead. The released components may then be partitioned into a second set of partitions comprising capture molecules having a second barcode sequence, as described herein, such that different partitions in the second set of partitions have unique second barcode sequences.

The cells, nuclei, or cell beads described herein may be treated before, during, or after bar coding. For example, cells, nuclei, or cell beads may be immobilized or permeabilized at any useful point in time. In some cases, the cells, nuclei, or cell beads may be immobilized and permeabilized either before or after hybridization of the probes or before or after contact with the characteristic binding group. In some cases, the cells, nuclei, or cell beads may be immobilized and permeabilized prior to contact with the characteristic binding group, and then contacted with the probe. The immobilization or permeabilization process may be repeated. For example, cells, nuclei, or cell beads may be immobilized and permeabilized, contacted with the probe and feature binding group (either simultaneously or in a stepwise manner), and then immobilized again.

After immobilization and/or permeabilization, the cells, nuclei or cell beads can be stored for a period of time prior to further processing, e.g., contacting the cells, nuclei or cell beads with probes and/or feature binding groups. For example, the cells, nuclei, or cell beads may be immobilized and/or permeabilized, and then contacted with the probe and/or feature binding group after about 1 minute, 10 minutes, 30 minutes, 1 hour, 2 hours, 3 hours, 4 hours, 5 hours, 6 hours, 7 hours, 8 hours, 9 hours, 10 hours, 11 hours, 12 hours, 13 hours, 14 hours, 15 hours, 16 hours, 17 hours, 18 hours, 19 hours, 20 hours, 21 hours, 22 hours, 23 hours, 24 hours, or more. The cells, nuclei or cell beads may be immobilized and/or permeabilized and then contacted with the probes and/or the feature binding groups after about 1 day, 2 days, 3 days, 4 days, 5 days, 6 days, 7 days or more. Cells, nuclei or cell beads may be immobilized and/or permeabilized and then contacted with the probe and/or feature binding group after about 1 week, 2 weeks, 3 weeks, 4 weeks, 5 weeks, 6 weeks, 7 weeks, 8 weeks, 9 weeks, 10 weeks, 20 weeks, 30 weeks, 40 weeks, 50 weeks or more. Cells, nuclei or cell beads may be immobilized and/or permeabilized and then contacted with the probe and/or feature binding group after about 1 month, 2 months, 3 months, 4 months, 5 months, 6 months, 7 months, 8 months, 9 months, 10 months, 11 months, 12 months or more. The cells, nuclei or cell beads may be immobilized and/or permeabilized, and then contacted with the probes and/or the feature binding groups at any useful time, which may be in the range of, for example, about 2-5 weeks later, about 3-6 months later, about 1-2 years later, etc.

In some cases, the cells, nuclei, or cell beads may be frozen, for example, after fixation and/or permeabilization. Such freezing of cells, nuclei or cell beads may be used to store the sample for a longer duration (e.g., if the sample is stored for more than 1-2 weeks before the sample is contacted with the probe and/or feature binding moiety). For example, the cells, nuclei or cell beads may be immobilized, optionally permeabilized, then frozen for any useful duration, followed by contacting the cells, nuclei or cell beads with probes and/or characteristic binding groups. Alternatively, the cells, nuclei or cell beads may be immobilized, frozen and permeabilized before or after the cells, nuclei or cell beads are contacted with the probes and/or the feature binding groups. It will be appreciated that the freezing operation may be performed at any useful or convenient time, e.g., before, simultaneously with, or after immobilization, permeabilization, contact with a probe, contact with a feature binding group, etc.

The cells, nuclei or cell beads may be contacted with the probes and feature binding groups in the partition or in the bulk at any useful time. For example, the cell, nucleus or cell bead may be contacted with the probe before, during or after contact with the characteristic binding group. Contact with probes and/or feature binding groups can occur in bulk or in partitions (e.g., droplets, wells). In some cases, the cells, nuclei, or cell beads may be contacted (either simultaneously or in a stepwise manner) with the probes and feature binding groups, followed by the addition of a barcode in the partitions. In other cases, the cells, nuclei, or cell beads may be contacted with the probes and the characteristic binding members in a partition.

FIG. 29 illustrates an example workflow for processing cells according to the methods described herein. May be prepared, for example, in 4% formaldehyde and 0.01% tween-20 or a commercially available immobilization and permeabilization buffer (e.g., commercially availableFixation and permeabilization buffer) to fix and permeabilize the cells. In one example, the immobilized and permeabilized cell can be incubated with a first probe and a second probe to produce a first probe-associated molecule (e.g., a probe-associated RNA molecule). The cells can then be contacted with a characteristic binding group (e.g., an antibody) comprising a reporter oligonucleotide to produce cells comprising a characteristic coupled to the characteristic binding group. The subsequent bar code addition may be performed, for example, in a partition.

In some examples, the immobilized and permeabilized cell can be incubated with the characteristic binding group, optionally, the cell can be immobilized again, and then the cell can be contacted with the first probe and the second probe to produce a probe-associated molecule (e.g., a probe-associated RNA molecule). Alternatively, the immobilized and permeabilized cell can be incubated with the first probe and the second probe to produce a probe-associated molecule, and then the cell can be contacted with the characteristic binding group. The subsequent bar code addition may be performed, for example, in a partition.

In some cases, it may be useful to permeabilize the cells prior to contacting the cells with the probe or feature binding group (e.g., as a negative control). Thus, a cell may be immobilized, contacted with a probe and/or feature binding group, and then subsequently permeabilized. It should be understood that any order of immobilization, permeabilization, probe hybridization, contact with a feature binding group, etc., can be performed in any convenient or useful step and in any order, and any process can be repeated. For example, a cell, nucleus or cell bead may be contacted with a feature binding group, the cell, nucleus or cell bead immobilized and/or permeabilized, the cell, nucleus or cell bead contacted with an additional feature binding group (which may be beneficial for analysis of extracellular and intracellular peptides, polypeptides or proteins), and optionally, the cell, nucleus or cell bead immobilized again. Alternatively, the cells, nuclei, or cell beads may be immobilized and/or permeabilized, then contacted with a characteristic binding group (e.g., for an intracellular and/or extracellular analyte), and optionally re-immobilized. Before or after these processes, the cells, nuclei, or cell beads may be contacted with a set of probes (e.g., a first probe, a second probe, and/or a third probe). See also examples 8 and 9.

The methods, compositions, kits, and systems of the present disclosure can include providing methods for processing immobilized biological particles (e.g., cells, nuclei, or cell beads). In one embodiment, the method comprises a) immobilizing and permeabilizing the biological particle or providing an immobilized and permeabilized biological particle.

The method may further comprise b) contacting the immobilized and permeabilized biological particle with a first reagent configured to couple to an analyte of the biological particle. In one embodiment, the analyte is an intracellular analyte, such as a nucleic acid or polypeptide, and the biological particle is a cell. In another embodiment, the analyte is an intra-nuclear analyte, such as a nucleic acid or polypeptide, and the biological particle is a nucleus. The first reagent configured to be coupled to the analyte may be (i) a first reagent configured to be coupled to a nucleic acid (such as one or more nucleic acid probes as described herein) or (ii) a first reagent configured to be coupled to a peptide, polypeptide, or protein (such as one or more characteristic binding groups as described herein). In another embodiment, b) provides an immobilized and permeabilized biological particle, such as a cell or a cell nucleus, comprising a first agent coupled to an analyte (e.g., a nucleic acid or a polypeptide) of the biological particle.

The method may further comprise c) additional immobilization of the biological particles from b). In one embodiment, c) comprises additional immobilization of the biological particle from b), wherein the biological particle from b) comprises a first reagent configured to couple with an analyte of the biological particle. The first reagent may be coupled to an analyte (nucleic acid or polypeptide) of a biological particle (e.g., a cell or nucleus). The first reagent may be a reagent configured to couple to a nucleic acid analyte or a reagent configured to couple to a polypeptide. In one embodiment, c) comprises additional immobilization of the biological particle (such as a cell), wherein the cell comprises a first agent coupled to the polypeptide. In another embodiment, the polypeptide is an intracellular polypeptide.

The method can further include d) comprising contacting the biological particle (e.g., cell or nucleus) from c) (which has been initially immobilized and permeabilized, is contacted with or comprises a first reagent, and is otherwise immobilized) with a second reagent configured to couple to an analyte (e.g., a nucleic acid or polypeptide), wherein the second reagent is different from the first reagent, and/or the second reagent is configured to couple to an analyte different from the analyte to which the first reagent is configured to couple. In one embodiment, the first reagent is configured to be coupled to a polypeptide (such as one or more characteristic binding groups as described herein), and the second reagent is configured to be coupled to a nucleic acid (such as one or more nucleic acid probes as described herein). d) May comprise a first agent coupled to the polypeptide and a second agent coupled to the nucleic acid.

Any number of barcoding operations can be performed for a given nucleic acid molecule and/or feature binding group, for example, using a combinatorial barcoding (e.g., split cell) approach. As described herein, additional barcoding operations can be used, for example, to index nucleic acid molecules and features (e.g., proteins) to cells, nuclei, cell beads, samples, a partition, or multiple partitions. Such indexing may be used when a single partition is occupied by multiple cells, nuclei, or beads of cells. In some cases it may be advantageous to overload a partition such that one partition includes more than one cell, nucleus or cell bead, for example, it may be useful in some cases to overload a partition, for example, to overcome poisson load statistics in a partition and/or to prevent reagent wastage (e.g., caused by an unoccupied partition). Thus, such an index can be used to attribute (i) nucleic acid molecules and (ii) features (e.g., proteins) in a multiplex occupied partition to a starting cell, nucleus, cell bead, partition, sample, etc., as described elsewhere herein.

For example, the workflow provided in fig. 10 may be performed on nucleic acid molecules and features (e.g., proteins) within a cell, nucleus, or population of cell beads. In such an example, prior to operation 1010, a first cell, nucleus, or cell bead population 1002 can be contacted with a first probe, a second probe, and optionally a third probe (e.g., as shown in fig. 16A and 16B). The first and second probes can hybridize to a nucleic acid molecule to produce a first probe-associated molecule (or complex), and optionally, the third probe can hybridize to a reporter oligonucleotide (comprising a reporter sequence) or a characteristic probe-binding sequence of a characteristic binding group (e.g., a group configured to couple to a cell/nuclear membrane protein) to produce a second probe-associated molecule (or complex). Additional probes may be provided to hybridize to additional reporter oligonucleotides or characteristic probe binding sequences of additional characteristic binding groups (e.g., groups configured to couple to intracellular/nuclear proteins) of the first cell, nucleus, or population of cell beads, thereby producing additional probe-associated molecules. The second cell, nucleus or cell bead population 1004 may also be treated in the same manner, for example, with a fourth probe, a fifth probe and optionally a sixth probe. The fourth probe and the fifth probe can hybridize to a nucleic acid molecule of the second cell, cell nucleus, or cell bead population to produce a third probe-associated molecule, and optionally, the sixth probe can hybridize to a reporter oligonucleotide or a characteristic probe-binding sequence of a characteristic binding group of the second cell, cell nucleus, or cell bead population to produce a fourth probe-associated molecule. Additional probes may be provided to hybridize with additional reporter oligonucleotides or feature probe binding sequences of additional feature binding groups (e.g., groups configured to couple to intracellular/nuclear proteins) of a second cell, nucleus, or population of cell beads, thereby producing additional probe-associated molecules. As described herein, the first population of cells (or nuclei or cell beads) 1002 and the second population of cells (or nuclei or cell beads) 1004 can be barcoded with a first barcode sequence such that the first population of cells (or components thereof, such as first probe-associated molecules and second probe-associated molecules) 1002 has a first barcode sequence that is different from the second population of cells (or nuclei or cell beads or components within cells, nuclei or cell beads, such as third probe-associated molecules and fourth probe-associated molecules) 1004. In operation 1020, a first population of cells (or nuclei or cell beads) 1002 and a second population of cells (or nuclei or cell beads) 1004 may be combined to produce a mixture of cells (or nuclei or cell beads). In operation 1030, the mixture of cells (or nuclei or cell beads) may be partitioned into a second plurality of partitions. In some cases, the mixture of cells (or nuclei or cell beads) may be partitioned into a second plurality of partitions such that some of the second plurality of partitions include more than one cell (e.g., cell, nucleus, or cell bead multiplex partition). For example, a partition 1035 in the second plurality of partitions may include cells, nuclei, or cell beads ("cell a") from the first population of cells (or nuclei, or cell beads) 1002 and cells, nuclei, or cell beads ("cell B") from the second population of cells (or nuclei, or cell beads) 1004. The partition 1035 may include additional barcode sequences that may be unique to the partition. The cells (or nuclei or cell beads) in each partition may be subjected to additional barcoding operations to supplement the barcoded nucleic acid molecules with additional barcode sequences. In operation 1040, the barcoded nucleic acid molecules may be deconvolved using different barcode sequences (e.g., a first barcode sequence, a second barcode sequence, and additional barcode sequences) to identify the starting cell, cell nucleus, or cell bead. For example, a barcoded nucleic acid molecule comprising an additional barcode sequence from partition 1035 and a first barcode sequence from first cell population 1002 can be used to identify the barcoded nucleic acid molecule as being derived from cell a. Similarly, a barcoded nucleic acid molecule comprising an additional barcode sequence from partition 1035 and a second barcode sequence from second cell population 1004 can be used to identify the barcoded nucleic acid molecule as derived from cell B.

In some cases, the characteristic binding groups (e.g., characteristic binding groups configured to couple to intracellular/nuclear proteins and/or characteristic binding groups configured to couple to intracellular/nuclear proteins) may be pre-indexed to a partition. For example, rather than a characteristic binding group having a characteristic probe binding sequence that can hybridize to a probe (e.g., a third probe) and then be barcoded (e.g., as described in fig. 16A-B) with a barcode sequence identifying a cell, cell nucleus, cell bead, or partition, the characteristic binding group can be provided in a pre-indexed manner in the partition, e.g., using a barcode sequence unique to the partition. For example, after the intracellular nucleic acid molecules are barcoded, characteristic binding groups may be provided in subsequent operations of the method. For example, during a second barcoding operation (e.g., operation 1680 of fig. 16A-B), a feature binding group can be provided and contacted with (or released from) a feature 1650 of a cell, cell nucleus, or cell bead in the second partition. The characteristic binding group may comprise or hybridize to a barcode sequence specific for and different between the second partitions. Thus, instead of using first and second barcode sequences from the first and second partitions, respectively, to identify a partition, cell, nucleus, or cell bead, the barcode sequences may be used to index a feature binding group to a particular partition and back to the starting cell or cell bead.

In other examples, the feature binding moiety can be indexed to the partition by directly attaching or coupling the partition-specific barcode sequence to the feature binding moiety, thereby avoiding the use of a third probe. In such cases, the feature binding group may comprise or be coupled to a reporter oligonucleotide comprising a reporter sequence and an attachment sequence that may be used to attach the barcode molecule directly to the feature binding group. For example, the feature binding group can comprise a probe capture sequence (e.g., 1662), thereby eliminating the need for a third probe comprising the probe capture sequence. The probe capture sequence may then be barcoded, for example, with a first barcode sequence of the barcode molecule in a first partition and a second barcode sequence of the capture molecule in a second partition. In some cases, the attachment sequence may be used to hybridize a probe binding molecule (e.g., a splint molecule or splint oligonucleotide) that may be partially complementary to a barcode molecule (as described herein). For example, the attachment sequence of the reporter oligonucleotide may be used to hybridize a probe-binding molecule, which may be hybridized (or pre-annealed) to a barcode molecule, for example, in a first partition. A second barcode sequence from the capture molecule may be provided in the first partition or a different (e.g., second) partition, which may anneal to a portion of the first barcode molecule. In some cases, additional operations, such as extension, ligation, etc., are performed to produce a barcoded molecule comprising sequences corresponding to the first barcode sequence, the second barcode sequence, and the reporter sequence.

After partition-based barcoding, the contents of the partitions may be combined, and the barcoded molecules may be replicated or amplified by, for example, one or more amplification reactions, which may be isothermal in some cases. The amplification reaction may comprise a Polymerase Chain Reaction (PCR) and may involve the use of one or more primers or polymerases. The one or more primers can comprise one or more functional sequences (e.g., primer sequence/primer binding sequence, sequencing primer sequence (e.g., R1 or R2), partial sequencing primer sequence (e.g., partial R1 or partial R2), sequence configured to be attached to a flow cell of a sequencer (e.g., P5 or P7, or partial sequences thereof), etc.) and can facilitate addition of the one or more functional sequences to the extended nucleic acid molecule. The barcoded molecules or derivatives thereof can be detected via nucleic acid sequencing (e.g., as described herein).

In some aspects, provided herein are systems useful for barcoding nucleic acid molecules. The system may include any of the components described herein, e.g., a plurality of partitions (e.g., droplets, wells), which may be provided in any useful form, e.g., a microfluidic device, a multi-well array, or plate, etc. In some cases, the system may include a first set of partitions and a second set of partitions. The first set of partitions may be the same or different type of partition as the second set of partitions. For example, the first set of partitions may contain micropores and the second set of partitions may contain droplets. As another example, both the first set of partitions and the second set of partitions may include droplets. The system can include a nucleic acid barcode molecule, optionally coupled to a support (e.g., a particle, bead, gel bead, etc.). In some cases, the system can include any of the probes described herein, such as a first probe or probes, a second probe or probes, a third probe or probes, and any useful reaction components (e.g., for performing a nucleic acid reaction, e.g., extension, ligation, amplification, etc.). The system may include one or more feature binding groups. The characteristic binding groups may be the same or different across the partitions, e.g., the characteristic binding groups may comprise multiple antibodies that bind to different epitopes within a single partition, or the partitions may comprise different characteristic binding groups that bind to different epitopes or moieties. The system can include useful reaction components such as, in non-limiting examples, enzymes (e.g., ligases, polymerases, reverse transcriptases, restriction enzymes, etc.), nucleotide bases, and the like.

Also provided herein are compositions useful for systems and methods for barcoding a variety of analytes (e.g., nucleic acid molecules and proteins) (e.g., via nucleic acid molecules, such as reporter oligonucleotides, contained in or coupled to a characteristic binding group). The composition may comprise any of the probes described herein. For example, the composition may comprise a plurality of first probes, a plurality of second probes, a plurality of third probes, and/or a plurality of first probes, a plurality of second probes, and a plurality of third probes. A probe or set of probes may be designed to target a specific sequence or set of specific sequences. Such probes may be designed to have the same or different sequences within different partitions. For example, the first composition may comprise a first probe and a second probe designed to target two regions of a first gene, and the second composition may comprise a first probe and a second probe designed to target two regions of a second gene, the second gene being different from the first gene. Similarly, a third probe (or probe pair) may be designed to target a region of a reporter oligonucleotide (comprising a reporter sequence) or a characteristic probe binding sequence, which may be the same or different from partition to partition. The composition may comprise a nucleic acid barcode molecule and/or a probe binding molecule, which may optionally be provided coupled to a support (e.g., particle, bead). The composition may comprise a capture molecule, optionally coupled to a support. The composition may be part of or comprise a reaction mixture that may comprise reaction components or reagents, such as enzymes, nucleotide bases, catalysts, and the like.

System and method for compartmentalization of samples

In one aspect, the systems and methods described herein provide for compartmentalization, deposition, or separation of one or more particles (e.g., biological particles, macromolecular components of biological particles, beads, reagents, etc.) into discrete compartments or partitions (interchangeably referred to herein as partitions), wherein each partition maintains its own content separate from the content of the other partitions. The partitions may be emulsions or droplets in the wells. A partition may include one or more other partitions.

A partition may include one or more particles. A partition may include one or more types of particles. For example, the partitions of the present disclosure may include one or more biological particles and/or macromolecular components thereof. The partition may include one or more beads. The partition may comprise one or more gel beads. A partition may include one or more cell beads. The partition may comprise a single gel bead, a single cell bead, or both a single cell bead and a single gel bead. The partition may include one or more reagents. Alternatively, the partition may be unoccupied. For example, a partition may not include beads. The cell beads may be one or more of the biological particles and/or their macromolecular components encapsulated within a gel or polymer matrix, e.g. via polymerization of droplets containing the biological particles and a precursor capable of polymerizing or gelling. A unique identifier such as a bar code may be injected into the droplet, such as via a support (e.g., a bead), before, after, or simultaneously with droplet generation, as described elsewhere herein.

The methods and systems of the present disclosure may include methods and systems for generating one or more partitions (such as droplets). The droplets may comprise a plurality of droplets in an emulsion. In some examples, the droplets may include droplets in a colloid. In some cases, the emulsion may include a microemulsion or nanoemulsion. In some examples, the droplets may be generated by means of a microfluidic device and/or by subjecting the mixture of immiscible phases to agitation (e.g., in a vessel). In some cases, a combination of the mentioned methods may be used to form droplets and/or emulsions.

Droplets may be formed by mixing and/or agitating the immiscible phases to produce an emulsion. Mixing or stirring may include various stirring techniques such as vortexing, pipetting, flicking, or other stirring techniques. In some cases, mixing or stirring may be performed without the use of a microfluidic device. In some examples, the droplets may be formed by exposing the mixture to ultrasound or sonication. Systems and methods for producing droplets and/or emulsions by agitation are described in International application No. PCT/US20/17785, which is incorporated herein by reference in its entirety for all purposes.

A microfluidic device or platform comprising a network of microfluidic channels (e.g., on a chip) may be used to create partitions (such as droplets and/or emulsions) as described herein. Methods and systems for producing partitions (such as droplets), methods of encapsulating biological particles, methods of increasing droplet generation throughput, and various geometries, architectures, and configurations of microfluidic devices and channels are described in U.S. patent publication nos. 2019/0367997 and 2019/0064173, each of which is incorporated by reference herein in its entirety for all purposes.

In some examples, individual particles may be separated into discrete partitions by introducing a flow stream of particles in an aqueous fluid into a flow stream of a non-aqueous fluid or a reservoir such that droplets may be generated at a junction of the two streams/reservoirs (such as at a junction of a microfluidic device provided elsewhere herein).

The methods of the present disclosure may include producing zoned and/or encapsulated particles, such as biological particles, in some cases individual biological particles, such as single cells, nuclei, or cell beads. In some examples, the agent may be encapsulated and/or partitioned (e.g., co-partitioned with the biological particle) in a partition. Various mechanisms may be employed in separating the individual particles. One example may include a porous membrane through which an aqueous mixture of cells may be extruded into a fluid (e.g., a non-aqueous fluid).

The partitions may be flowable in the fluid flow. The partition may include, for example, microcapsules having an outer barrier surrounding an inner fluid center or core. In some cases, a partition may include a porous matrix capable of entraining and/or retaining material within its matrix. The partitions may be droplets of a first phase within a second phase, wherein the first phase and the second phase are immiscible. For example, the partitions may be droplets of an aqueous fluid within a non-aqueous continuous phase (e.g., an oil phase). In another example, the partitions may be droplets of a non-aqueous fluid within the aqueous phase. In some examples, the partitions may be provided in the form of a water-in-oil emulsion or an oil-in-water emulsion. A number of different containers are described, for example, in U.S. patent application publication No. 2014/0155295, which is incorporated herein by reference in its entirety for all purposes. Emulsion systems for producing stable droplets in a non-aqueous or oil continuous phase are described, for example, in U.S. patent application publication No. 2010/0105112, which is incorporated herein by reference in its entirety for all purposes.

Fluid properties (e.g., fluid flow rate, fluid viscosity, etc.), particle properties (e.g., volume fraction, particle size, particle concentration, etc.), microfluidic architecture (e.g., channel geometry, etc.), and other parameters may be adjusted to control the occupancy of the resulting zones (e.g., number of biological particles per zone, number of beads per zone, etc.). For example, zone occupancy may be controlled by providing a flow of water at a concentration and/or flow rate of particles. To create single multiparticulate zones, the relative flow rates of the immiscible fluids may be selected such that each of the zones may contain on average less than one multiparticulate to ensure that those occupied are predominantly single occupied. In some cases, a partition of the plurality of partitions may contain at most one biological particle (e.g., a bead, DNA, cell, or cellular material). In some embodiments, various parameters (e.g., fluidic characteristics, particle characteristics, microfluidic architecture, etc.) may be selected or adjusted such that a majority of the zones are occupied, e.g., only a small percentage of the unoccupied zones are allowed. The flow and channel architecture may be controlled to ensure that a given number of single occupied partitions is less than a certain level of unoccupied partitions and/or less than a certain level of multiple occupied partitions.

Fig. 1 shows an example of a microfluidic channel structure 100 for separating individual biological particles. The channel structure 100 may include channel segments 102, 104, 106, and 108 that communicate at a channel connection 110. In operation, a first aqueous fluid 112 comprising suspended biological particles (or cells) 114 may be transported along the channel segment 102 into the junction 110, while a second fluid 116, which is immiscible with the aqueous fluid 112, is delivered from each of the channel segments 104 and 106 to the junction 110 to create discrete droplets 118, 120 of the first aqueous fluid 112 that flow into the channel segment 108 and away from the junction 110. The channel segment 108 may be fluidly coupled to an outlet reservoir in which discrete droplets may be stored and/or harvested. The discrete droplets generated may include individual biological particles 114 (such as droplets 118). The discrete droplets produced may include more than one individual biological particle 114 (not shown in fig. 1). Discrete droplets may be free of biological particles 114 (such as droplets 120). Each discrete partition may keep its own contents (e.g., individual biological particles 114) separate from the contents of the other partitions.

The second fluid 116 may comprise an oil, such as a fluorinated oil, that includes a fluorosurfactant for stabilizing the resulting droplets (e.g., inhibiting subsequent coalescence of the resulting droplets 118, 120). Examples of particularly useful spacer fluids and fluorosurfactants are described, for example, in U.S. patent application publication No. 2010/0105112, which is incorporated herein by reference in its entirety for all purposes.

It should be appreciated that the channel segments described herein may be coupled to any of a variety of different fluid sources or receiving components, including reservoirs, pipes, manifolds, or other system fluid components. It should be understood that the microfluidic channel structure 100 may have other geometries. For example, a microfluidic channel structure may have more than one channel connection. For example, a microfluidic channel structure may have 2, 3, 4, or 5 channel segments each carrying particles (e.g., biological particles, cell beads, and/or gel beads), which meet at a channel junction. Fluid may be directed to flow along one or more channels or reservoirs via one or more fluid flow units. The fluid flow unit may include a compressor (e.g., providing positive pressure), a pump (e.g., providing negative pressure), an actuator, etc., to control the flow of fluid. The fluid may also or alternatively be controlled via an applied pressure differential, centrifugal force, electric pumping, vacuum, capillary or gravity flow, or the like.

The generated droplets may include two subsets of droplets, (1) occupied droplets 118 that contain one or more biological particles 114, and (2) unoccupied droplets 120 that do not contain any biological particles 114. Occupied droplets 118 can include single occupied droplets (with one biological particle) and multiple occupied droplets (with more than one biological particle). As described elsewhere herein, in some cases, each occupied partition of the majority of occupied partitions may include no more than one bio-particle, and some generated partitions may be unoccupied (of any bio-particle). In some cases, however, some occupied zones may include more than one biological particle. In some cases, the partitioning process may be controlled such that less than about 25% of the occupied partition contains more than one biological particle, and in many cases less than about 20% of the occupied partition has more than one biological particle, and in some cases less than about 10% or even less than about 5% of each partition of the occupied partition includes more than one biological particle.

In some cases, it may be desirable to minimize the generation of an excessive number of empty partitions in order to reduce costs and/or increase efficiency. While such miniaturization may be achieved by providing a sufficient number of bio-particles (e.g., bio-particles 114) at the partition connection 110 to ensure that at least one bio-particle is enclosed in a partition, a poisson distribution may be expected to increase the number of partitions comprising multiple bio-particles. Thus, where a single occupied partition is to be obtained, up to about 95%, 90%, 85%, 80%, 75%, 70%, 65%, 60%, 55%, 50%, 45%, 40%, 35%, 30%, 25%, 20%, 15%, 10%, 5% or less of the resulting partition may be unoccupied.

In some cases, the flow of one or more biological particles (e.g., in channel segment 102) or other fluids directed into the partition junction (e.g., in channel segments 104, 106) may be controlled such that, in many cases, no more than about 50% of the generated partitions, no more than about 25% of the generated partitions, or no more than about 10% of the generated partitions are unoccupied. These flows may be controlled so as to present a non-poisson distribution of single occupied partitions while providing a lower level of unoccupied partitions. The above ranges of unoccupied partitions can be achieved while still providing any of the single occupancy described above. For example, in many cases, partitions obtained using the systems and methods described herein can have multiple occupancy rates of less than about 25%, less than about 20%, less than about 15%, less than about 10%, and in many cases less than about 5%, while unoccupied partitions are less than about 50%, less than about 40%, less than about 30%, less than about 20%, less than about 10%, less than about 5%, or less than a percentage.

It should be understood that the above occupancy rates also apply to partitions comprising both biological particles and additional reagents, including but not limited to supports such as beads (e.g., gel beads) carrying barcoded nucleic acid molecules (e.g., oligonucleotides) (described with respect to fig. 2). Occupied partitions (e.g., at least about 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95%, or 99% occupied partitions) may include both supports (e.g., beads) and biological particles having a bar-coded nucleic acid molecule.

In another aspect, in addition to or instead of droplet-based separation, the biological particles may be encapsulated within a support comprising a shell, layer, or porous matrix in which one or more individual biological particles or small groups of biological particles are entrained. The support may include other reagents. Encapsulation of biological particles can be performed by a variety of processes. Such processes may combine an aqueous fluid containing biological particles with a polymer precursor material that may be capable of forming into a gel or other solid or semi-solid matrix upon application of a specific stimulus to the polymer precursor. Such stimuli can include, for example, thermal stimuli (e.g., heating or cooling), photo stimuli (e.g., by photo-curing), chemical stimuli (e.g., by crosslinking, polymerization initiation of precursors (e.g., by added initiators)), mechanical stimuli, or combinations thereof.

The preparation of the support comprising the biological particles (e.g., cells) can be performed by a variety of methods. For example, an air knife droplet or aerosol generator may be used to dispense droplets of precursor fluid into the gelling solution to form beads (e.g., gel beads) comprising individual biological particles or small groups of biological particles. Likewise, membrane-based encapsulation systems can be used to produce beads comprising encapsulated biological particles as described herein. Microfluidic systems of the present disclosure, such as shown in fig. 1, can be readily used to encapsulate biological particles (e.g., cells) as described herein. In particular, and with reference to fig. 1, an aqueous fluid 112 comprising (i) biological particles 114 and (ii) a polymeric precursor material (not shown) flows into a channel connection 110 where the aqueous fluid is separated into droplets 118, 120 by the flow of a non-aqueous fluid 116. In the case of the encapsulation method, the non-aqueous fluid 116 may also include an initiator (not shown) to cause polymerization and/or crosslinking of the polymer precursor to form a porous matrix comprising entrained biological particles. Examples of polymer precursor/initiator pairs include those described in U.S. patent application publication No. 2014/0378345, which is incorporated herein by reference in its entirety for all purposes.

In some cases, the encapsulated biological particles can be selectively released from the support, such as through the passage of time or upon application of a specific stimulus, which degrades the support sufficiently to allow release of the biological particles (e.g., cells) or other contents thereof from the support, such as into a partition (e.g., a droplet). See, for example, U.S. patent application publication No.2014/0378345, which is incorporated herein by reference in its entirety for all purposes.

The biological particles may be subjected to other conditions sufficient to polymerize or gel the precursor. Conditions sufficient to polymerize or gel the precursor may include exposure to heat, cooling, electromagnetic radiation, and/or light. The conditions sufficient to polymerize or gel the precursor may include any conditions sufficient to polymerize or gel the precursor. After polymerization or gelation, a polymer or gel may be formed around the biological particles. The polymer or gel may be diffusion permeable to chemical or biochemical agents. The polymer or gel may be diffusion impermeable to the macromolecular components of the biological particle. In this way, the polymer or gel may act to subject the biological particles to chemical or biochemical manipulations while spatially confining the macromolecular composition to the region of the droplet defined by the polymer or gel. The polymer or gel may comprise one or more of disulfide-crosslinked polyacrylamide, agarose, alginate, polyvinyl alcohol, polyethylene glycol (PEG) -diacrylate, PEG-acrylate, PEG-thiol, PEG-azide, PEG-alkyne, other acrylate, chitosan, hyaluronic acid, collagen, fibrin, gelatin, or elastin. The polymer or gel may comprise any other polymer or gel.

The polymer or gel may be functionalized to bind to a targeted analyte, such as a nucleic acid, protein, carbohydrate, lipid, or other analyte. The polymer or gel may polymerize or gel via a passive mechanism. The polymer or gel may be stable under alkaline conditions or at elevated temperatures. The polymer or gel may have mechanical properties similar to those of the beads. For example, the polymer or gel may have a similar size as the beads. The polymer or gel may have a mechanical strength (e.g., tensile strength) similar to that of the beads. The polymer or gel may have a density lower than the oil. The density of the polymer or gel may be substantially similar to the density of the buffer. The polymer or gel may have an adjustable pore size. The pore size may be selected, for example, to retain denatured nucleic acids. The pore size may be selected to maintain diffusion permeability to exogenous chemicals, such as sodium hydroxide (NaOH), and/or endogenous chemicals, such as inhibitors. The polymer or gel may be biocompatible. The polymer or gel may maintain or enhance cell viability. The polymer or gel may be biochemically compatible. The polymer or gel may be polymerized and/or depolymerized thermally, chemically, enzymatically, and/or optically.

The polymer may comprise poly (acrylamide-co-acrylic acid) crosslinked by disulfide bonds. The preparation of the polymer may involve a two-step reaction. In the first activation step, the poly (acrylamide-co-acrylic acid) may be exposed to an acylating agent to convert the carboxylic acid to an ester. For example, poly (acrylamide-co-acrylic acid) can be exposed to 4- (4, 6-dimethoxy-1, 3, 5-triazin-2-yl) -4-methylmorpholine hydrochloride (DMTMM). The polyacrylamide-co-acrylic acid may be exposed to other salts of 4- (4, 6-dimethoxy-1, 3, 5-triazin-2-yl) -4-methylmorpholinium. In the second crosslinking step, the ester formed in the first step may be exposed to a disulfide crosslinking agent. For example, the ester may be exposed to cystamine (2, 2' -dithiobis (ethylamine)). After these two steps, the biological particles may be surrounded by polyacrylamide chains linked together by disulfide bridges. In this way, the biological particles may be encapsulated within or comprise a gel or matrix (e.g., a polymer matrix) to form "cell beads".

The cell beads can comprise biological particles (e.g., cells) or macromolecular components of biological particles (e.g., RNA, DNA, proteins, etc.). The cell beads may comprise a single cell or a plurality of cells, or a derivative of a single cell or a plurality of cells. For example, after lysing and washing the cells, the inhibitory components of the cell lysate may be washed away and the macromolecular components may be bound as cell beads. The systems and methods disclosed herein may be applicable to cell beads (and/or droplets or other partitions) comprising biological particles and cell beads (and/or droplets or other partitions) comprising macromolecular components of biological particles. The cell beads may be or include cells, cell derivatives, cell material, and/or cell-derived material in, within, or encapsulated in a matrix, such as a polymer matrix. In some cases, the cell beads may comprise living cells. In some cases, living cells may be capable of culturing when encapsulated in a gel or polymer matrix, or may be capable of culturing when comprising a gel or polymer matrix. In some cases, the polymer or gel may be diffusion permeable to certain components and non-diffusion permeable to other components (e.g., macromolecular components).

Hole(s)

As described herein, one or more processes may be performed in a partition, which may be a hole. The well may be a well of a plurality of wells of a substrate, such as a well array or plate, or the well may be a well or a microcavity of a device (e.g., a microfluidic device) containing the substrate. The aperture may be an array of apertures or an aperture of a plate, or the aperture may be an aperture or a chamber of a device (e.g., a fluidic device). Accordingly, the pores or microwells may take an "open" configuration in which the pores or microwells are exposed to the environment (e.g., contain an open surface) and are accessible on one planar surface of the substrate, or alternatively, the pores or microwells may take a "closed" or "sealed" configuration in which the microwells are inaccessible on the planar surface of the substrate. In some cases, the pores or microwells may be configured to switch between an "open" and "closed" configuration. For example, an "open" microwell or set of microwells may be "closed" or "sealed" using a membrane (e.g., a semi-permeable membrane), oil (e.g., fluorinated oil covered with an aqueous solution), or a lid, as described elsewhere herein.

The well may have a volume of less than 1 milliliter (mL). For example, the wells may be configured to hold a volume of at most 1000 microliters (μl), at most 100 μl, at most 10 μl, at most 1 μl, at most 100 nanoliters (nL), at most 10nL, at most 1nL, at most 100 picoliters (pL), at most 10 (pL), or less. The wells may be configured to hold a volume of about 1000 μl, about 100 μl, about 10 μl, about 1 μl, about 100nL, about 10nL, about 1nL, about 100pL, about 10pL, etc. The well may be configured to hold a volume of at least 10pL, at least 100pL, at least 1nL, at least 10nL, at least 100nL, at least 1 μl, at least 10 μl, at least 100 μl, at least 1000 μl, or more. The pores may be configured to accommodate volumes within the volume ranges listed herein, for example, about 5nL to about 20nL, about 1nL to about 100nL, about 500pL to about 100 μl, and the like. The aperture may be a plurality of apertures having different volumes and may be configured to accommodate a volume suitable for accommodating any of the partitioned volumes described herein.

In some cases, the microwell array or plate comprises a single type of microwell. In some cases, the microwell array or plate includes a wide variety of microwells. For example, a microwell array or plate may include one or more types of microwells within a single microwell array or plate. The types of microwells may have different sizes (e.g., length, width, diameter, depth, cross-sectional area, etc.), shapes (e.g., circular, triangular, square, rectangular, pentagonal, hexagonal, heptagonal, octagonal, nonagonal, decagonal, etc.), aspect ratios, or other physical characteristics. The microwell array or plate may include any number of different types of microwells. For example, the microwell array or plate may include 1,2,3,4,5,6,7,8,9,10,20,30,40,50,60,70,80,90,100,200,300,400,500,600,700,800,900,1000 or more different types of microwells. The apertures may have any size (e.g., length, width, diameter, depth, cross-sectional area, volume, etc.), shape (e.g., circular, triangular, square, rectangular, pentagonal, hexagonal, heptagonal, octagonal, nonagonal, decagonal, other polygonal, etc.), aspect ratio, or other physical characteristics described herein with respect to any aperture.

In some cases, the microwell array or plate includes different types of microwells positioned adjacent to each other within the array or plate. For example, a microwell having one set of dimensions may be positioned adjacent and in contact with another microwell having a different set of dimensions. Similarly, micropores of different geometries may be placed adjacent to or in contact with each other. For example, one microwell may be used to house cells, cell beads, or other samples (e.g., cell components, nucleic acid molecules, etc.), while an adjacent microwell may be used to house a support (e.g., beads such as gel beads), droplets, or other reagents. In some cases, adjacent microwells may be configured to fuse the contents held therein, for example, upon application of a stimulus or spontaneously upon contact with an article in each microwell.

As described elsewhere herein, multiple partitions may be used in the systems, compositions, and methods described herein. For example, any suitable number of partitions (e.g., holes or droplets) may be generated or otherwise provided. For example, in the case of using wells, at least about 1,000 wells, at least about 5,000 wells, at least about 10,000 wells, at least about 50,000 wells, at least about 100,000 wells, at least about 500,000 wells, at least about 1,000,000 wells, at least about 5,000,000 wells at least about 10,000,000 wells, at least about 50,000,000 wells, at least about 100,000,000 wells, at least about 500,000,000 wells, at least about 1,000,000,000 wells, or more wells may be created or otherwise provided. Further, the plurality of apertures may include both unoccupied apertures (e.g., voids) and occupied apertures.

The well may include any of the reagents described herein or a combination thereof. These reagents may include, for example, bar code molecules, enzymes, adaptors, and combinations thereof. The reagent may be physically separated from the sample (e.g., cell bead, or cell component, e.g., protein, nucleic acid molecule, etc.) placed in the well. Such physical separation may be achieved by including the reagent in a support (e.g., a bead such as a gel bead) disposed in the well or by coupling with a support disposed in the well. Such physical separation may also be achieved by dispensing the reagents in the wells and covering the reagents with, for example, a dissolvable, meltable or permeable layer prior to introducing the polynucleotide sample into the wells. The layer may be, for example, oil, wax, a membrane (e.g., a semi-permeable membrane), or the like. The wells may be sealed at any point in time, for example, after addition of the support (e.g., beads), after addition of the reagent, or after addition of any of these components. Sealing of the wells may be used for a variety of purposes, including preventing beads or loaded reagents from escaping from the wells, allowing selective delivery of certain reagents (e.g., via use of semi-permeable membranes), storing the wells before or after further processing, and the like.

The well may contain free reagent, and/or reagent encapsulated in, or otherwise coupled or associated with, a support (e.g., a bead) or droplet. Any of the reagents described in this disclosure can be encapsulated in or otherwise coupled to a support (e.g., a bead) or droplet along with any chemicals, particles, and elements suitable for sample processing reactions involving biomolecules such as, but not limited to, nucleic acid molecules and proteins. For example, beads or droplets used in a sample preparation reaction for DNA sequencing may contain one or more reagents such as enzymes, restriction enzymes (e.g., multiple cleaving enzymes), ligases, polymerases, fluorophores, oligonucleotide barcodes, adaptors, buffers, nucleotides (e.g., dNTPs, ddNTPs), and the like.

Additional examples of reagents include, but are not limited to, buffers, acidic solutions, alkaline solutions, temperature sensitive enzymes, pH sensitive enzymes, light sensitive enzymes, metals, metal ions, magnesium chloride, sodium chloride, manganese, aqueous buffers, mild buffers, ionic buffers, inhibitors, enzymes, proteins, polynucleotides, antibodies, sugars, lipids, oils, salts, ions, detergents, ionic detergents, nonionic detergents, oligonucleotides, nucleotides, deoxyribonucleotide triphosphates (dNTPs), dideoxynucleotide triphosphates (ddNTPs), DNA, RNA, peptide polynucleotides, complementary DNA (cDNA), double-stranded DNA (dsDNA), single-stranded DNA (ssDNA), plasmid DNA, cosmid DNA, chromosomal DNA, genomic DNA, viral DNA, bacterial DNA, mtDNA (mitochondrial DNA), mRNA, rRNA, tRNA, nRNA, siRNA, snRNA, snoRNA, scaRNA, microRNA, dsRNA, ribozymes, riboswitches and viral RNA, polymerases, ligases, restriction enzymes, proteases, nuclease inhibitors, reductants, oxidants, fluorophores, probes, chromophores, dyes, organic substances, emulsifiers, surfactants, water-based polymers, small molecules, antibiotics, and drugs. As described herein, one or more reagents in the well may be used to perform one or more reactions including, but not limited to, cell lysis, cell immobilization, permeabilization, nucleic acid reactions (e.g., nucleic acid extension reactions), amplification, reverse transcription, transposase reactions (e.g., tag fragmentation), and the like.

The aperture may be provided as part of a kit. For example, the kit may include instructions, microwell arrays or devices, and reagents (e.g., beads). The kit may include any useful reagents for performing the processes described herein, such as nucleic acid reactions, barcoding nucleic acid molecules, sample processing (e.g., for cell lysis, immobilization, and/or permeabilization).

In some cases, a well includes a support (e.g., a bead) or droplet that contains a set of reagents (e.g., a set of enzymes, a set of minerals, a set of oligonucleotides, a mixture of different barcode molecules, a mixture of identical barcode molecules) with similar properties. In other cases, the support or droplet comprises a heterogeneous mixture of reagents. In some cases, a heterogeneous mixture of reagents may contain all components necessary to carry out the reaction. In some cases, such mixtures may contain all of the components necessary to carry out the reaction, except 1, 2,3, 4,5 or more components necessary to carry out the reaction. In some cases, such additional components are contained within or otherwise coupled to different supports or droplets, or contained in solutions within partitions (e.g., microwells) of the system.

Fig. 5 schematically shows an example of a microwell array. The array may be housed within a substrate 500. The substrate 500 includes a plurality of holes 502. The holes 502 may have any size or shape, and the spacing between holes, the number of holes in each substrate, and the density of holes on the substrate 500 may vary depending on the particular application. In one such example application, sample molecules 506, which may comprise cells or cellular components (e.g., nucleic acid molecules), are co-partitioned with beads 504, which may include nucleic acid barcode molecules coupled thereto. The aperture 502 may be loaded using gravity or other loading techniques (e.g., centrifugation, liquid handling, acoustic loading, optoelectronic devices, etc.). In some cases, at least one well 502 contains a single sample molecule 506 (e.g., a cell) and a single bead 504.

Reagents may be loaded into the wells sequentially or simultaneously. In some cases, reagents are introduced into the device either before or after a particular operation. In some cases, reagents (which in some cases may be provided in a support or droplet) are introduced sequentially so that different reactions or manipulations occur at different steps. The reagents (or supports or droplets) may also be loaded during the operation interspersed with the reaction or operation steps. For example, a support (or droplet) comprising reagents for fragmenting polynucleotides (e.g., restriction enzymes) and/or other enzymes (e.g., transposases, ligases, polymerases, etc.) can be loaded into one or more wells, followed by loading the support or droplet comprising reagents for attaching nucleic acid barcode molecules to sample nucleic acid molecules. The reagent may be provided simultaneously or sequentially with the sample, e.g., cell or cell component (e.g., organelle, protein, nucleic acid molecule, carbohydrate, lipid, etc.). Thus, the use of pores may be useful when performing multi-step operations or reactions.

As described elsewhere herein, the nucleic acid barcode molecules and other reagents may be contained within a support (e.g., a bead) or droplet. These supports or droplets may be loaded into a partition (e.g., microwell) before, after, or simultaneously with loading the cells such that each cell is in contact with a different support or droplet. This technique can be used to attach unique nucleic acid barcode molecules to nucleic acid molecules obtained from each cell. Alternatively or in addition, the sample nucleic acid molecule may be attached to a support. For example, a partition (e.g., a microwell) may include a bead having a plurality of nucleic acid barcode molecules coupled thereto. The sample nucleic acid molecule or derivative thereof may be coupled or linked to a nucleic acid barcode molecule on a support. The resulting barcoded nucleic acid molecules can then be removed from the partition and, in some cases, pooled and sequenced. In such cases, the nucleic acid barcode sequence may be used to track the source of the sample nucleic acid molecule. For example, polynucleotides having the same barcode may be determined to be derived from the same cell or partition, while polynucleotides having different barcodes may be determined to be derived from different cells, nuclei, cell beads, or partitions.

A variety of methods can be used to load the sample or reagent into the well or microwell. The sample (e.g., cells, cell beads, or cell components) or reagent (as described herein) can be loaded into the well or microwell using external forces (e.g., gravity, electricity, magnetic force) or using a mechanism that drives the sample or reagent into the well (e.g., via pressure-driven flow, centrifugation, optoelectronic devices, acoustic loading, electrokinetic pumping, vacuum, capillary flow, etc.). In some cases, a fluid handling system may be used to load a sample or reagent into a well. The loading of the sample or reagent may follow a poisson or non-poisson distribution, such as super-poisson or sub-poisson. The geometry of the microwells, spacing between the wells, density and size can be modified to accommodate useful sample or reagent distribution, for example, the size and spacing of the microwells can be adjusted so that the sample or reagent can be distributed in a superppoisson manner.

In one particular non-limiting example, the microwell array or microwell plate includes pairs of microwells, wherein each pair of microwells is configured to house a droplet (e.g., comprising a single cell) and a single bead (such as those described herein, which may also be encapsulated in a droplet in some cases). The droplets and beads (or droplets containing beads) may be loaded simultaneously or sequentially, and the droplets and beads may fuse, for example, when the droplets are in contact with the beads, or when a stimulus (e.g., external force, stirring, heat, light, magnetic force, or electric force, etc.) is applied. In some cases, the loading of droplets and beads is superppoisson. In other examples of microwell pairs, the wells are configured to hold two droplets containing different reagents and/or samples that fuse upon contact or upon application of a stimulus. In such a case, the droplets of one microwell of the pair may contain a reagent that can react with the reagent in the droplets of the other microwell of the pair. For example, one droplet may contain a reagent configured to release a nucleic acid barcode molecule of a bead contained in another droplet located in an adjacent microwell. Upon droplet fusion, the nucleic acid barcode molecules can be released from the beads into the partitions (e.g., microwells or contacted microwell pairs) and can undergo further processing (e.g., barcode addition, nucleic acid reaction, etc.). In the case where whole cells or living cells are loaded in microwells, one of the droplets may contain a lysing reagent for lysing the cells as the droplets fuse.

The droplets or supports (e.g., beads) may be separated into one well. The droplets may be selected or subjected to a pretreatment prior to loading into the wells. For example, the droplets may contain cells, and only certain droplets, such as those containing a single cell (or at least one cell), may be selected for loading the wells. Such a preselection process may be used to efficiently load individual cells, such as to obtain a non-poisson distribution, or to pre-filter cells with selected characteristics prior to further separation in the well. In addition, the technique may be used to obtain or prevent the formation of cell doublets or multimers prior to or during loading of microwells.

In some cases, the pore may comprise a nucleic acid barcode molecule attached thereto. The nucleic acid barcode molecules may be attached to the surface of the well (e.g., the wall of the well). The nucleic acid barcode molecules of one well (e.g., compartment barcode sequences) may be different from the nucleic acid barcode molecules of another well, which may allow for the identification of the contents contained within a single compartment or well. In some cases, the nucleic acid barcode molecule may comprise a spatial barcode sequence that can identify the spatial coordinates of a well, for example, within a well array or well plate. In some cases, the nucleic acid barcode molecule may include a unique molecular identifier for identifying the individual molecule. In some cases, the nucleic acid barcode molecule may be configured to attach to or capture nucleic acid molecules within a sample or cell distributed in a well. For example, a nucleic acid barcode molecule may comprise a capture sequence that may be used to capture or hybridize to a nucleic acid molecule (e.g., RNA, DNA) within a sample. In some cases, the nucleic acid barcode molecule may be released from the microwell. For example, a nucleic acid barcode molecule may comprise a chemical cross-linker that can be cleaved upon application of a stimulus (e.g., photo-stimulus, magnetic stimulus, chemical stimulus, biological stimulus). The released nucleic acid barcode molecules (which may be hybridized or configured to hybridize to the sample nucleic acid molecules) may be collected and used for further processing, which may include nucleic acid processing (e.g., amplification, extension, reverse transcription, etc.) and/or characterization (e.g., sequencing). In such cases, unique partition barcode sequences may be used to identify the cell or partition from which the nucleic acid molecule is derived.

The sample within the well can be characterized. In a non-limiting example, such characterization may include imaging a sample (e.g., a cell, cell bead, or cell component) or derivative thereof. Characterization techniques (such as microscopy or imaging) can be used to measure the profile of the sample in a fixed spatial location. For example, when cells (nuclei or cell beads) are partitioned, optionally together with the beads, imaging each microwell and the contents contained therein can provide useful information about cell duplex formation (e.g., frequency, spatial location, etc.), cell-bead pair efficiency, cell viability, cell size, cell morphology, expression level of biomarkers (e.g., surface markers, fluorescent labeling molecules therein, etc.), cell or bead loading rate, number of cell-bead pairs, and the like. In some cases, imaging may be used to characterize living cells in a well, including but not limited to dynamic living cell tracking, cell-cell interactions (when two or more cells are separated together), cell proliferation, and the like. Alternatively or in addition, imaging may be used to characterize the amount of amplification product in a well.

In operation, the wells may be loaded with sample and reagent simultaneously or sequentially. When loading cells, nuclei or cell beads, the wells may be washed, for example, to remove excess cells (or nuclei or cell beads) from the wells, microwell arrays or plates. Similarly, a wash may be performed to remove excess beads or other reagents from the wells, microwell arrays or microwell plates. Where living cells are used, these cells may be lysed in individual partitions to release intracellular components or cellular analytes. Alternatively, cells may be fixed or permeabilized in individual partitions. The intracellular components or cellular analytes may be coupled to the support, e.g., on the surface of a microwell, on a solid support (e.g., a bead), or they may be collected for further downstream processing. For example, after cell lysis, intracellular components or cellular analytes may be transferred to individual droplets or other partitions for barcode addition. Alternatively or in addition, intracellular components or cellular analytes (e.g., nucleic acid molecules) can be coupled to beads comprising nucleic acid barcode molecules, which can then be collected and further processed, e.g., subjected to a nucleic acid reaction such as reverse transcription, amplification, or extension, and the nucleic acid molecules thereon can be further characterized, e.g., via sequencing. Alternatively or in addition, intracellular components or cellular analytes may be barcoded in the well (e.g., using beads comprising releasable nucleic acid barcode molecules or on the surface of microwells comprising nucleic acid barcode molecules). The barcoded nucleic acid molecules or analytes may be further processed in the well or the barcoded nucleic acid molecules or analytes may be collected from the individual partitions and subjected to further processing outside the partitions. Further processing may include nucleic acid processing (e.g., performing amplification, extension) or characterization (e.g., fluorescent monitoring, sequencing of amplified molecules). In any convenient or useful step, the wells (or microwell array or microwell plate) may be sealed (e.g., using oil, film, wax, etc.), which enables storage of the assay or selective introduction of additional reagents.

Once sealed, the well may be subjected to conditions that further process the biological particles (e.g., cells, cell beads, or nuclei) in the well. For example, the reagents in the wells may allow for further processing of the biological particles, such as lysing cells or nuclei, as further described herein. Alternatively, the well (or wells, such as those in a well-based array) containing the biological particle (e.g., a cell, cell bead, or cell nucleus) may be subjected to a freeze-thaw cycle to process the biological particle, e.g., lyse the cell or cell nucleus. The well containing the biological particle (e.g., cell bead, or cell nucleus) can be subjected to a freezing temperature (e.g., 0 ℃, less than 0 ℃,5 ℃,10 ℃, 15 ℃,20 ℃,25 ℃, 30 ℃, 35 ℃, 40 ℃, 45 ℃,50 ℃, 55 ℃,60 ℃, 65 ℃, 70 ℃, 80 ℃, or-85 ℃). Freezing can be performed in a suitable manner, such as a sub-zero freezer or a dry ice/ethanol bath. After initial freezing, the well (or wells) containing the biological particles (e.g., cells, cell beads, one or more nuclei) may be subjected to a freeze-thaw cycle to lyse the biological particles. In one embodiment, the initially frozen well (or wells) is thawed to a temperature above the freezing temperature (e.g., room temperature or 25 ℃). In another embodiment, the freezing time is less than 10 minutes (e.g., 5 minutes or 7 minutes), followed by thawing time at room temperature is less than 10 minutes (e.g., 5 minutes or 7 minutes). The freeze-thaw cycle may be repeated multiple times, e.g., 2,3, or 4 times, to obtain a lysate of biological particles (e.g., cells, cell beads, one or more nuclei) in the well (or wells). In one embodiment, the freezing, thawing, and/or freeze/thaw cycles are performed without lysis buffer.

FIG. 6 schematically shows an example workflow for processing nucleic acid molecules within a sample. A substrate 600 including a plurality of micro-holes 602 may be provided. A sample 606, which may comprise cells, cell beads, cellular components, or analytes (e.g., proteins and/or nucleic acid molecules), may be co-partitioned in the plurality of microwells 602 along with the plurality of beads 604 comprising nucleic acid barcode molecules. During process 610, sample 606 may be processed within a partition. For example, in the case of living cells, the cells may be subjected to conditions sufficient to lyse the cells or nuclei and release the analytes contained therein. In process 620, the beads 604 may be further processed. For example, processes 620a and 620b schematically illustrate different workflows depending on the characteristics of the beads 604.

In 620a, the bead includes a nucleic acid barcode molecule attached thereto, and the sample nucleic acid molecule (e.g., RNA, DNA) can be attached to the nucleic acid barcode molecule, e.g., via ligation hybridization. This attachment may occur on the beads. In process 630, beads 604 may be collected from multiple wells 602 and pooled. Further processing may be performed in process 640. For example, one or more nucleic acid reactions, such as reverse transcription, nucleic acid extension, amplification, ligation, transposition, and the like, may be performed. In some cases, the adapter sequence is linked to a nucleic acid molecule or derivative thereof, as described elsewhere herein. For example, sequencing primer sequences may be added to each end of the nucleic acid molecule. In process 650, further characterization (such as sequencing) can be performed to generate sequencing reads. These sequencing reads may yield information about individual cells or cell populations, which may be presented visually or graphically, for example, in map 655.

In 620b, the bead comprises a nucleic acid barcode molecule releasably attached thereto, as described below. The beads may degrade or otherwise release nucleic acid barcode molecules into the wells 602, which may then be used to barcode the nucleic acid molecules within the wells 602. Further processing may be performed either inside the partition or outside the partition. For example, one or more nucleic acid reactions, such as reverse transcription, nucleic acid extension, amplification, ligation, transposition, and the like, may be performed. In some cases, the adapter sequence is linked to a nucleic acid molecule or derivative thereof, as described elsewhere herein. For example, sequencing primer sequences may be added to each end of the nucleic acid molecule. In process 650, further characterization (such as sequencing) can be performed to generate sequencing reads. These sequencing reads may yield information about individual cells or cell populations, which may be presented visually or graphically, for example, in map 655.

Bead particle

The nucleic acid barcode molecules may be delivered to a partition (e.g., droplet or well) via a solid support or carrier (e.g., bead). In some cases, the nucleic acid barcode molecule is initially associated with the solid support and then released from the solid support upon application of a stimulus that allows the nucleic acid barcode molecule to dissociate or release from the solid support. In specific examples, the nucleic acid barcode molecules are initially associated with a solid support (e.g., a bead) and then released from the solid support upon application of a biological stimulus, a chemical stimulus, a thermal stimulus, an electrical stimulus, a magnetic stimulus, and/or a light stimulus.

The nucleic acid barcode molecule may comprise a barcode sequence and a functional sequence, such as a nucleic acid primer sequence or a Template Switch Oligonucleotide (TSO) sequence.

The solid support may be a bead. The solid support (e.g., beads) can be porous, non-porous, hollow (e.g., microcapsules), solid, semi-solid, and/or a combination of the foregoing properties. The beads may be solid, semi-fluid, and/or a combination of the foregoing properties. In some cases, the solid support (e.g., beads) may be at least partially dissolvable, rupturable, and/or degradable. In some cases, the solid support (e.g., beads) may not be degradable. In some cases, the solid support (e.g., bead) may be a gel bead. The gel beads may be hydrogel beads. Gel beads may be formed from molecular precursors (such as polymers or monomeric species). The semi-solid support (e.g., bead) may be a liposome bead. The solid support (e.g., beads) may comprise metals including iron oxide, gold, and silver. In some cases, the solid support (e.g., beads) may be silica beads. In some cases, the solid support (e.g., beads) may be rigid. In other cases, the solid support (e.g., beads) may be flexible and/or compressible.

The partition may include one or more unique identifiers, such as a bar code. The bar code may be pre-delivered, subsequently delivered, or simultaneously delivered to the partition containing the compartmentalized or separated biological particles. For example, the bar code may be injected into the droplet or deposited in the microwell before, after, or simultaneously with the droplet generation or the provision of reagents in the microwells, respectively. The delivery of the bar code to a particular partition allows the later attribution of the characteristics of the individual biological particles to the particular partition. The barcode may be delivered to the partition via any suitable mechanism, such as on a nucleic acid molecule (e.g., an oligonucleotide). The barcoded nucleic acid molecules can be delivered to the partition via a support (e.g., a bead). In some cases, the support may comprise beads. The beads are described in further detail below.

In some cases, the barcoded nucleic acid molecules may be initially associated with a support (e.g., a bead) and then released from the support. The release of the barcoded nucleic acid molecules may be passive (e.g., by diffusion out of the support or out of the support). In addition or alternatively, release from the support may be performed after application of a stimulus that allows the barcode-loaded nucleic acid molecule to dissociate or release from the support (e.g., bead). Such stimulation may disrupt the support, i.e., couple the barcoded nucleic acid molecules to the support or couple within the support or an interaction of both. Such stimuli may include, for example, thermal stimuli, optical stimuli, chemical stimuli (e.g., pH changes or use of a reducing agent), mechanical stimuli, radiation stimuli, biological stimuli (e.g., enzymes), or any combination thereof. Methods and systems for separating bar code bearing beads into droplets are provided in U.S. patent publication nos. 2019/0367997 and 2019/0064173, and international application No. PCT/US20/17785, each of which is incorporated herein by reference in its entirety for all purposes.

In some examples, the beads, biological particles, and droplets may flow along a channel (e.g., a channel of a microfluidic device), in some cases, at a substantially regular flow profile (e.g., at a regular flow rate). Such regular flow patterns may allow the droplets to include a single bead and a single biological particle. Such regular flow patterns may allow droplets to have an occupancy rate of greater than 5%, 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, or 95% (e.g., droplets with beads and biological particles). Such regular flow patterns and devices that may be used to provide such regular flow patterns are provided, for example, in U.S. patent publication No. 2015/0292988, which is incorporated herein by reference in its entirety.

The beads may be porous, non-porous, solid, semi-fluid, and/or any combination thereof. In some cases, the beads may be dissolvable, destructible, or degradable. In some cases, the beads may not be degradable. In some cases, the beads may be gel beads. The gel beads may be hydrogel beads. Gel beads may be formed from molecular precursors (such as polymers or monomeric species). The semi-solid beads may be liposome beads. The solid beads may comprise metals including iron oxide, gold, and silver. In some cases, the beads may be silica beads. In some cases, the beads may be rigid. In other cases, the beads may be flexible and/or compressible.

The beads may have any suitable shape. Examples of bead shapes include, but are not limited to, spherical, non-spherical, oval, oblong, amorphous, circular, cylindrical, and variations thereof.

The beads may be of uniform or non-uniform size. In some cases, the beads may have a diameter of at least about 10 nanometers (nm), 100nm, 500nm, 1 micrometer (μm), 5 μm, 10 μm, 20 μm, 30 μm, 40 μm, 50 μm, 60 μm, 70 μm, 80 μm, 90 μm, 100 μm, 250 μm, 500 μm, 1mm or more. In some cases, the beads may have a diameter of less than about 10nm, 100nm, 500nm, 1 μm, 5 μm, 10 μm, 20 μm, 30 μm, 40 μm, 50 μm, 60 μm, 70 μm, 80 μm, 90 μm, 100 μm, 250 μm, 500 μm, 1mm or less. In some cases, the beads may have a diameter in the range of about 40-75 μm, 30-75 μm, 20-75 μm, 40-85 μm, 40-95 μm, 20-100 μm, 10-100 μm, 1-100 μm, 20-250 μm, or 20-500 μm.

In certain aspects, the beads may be provided in the form of a population of beads or a plurality of beads having a relatively monodisperse size distribution. Where it may be desirable to provide a relatively consistent amount of reagent within a partition, maintaining relatively consistent bead characteristics (such as size) may contribute to overall consistency. In particular, the beads described herein can have a size distribution with a coefficient of variation of the bead cross-sectional dimension of less than 50%, less than 40%, less than 30%, less than 20%, and in some cases less than 15%, less than 10%, less than 5% or less.

The beads may comprise natural and/or synthetic materials. For example, the beads may comprise natural polymers, synthetic polymers, or both natural and synthetic polymers. Examples of natural polymers include proteins and sugars such as deoxyribonucleic acid, rubber, cellulose, starch (e.g., amylose, amylopectin), proteins, enzymes, polysaccharides, silk, polyhydroxyalkanoates, chitosan, dextran, collagen, carrageenan, psyllium, gum arabic, agar, gelatin, shellac, karaya, xanthan, corn gum, guar gum, karaya, agarose, alginic acid, alginate, or natural polymers thereof. Examples of synthetic polymers include acrylic, nylon, silicone, spandex (spandex), viscose rayon, polycarboxylic acid, polyvinyl acetate, polyacrylamide, polyacrylate, polyethylene glycol, polyurethane, polylactic acid, silica, polystyrene, polyacrylonitrile, polybutadiene, polycarbonate, polyethylene terephthalate, poly (chlorotrifluoroethylene), poly (ethylene oxide), poly (ethylene terephthalate), polyethylene, polyisobutylene, poly (methyl methacrylate), poly (formaldehyde), polyoxymethylene, polypropylene, polystyrene, poly (tetrafluoroethylene), poly (vinyl acetate), poly (vinyl alcohol), poly (vinyl chloride), poly (vinylidene fluoride), poly (vinyl fluoride), and/or combinations (e.g., copolymers) thereof. The beads may also be formed from materials other than polymers including lipids, micelles, ceramics, glass-ceramics, material composites, metals, other inorganic materials, and the like.

In some cases, the beads may contain molecular precursors (e.g., monomers or polymers) that can form a polymer network via polymerization of the molecular precursors. In some cases, the precursor may be an already polymerized species capable of undergoing further polymerization (e.g., via chemical crosslinks). In some cases, the precursor may include one or more of an acrylamide or methacrylamide monomer, oligomer, or polymer. In some cases, the beads may comprise a prepolymer, which is an oligomer capable of further polymerization. For example, polyurethane beads can be prepared using a prepolymer. In some cases, the beads may contain separate polymers that may be further polymerized together. In some cases, the beads may be generated via polymerization of different precursors such that they comprise mixed polymers, copolymers, and/or block copolymers. In some cases, the beads may comprise covalent or ionic bonds between polymer precursors (e.g., monomers, oligomers, linear polymers), nucleic acid molecules (e.g., oligonucleotides), primers, and other entities. In some cases, the covalent bond may be a carbon-carbon bond, thioether bond, or carbon-heteroatom bond.

The crosslinking may be permanent or reversible depending on the particular crosslinking agent used. Reversible crosslinking may allow linearization or dissociation of the polymer under appropriate conditions. In some cases, reversible crosslinking may also allow for reversible attachment of materials that bind to the surface of the beads. In some cases, the crosslinker may form disulfide bonds. In some cases, the disulfide-forming chemical cross-linking agent may be cystamine or a modified cystamine.

In some cases, disulfide bonds can be formed between molecular precursor units (e.g., monomers, oligomers, or linear polymers) or precursors and nucleic acid molecules (e.g., oligonucleotides) incorporated into the beads. Cystamine (including modified cystamine) is for example an organic agent comprising disulfide bonds, which can be used as a cross-linking agent between individual monomers or polymer precursors of the beads. Polyacrylamide can be polymerized in the presence of cystamine or cystamine-containing species (e.g., modified cystamine) to produce polyacrylamide gel beads comprising disulfide linkages (e.g., chemically degradable beads comprising a chemically reducible cross-linking agent). Disulfide bonds may allow the beads to be degraded (or dissolved) when the beads are exposed to a reducing agent.

In some cases, chitosan (a linear polysaccharide polymer) can be crosslinked with glutaraldehyde via hydrophilic chains to form beads. Crosslinking of the chitosan polymer may be achieved by chemical reactions initiated by heat, pressure, pH changes and/or radiation.

In some cases, the beads may comprise acrydite moieties, which in some aspects may be used to attach one or more nucleic acid molecules (e.g., barcode sequences, barcoded nucleic acid molecules, barcoded oligonucleotides, primers, or other oligonucleotides) to the beads. In some cases, an acrydite moiety may refer to an acrydite analog generated from the reaction of acrydite with one or more species (such as the reaction of acrydite with other monomers and crosslinkers during a polymerization reaction). The acrydite moiety can be modified to form a chemical bond with a species to be ligated, such as a nucleic acid molecule (e.g., a barcode sequence, a barcoded nucleic acid molecule, a barcoded oligonucleotide, a primer, or other oligonucleotide). The Acrydite moiety may be modified with a thiol group capable of forming a disulfide bond, or may be modified with a group already containing a disulfide bond. The sulfhydryl group or disulfide bond (via disulfide interchange) may be used as an anchor point for the species to be linked, or another part of the acrydite moiety may be used for linking. In some cases, the linkage may be reversible such that when the disulfide bond breaks (e.g., in the presence of a reducing agent), the linked species is released from the bead. In other cases, the acrydite moiety may contain reactive hydroxyl groups that may be used for attachment.

Functionalization of the beads for attachment of nucleic acid molecules (e.g., oligonucleotides) can be accomplished by a number of different methods, including activation of chemical groups within the polymer, incorporation of active or activatable functional groups in the polymer structure, or attachment at the prepolymer or monomer stage of bead generation.

For example, a precursor (e.g., monomer, crosslinker) that polymerizes to form a bead may comprise acrydite moieties such that when the bead is produced, the bead also comprises acrydite moieties. The acrydite moiety can be attached to a nucleic acid molecule (e.g., an oligonucleotide) comprising one or more functional sequences, such as a TSO sequence or a primer sequence (e.g., a poly-T sequence, or a nucleic acid primer sequence complementary to a target nucleic acid sequence and/or used to amplify a target nucleic acid sequence, a random primer, or a primer sequence of a messenger RNA, etc.) that can be incorporated into a bead. The one or more barcode sequences may include sequences that are the same for all nucleic acid molecules coupled to a given bead and/or sequences that are different in all nucleic acid molecules coupled to a given bead. The nucleic acid molecules can be incorporated into beads.

In some cases, the nucleic acid molecule may comprise a functional sequence (e.g., for ligation to a sequencing flow cell), e.g., forA sequenced P5 sequence (or portion thereof). In some cases, the nucleic acid molecule or derivative thereof (e.g., an oligonucleotide or polynucleotide generated from the nucleic acid molecule) may comprise another functional sequence, e.g., a P7 sequence (or portion thereof) for attachment to a sequencing flow cell for Illumina sequencing. In some cases, the nucleic acid molecule may comprise a barcode sequence. In some cases, the nucleic acid molecule may also comprise a Unique Molecular Identifier (UMI). In some cases, the nucleic acid molecule may comprise an R1 primer sequence for Illumina sequencing. In some cases, the nucleic acid molecule may comprise an R2 primer sequence for Illumina sequencing. Examples of such nucleic acid molecules (e.g., oligonucleotides, polynucleotides, etc.) and uses thereof that may be used with the compositions, devices, methods, and systems of the present disclosure are provided in U.S. patent publication nos. 2014/0378345 and 2015/0376609, each of which is incorporated herein by reference in its entirety.

In some cases, the nucleic acid molecule may comprise one or more functional sequences. For example, the functional sequence may include a sequence for attachment to a sequencing flow cell, e.g., for use inSequenced P5 sequence. In some cases, the nucleic acid molecule or derivative thereof (e.g., an oligonucleotide or polynucleotide generated from the nucleic acid molecule) may comprise another functional sequence, e.g., a P7 sequence for attachment to a sequencing flow cell for Illumina sequencing. In some cases, the functional sequence may comprise a barcode sequence or a plurality of barcode sequences. In some cases, the functional sequence may comprise a Unique Molecular Identifier (UMI). In some cases, the functional sequence may include a primer sequence (e.g., an R1 primer sequence for Illumina sequencing, an R2 primer sequence for Illumina sequencing, etc.). In some cases, the functional sequence may comprise a partial sequence, such as a partial barcode sequence, a partial anchor sequence, a partial sequencing primer sequence (e.g., a partial R1 sequence, a partial R2 sequence, etc.), a partial sequence configured to be attached to a flow cell of a sequencer (e.g., a partial P5 sequence, a partial P7 sequence, etc.), or a partial sequence of any other type of sequence described elsewhere herein. For example, a partial sequence may contain contiguous or consecutive portions or segments of the complete sequence, but not all. In some cases, the downstream procedure may extend the partial sequence or derivative thereof to obtain the complete sequence of the partial sequence or derivative thereof.

Examples of such nucleic acid molecules (e.g., oligonucleotides, polynucleotides, etc.) and uses thereof that may be used with the compositions, devices, methods, and systems of the present disclosure are provided in U.S. patent publication nos. 2014/0378345 and 2015/0376609, each of which is incorporated herein by reference in its entirety.

Fig. 3 shows an example of a bead carrying a bar code. Nucleic acid molecules 302 (such as oligonucleotides) can be coupled to beads 304 via releasable linkages 306 (such as disulfide linkers). The same bead 304 may be coupled (e.g., via a releasable bond) to one or more other nucleic acid molecules 318, 320. The nucleic acid molecule 302 may be or comprise a barcode. As described elsewhere herein, the structure of a bar code may comprise a plurality of sequential elements. Nucleic acid molecule 302 may comprise functional sequence 308 that may be used in subsequent processing. For example, the functional sequence 308 may include a sequencer-specific flow cell junction sequence (e.g., forP5 sequence of a sequencing system) and sequencing primer sequences (e.g., for use inR1 primer of a sequencing system) or a partial sequence thereof. The nucleic acid molecule 302 can comprise a barcode sequence 310 for barcode a sample (e.g., DNA, RNA, protein, etc.). In some cases, the barcode sequence 310 may be bead-specific such that the barcode sequence 310 is common to all nucleic acid molecules (e.g., including the nucleic acid molecule 302) coupled to the same bead 304. Alternatively or in addition, the barcode sequence 310 may be partition-specific such that the barcode sequence 310 is common to all nucleic acid molecules coupled to one or more beads that are partitioned into the same partition. The nucleic acid molecule 302 can comprise a specific primer sequence 312, such as an mRNA-specific primer sequence (e.g., a poly-T sequence), a targeting primer sequence, and/or a random primer sequence. The nucleic acid molecule 302 may include an anchor sequence 314 to ensure that the specific primer sequence 312 hybridizes at the sequence end (e.g., of an mRNA). For example, the anchor sequence 314 can comprise a random short sequence of nucleotides, such as a 1-mer, 2-mer, 3-mer, or longer sequence, which can ensure that the poly-T fragment is more likely to hybridize at the sequence end of the poly-a tail of the mRNA.

The nucleic acid molecule 302 can comprise a unique molecular identification sequence 316 (e.g., a Unique Molecular Identifier (UMI)). In some cases, the unique molecular identification sequence 316 may comprise about 5 to about 8 nucleotides. Alternatively, the unique molecular identification sequence 316 may be compressed by less than about 5 or more than about 8 nucleotides. Unique molecular identification sequence 316 can be a unique sequence that varies between individual nucleic acid molecules (e.g., 302, 318, 320, etc.) coupled to a single bead (e.g., bead 304). In some cases, the unique molecular identification sequence 316 can be a random sequence (e.g., such as a random N-mer sequence). For example, UMI may provide a unique identifier of the captured starting mRNA molecule in order to allow quantification of the amount of RNA initially expressed. It should be appreciated that although FIG. 3 shows three nucleic acid molecules 302, 318, 320 coupled to the surface of bead 304, individual beads may be coupled to any number of individual nucleic acid molecules, e.g., from one to tens to hundreds of thousands or even millions of individual nucleic acid molecules. The respective barcodes of individual nucleic acid molecules may comprise common sequence fragments or relatively common sequence fragments (e.g., 308, 310, 312, etc.) and variable or unique sequence fragments (e.g., 316) between different individual nucleic acid molecules coupled to the same bead.

In operation, biological particles (e.g., cells, DNA, RNA, etc.) can be co-partitioned along with the barcoded beads 304. The nucleic acid barcode molecules 302, 318, 320 may be released from the beads 304 in the partition. For example, in the context of analyzing sample RNA, the poly-T fragment (e.g., 312) of one of the released nucleic acid molecules (e.g., 302) can hybridize to the poly-a tail of an mRNA molecule. Reverse transcription can produce a cDNA transcript of mRNA, but the transcript includes each of the sequence segments 308, 310, 316 of the nucleic acid molecule 302. Since nucleic acid molecule 302 comprises anchor sequence 314, it is more likely to hybridize to the sequence end of the poly-A tail of mRNA and initiate reverse transcription. Within any given partition, all cDNA transcripts of individual mRNA molecules may contain one common barcode sequence fragment 310. However, transcripts made from different mRNA molecules within a given partition may vary at unique molecular identification sequence 312 fragment (e.g., UMI fragment). Advantageously, even after any subsequent amplification of the contents of a given partition, the number of different UMIs may be indicative of the amount of mRNA originating from the given partition, and thus the amount of mRNA originating from a biological particle (e.g., a cell). As described above, transcripts can be amplified, purified and sequenced to identify the sequence of cDNA transcripts of mRNA, as well as to sequence barcode and UMI fragments. Although a poly-T primer sequence is described, other targeting or random primer sequences may be used to initiate a reverse transcription reaction. Also, while described as releasing barcoded oligonucleotides into a partition, in some cases, nucleic acid molecules that bind to beads (e.g., gel beads) can be used to hybridize and capture mRNA on a bead solid phase, e.g., to facilitate separation of RNA from other cellular content. In such cases, further processing may be performed in the partition or outside the partition (e.g., in bulk). For example, the RNA molecules on the beads may undergo reverse transcription or other nucleic acid processing, additional adaptor sequences may be added to the barcoded nucleic acid molecules, or other nucleic acid reactions (e.g., amplification, nucleic acid extension) may be performed. The beads or products thereof (e.g., barcoded nucleic acid molecules) may be collected from the partitions and/or pooled together prior to being subjected to cleaning and further characterization (e.g., sequencing).

The operations described herein may be performed in any useful or convenient step. For example, beads comprising nucleic acid barcode molecules may be introduced into a partition (e.g., a well or droplet) before, during, or after introduction of a sample into the partition. The nucleic acid molecules of the sample may be barcoded, which may occur on the beads (in case the nucleic acid molecules remain coupled to the beads) or after release of the nucleic acid barcode molecules into the partitions. Where nucleic acid molecules from the sample remain attached to the beads, the beads from the various partitions may be collected, pooled, and then subjected to further processing (e.g., reverse transcription, attachment of adaptors, amplification, clearance, sequencing). In other cases, the processing may occur in a partition. For example, conditions sufficient to perform barcoding, adaptor ligation, reverse transcription or other nucleic acid processing operations may be provided in the partitions, and these operations performed prior to cleaning and sequencing.

In some cases, the beads may comprise a capture sequence or a binding sequence configured to bind to a corresponding capture sequence or binding sequence. In some cases, the bead may comprise a plurality of different capture or binding sequences configured to bind to different respective capture or binding sequences. For example, the beads may comprise a first subset of one or more capture sequences each configured to bind to a first corresponding capture sequence, a second subset of one or more capture sequences each configured to bind to a second corresponding capture sequence, a third subset of one or more capture sequences each configured to bind to a third corresponding capture sequence, and so on. The beads may contain any number of different capture sequences. In some cases, the beads may comprise at least 2,3,4,5,6,7,8,9,10 or more different capture sequences or binding sequences configured to bind to different respective capture sequences or binding sequences, respectively. Alternatively or in addition, the beads may comprise up to about 10,9,8,7,6,5,4,3, or 2 different capture or binding sequences configured to bind to different respective capture or binding sequences. In some cases, different capture sequences or binding sequences may be configured to facilitate analysis of the same type of analyte. In some cases, different capture sequences or binding sequences may be configured to facilitate analysis of different types of analytes (with the same beads). The capture sequences may be designed to ligate to corresponding capture sequences. Advantageously, such corresponding capture sequences can be introduced or otherwise induced into biological particles (e.g., cells, cell beads, etc.) for different assays in various forms (e.g., an antibody comprising a barcoding for the corresponding capture sequence, MHC dextramer comprising a barcoding for the corresponding capture sequence, a guide RNA molecule comprising a barcoding for the corresponding capture sequence, etc.), such that the corresponding capture sequence can later interact with the capture sequence associated with the bead. In some cases, the capture sequence coupled to the bead (or other support) may be configured to be linked to a linker molecule, such as a splint molecule, wherein the linker molecule is configured to couple the bead (or other support) to other molecules, such as to one or more analytes or one or more other linker molecules, through the linker molecule.

Fig. 4 shows another example of a bead carrying a bar code. Nucleic acid molecules 405 (such as oligonucleotides) can be coupled to beads 404 by releasable linkages 406 (such as disulfide linkers). The nucleic acid molecule 405 may comprise a first capture sequence 460. The same bead 404 may be coupled (e.g., via releasable linkages) to one or more other nucleic acid molecules 403, 407 comprising other capture sequences. The nucleic acid molecule 405 may be or comprise a barcode. As noted elsewhere herein, the structure of the barcode may include a number of sequence elements, such as functional sequence 408 (e.g., flow cell attachment sequences, sequencing primer sequences, etc.), barcode sequence 410 (e.g., bead-specific sequences shared by the beads, partition-specific sequences shared by the partitions, etc.), and unique molecular identifiers 412 (e.g., unique sequences within different molecules attached to the beads), or partial sequences thereof. The capture sequence 460 may be configured to connect to a corresponding capture sequence 465. In some cases, the corresponding capture sequence 465 may be coupled to another molecule, which may be an analyte or an intermediate carrier. For example, as shown in fig. 4, a corresponding capture sequence 465 is coupled to a guide RNA molecule 462 that comprises a target sequence 464, wherein the target sequence 464 is configured to be linked to an analyte. Another oligonucleotide molecule 407 attached to the bead 404 comprises a second capture sequence 480 configured to be attached to a second corresponding capture sequence 485. As shown in fig. 4, a second corresponding capture sequence 485 is coupled to the antibody 482. In some cases, the antibody 482 may have binding specificity for an analyte (e.g., a surface protein). Alternatively, the antibody 482 may not have binding specificity. Another oligonucleotide molecule 403 that is linked to bead 404 comprises a third capture sequence 470 that is configured to be linked to a second corresponding capture sequence 475. As shown in fig. 4, a third corresponding capture sequence 475 is coupled to molecule 472. The molecule 472 may or may not be configured to target an analyte. Other oligonucleotide molecules 403, 407 may comprise other sequences (e.g., functional sequences, barcode sequences, UMI, etc.) described with respect to oligonucleotide molecule 405. While fig. 4 shows a single oligonucleotide molecule comprising each capture sequence, it is to be understood that the bead may comprise a set of one or more oligonucleotide molecules for each capture sequence, wherein each oligonucleotide molecule comprises a capture sequence. For example, the beads may comprise any number of sets having one or more different capture sequences. Alternatively or in addition, the beads 404 may comprise other capture sequences. Alternatively or in addition, the beads 404 may contain fewer types of capture sequences (e.g., two capture sequences). Alternatively or in addition, the beads 404 may comprise an oligonucleotide molecule comprising a promoter sequence, such as a specific promoter sequence, such as an mRNA specific promoter sequence (e.g., a poly-T sequence), a targeted promoter sequence, and/or a random promoter sequence, for example, to facilitate determination of gene expression.

In operation, the barcoded oligonucleotides can be released (e.g., in a partition), as described elsewhere herein. Alternatively, nucleic acid molecules bound to beads (e.g., gel beads) can be used to hybridize and capture analytes (e.g., one or more types of analytes) on a bead solid phase.

In some cases, precursors containing functional groups that are reactive or that can be activated such that they become reactive can be polymerized with other precursors to produce gel beads containing activated or activatable functional groups. This functional group can then be used to attach additional species (e.g., disulfide linkers, primers, other oligonucleotides, etc.) to the gel beads. For example, some precursors comprising carboxylic acid (COOH) groups may be copolymerized with other precursors to form gel beads that also comprise COOH functional groups. In some cases, acrylic acid (species containing free COOH groups), acrylamide, and bis (acryloyl) cystamine may be copolymerized together to form gel beads containing free COOH groups. The COOH groups of the gel beads may be activated (e.g., via 1-ethyl-3- (3-dimethylaminopropyl) carbodiimide (EDC) and N-hydroxysuccinimide (NHS) or 4- (4, 6-dimethoxy-1, 3, 5-triazin-2-yl) -4-methylmorpholine hydrochloride (DMTMM)) so that they are reactive (e.g., reactive to amine functionality in the case EDC/NHS or DMTMM is used for activation). The activated COOH groups can then be reacted with an appropriate species comprising the moiety to be attached to the bead (e.g., a species comprising an amine functionality where the carboxylic acid group is activated to be reactive with the amine functionality).

Beads containing disulfide bonds in their polymer network can be functionalized with additional species by reducing some of the disulfide bonds to free sulfhydryl groups. Disulfide bonds can be reduced by, for example, the action of a reducing agent (e.g., DTT, TCEP, etc.) to form free sulfhydryl groups without dissolving the beads. The free thiol of the bead may then react with the free thiol of the species or with a species comprising another disulfide bond (e.g., by thiol-disulfide exchange) such that the species may be attached to the bead (e.g., by the disulfide bond generated). In some cases, the free thiol groups of the beads may react with any other suitable group. For example, the free thiol groups of the beads may react with species comprising acrydite moieties. The free thiol groups of the beads can be reacted with acrydite by michael addition chemistry such that species comprising acrydite are attached to the beads. In some cases, uncontrolled reactions can be prevented by adding thiol capping agents such as N-ethylmaleimide (N-ETHYLMALIEAMIDE) and iodoacetate.

The activation of disulfide bonds within the beads can be controlled such that only a small amount of disulfide bonds are activated. Control may be exercised, for example, by controlling the concentration of reducing agents used to generate free sulfhydryl groups and/or controlling the concentration of reagents used to form disulfide bonds in bead polymerization. In some cases, low concentrations (e.g., a molecular ratio of reducing agent: gel beads of less than or equal to about 1:100,000,000,000, less than or equal to about 1:10,000,000, less than or equal to about 1:1,000,000,000, less than or equal to about 1:100,000,000, less than or equal to about 1:10,000,000, less than or equal to about 1:1,000,000, less than or equal to about 1:100,000, less than or equal to about 1:10,000) of reducing agent may be used for the reduction. Controlling the number of disulfide bonds reduced to free sulfhydryl groups may be useful to ensure bead structural integrity during functionalization. In some cases, a photoactive agent such as a fluorescent dye may be coupled to the beads via free thiol groups of the beads and used to quantify the number of free thiol groups present in the beads and/or track the beads.

In some cases, it may be advantageous to add a portion to the gel beads after they are formed. For example, the addition of oligonucleotides (e.g., barcoded oligonucleotides) after gel bead formation can avoid loss of species during chain transfer termination that may occur during polymerization. In addition, smaller precursors (e.g., monomers or crosslinkers that do not contain side chain groups and attached moieties) can be used for polymerization and can be minimally hindered from growing chain ends by viscous effects. In some cases, functionalization after gel bead synthesis can minimize exposure of the species to be loaded (e.g., oligonucleotides) to potential damaging agents (e.g., free radicals) and/or chemical environments. In some cases, the resulting gel may have an upper critical dissolution temperature (UCST) that may allow the temperature driven swelling and collapsing of the beads. Such functionality may aid in the permeation of oligonucleotides (e.g., primers) into the beads during subsequent functionalization of the beads with the oligonucleotides. Post-production functionalization can also be used to control the loading ratio of species in the beads so that, for example, variability in loading ratio is minimized. The loading of the species may also be performed in a batch process, such that multiple beads may be functionalized with the species in a single batch.

Beads injected or otherwise introduced into the partition may contain a releasably, cleavable, or reversibly linked barcode. Beads injected or otherwise introduced into the partition may contain activatable barcodes. The beads injected or otherwise introduced into the partition may be degradable, destructible, or dissolvable beads.

The barcode may be releasably, cleavable, or reversibly attached to the bead such that the barcode may be released or releasable by cleavage of the bond between the barcode molecule and the bead, or by degradation of the base bead itself, allowing the barcode to be accessed or accessible by other reagents, or both. In non-limiting examples, cleavage may be achieved by reducing disulfide bonds, using restriction enzymes, photoactivated cleavage, or cleavage and/or reaction via other types of stimuli (e.g., chemical, thermal, pH, enzymatic, etc.), as described elsewhere herein. Releasable barcodes may sometimes be referred to as activatable because they are available for reaction once released. Thus, for example, an activatable barcode may be activated by releasing the barcode from the bead (or other suitable type of partition as described herein). Other activatable configurations are also contemplated in the context of the described methods and systems.

In addition to or instead of cleavable linkages between the bead and associated molecules, such as barcode-containing nucleic acid molecules (e.g., barcoded oligonucleotides), the bead may be degradable, destructible, or dissolvable, either spontaneously or upon exposure to one or more stimuli (e.g., temperature change, pH change, exposure to specific chemical species or chemical phases, exposure to light, reducing agents, etc.). In some cases, the beads may be dissolvable such that the material component of the beads dissolves when exposed to a particular chemical species or environmental change (such as a temperature change or pH change). In some cases, the gel beads may degrade or dissolve under elevated temperature and/or alkaline conditions. In some cases, the beads may be thermally degradable such that when the beads are exposed to an appropriate temperature change (e.g., heat), the beads degrade. Degradation or dissolution of the beads bound to the species (e.g., nucleic acid molecules, such as barcoded oligonucleotides) can result in release of the species from the beads.

It will be appreciated from the above disclosure that degradation of the beads may refer to dissociation of bound or entrained species from the beads, with and without concomitant structural degradation of the physical beads themselves. For example, the degradation of the beads may involve cleavage of cleavable bonds via one or more of the species and/or methods described elsewhere herein. In another example, the entrained species may be released from the beads by, for example, osmotic pressure differences due to chemical environmental changes. For example, changes in bead pore size due to osmotic pressure differences may typically occur without structural degradation of the beads themselves. In some cases, an increase in pore size due to osmotic swelling of the beads may allow release of the species entrained within the beads. In other cases, the osmotic shrinkage of the beads may allow the beads to better retain entrained species due to the reduced pore size.

Degradable beads can be introduced into a partition (such as a droplet or well of an emulsion) such that when appropriate stimulus is applied, the beads degrade within the partition and any associated species (e.g., oligonucleotides) are released into the droplet. The free species (e.g., oligonucleotides, nucleic acid molecules) may interact with other reagents contained in the partition. For example, polyacrylamide beads containing cystamine and linked to a barcode sequence via disulfide bonds can be combined with a reducing agent within droplets of a water-in-oil emulsion. Within the droplet, the reducing agent can break down individual disulfide bonds, resulting in bead degradation and release of the barcode sequence into the aqueous internal environment of the droplet. In another example, heating a droplet containing bead-bound barcode sequences in an alkaline solution can also result in bead degradation and release of the attached barcode sequences into the aqueous internal environment of the droplet.

Any suitable number of molecular tag molecules (e.g., primers, barcoded oligonucleotides) may be associated with the beads such that upon release from the beads, the molecular tag molecules (e.g., primers, e.g., barcoded oligonucleotides) are present in the partitions at a predefined concentration. Such predefined concentrations may be selected to facilitate certain reactions, such as amplification, for generating sequencing libraries within a partition. In some cases, the predefined concentration of the primer may be limited by the process of producing beads with nucleic acid molecules (e.g., oligonucleotides).

In some cases, the beads may be non-covalently loaded with one or more reagents. The beads may be non-covalently supported, for example, by subjecting the beads to conditions sufficient to swell the beads, allowing sufficient time for the reagent to diffuse into the interior of the beads, and subjecting the beads to conditions sufficient to deswelle the beads. Swelling of the beads may be accomplished, for example, by placing the beads in a thermodynamically favored solvent, subjecting the beads to higher or lower temperatures, subjecting the beads to higher or lower ion concentrations, and/or subjecting the beads to an electric field. Swelling of the beads can be accomplished by various swelling methods. The deswelling of the beads may be accomplished, for example, by transferring the beads into a thermodynamically unfavorable solvent, subjecting the beads to lower or higher temperatures, subjecting the beads to lower or higher ion concentrations, and/or removing the electric field. The deswelling of the beads can be accomplished by various deswelling methods. Transferring the beads may result in Kong Shousu in the beads. Shrinkage may then prevent the agent within the bead from diffusing out of the interior of the bead. This obstruction may be due to spatial interactions between the reagent and the interior of the beads. Transfer may be accomplished by microfluidics. For example, the transfer may be accomplished by moving the beads from one co-current solvent stream to a different co-current solvent stream. The swellability and/or pore size of the beads can be adjusted by changing the polymer composition of the beads.

In some cases, the acrydite moiety attached to the precursor, another species attached to the precursor, or the precursor itself may contain labile bonds, such as chemical, thermal, or photosensitive bonds, e.g., disulfide bonds, UV-sensitive bonds, and the like. Once the acrydite moiety or other moiety comprising an labile bond is incorporated into the bead, the bead may also comprise the labile bond. Labile bonds can be useful, for example, in reversibly linking (covalently linking) species (e.g., barcodes, primers, etc.) to beads. In some cases, the thermally labile bond can include an attachment based on nucleic acid hybridization (e.g., where the oligonucleotide hybridizes to a complementary sequence attached to the bead) such that the thermal melting of the hybrid releases the oligonucleotide, e.g., a sequence containing a barcode, from the support (e.g., the bead, such as a gel bead).

Adding multiple types of labile bonds to the gel beads can enable the production of beads that are capable of responding to different stimuli. Each type of labile bond may be sensitive to an associated stimulus (e.g., chemical stimulus, light, temperature, enzymatic, etc.), such that release of the species attached to the bead via each labile bond may be controlled by application of an appropriate stimulus. Such functionality may be useful for the controlled release of species from gel beads. In some cases, another species comprising an labile bond may be attached to the gel bead after the gel bead is formed via an activated functional group of the gel bead, e.g., as described above. As will be appreciated, barcodes releasably, cleavable, or reversibly attached to the beads described herein include barcodes that are released or releasable by cleavage of the bond between the barcode molecule and the bead, or released by degradation of the base bead itself, allowing the barcodes to be accessed or accessible by other reagents, or both.

In some cases, a species (e.g., an oligonucleotide molecule comprising a barcode) attached to a solid support (e.g., a bead) may comprise a U-excision element that allows release of the species from the bead. In some cases, the U-excision element can comprise a single-stranded DNA (ssDNA) sequence comprising at least one uracil. The species may be attached to the solid support via ssDNA sequences containing at least one uracil. The species may be released by a combination of uracil-DNA glycosylase (e.g., to remove uracil) and endonuclease (e.g., to induce ssDNA breaks). If the endonuclease produces a 5' phosphate group from cleavage, additional enzymatic treatments may be included in downstream processing to eliminate the phosphate group, for example, additional sequencing handle elements such as Illumina complete P5 sequence, partial P5 sequence, complete R1 sequence, and/or partial R1 sequence, before attaching additional sequencing handle elements.

Releasable barcodes as described herein may sometimes be referred to as activatable in that they are available for reaction once released. Thus, for example, an activatable barcode may be activated by releasing the barcode from the bead (or other suitable type of partition as described herein). Other activatable configurations are also contemplated in the context of the described methods and systems.

In addition to thermally cleavable bonds, disulfide bonds, and UV-sensitive bonds, other non-limiting examples of labile bonds that can be coupled to a precursor or bead include ester bonds (e.g., cleavable with an acid, base, or hydroxylamine), vicinal glycol bonds (e.g., cleavable via sodium periodate), diels-Alder (e.g., cleavable via thermal cleavage), sulfone bonds (e.g., cleavable via a base), silyl ether bonds (e.g., cleavable via an acid), glycosidic bonds (e.g., cleavable via an amylase), peptide bonds (e.g., cleavable via a protease), or phosphodiester bonds (e.g., cleavable via a nuclease (e.g., dnase)). The bond may be cleaved via other nucleic acid molecule targeting enzymes such as restriction enzymes (e.g., restriction endonucleases), as described further below.

The species may be encapsulated in the beads during bead formation (e.g., during precursor polymerization). Such species may or may not participate in the polymerization. Such species may be incorporated into the polymerization reaction mixture such that the beads produced upon bead formation comprise the species. In some cases, such species may be added to the gel beads after they are formed. Such species may include, for example, nucleic acid molecules (e.g., oligonucleotides), reagents for nucleic acid amplification reactions (e.g., primers, polymerase, dntps, cofactors (e.g., ionic cofactors), buffers) (including those described herein), reagents for enzymatic reactions (e.g., enzymes, cofactors, substrates, buffers), reagents for nucleic acid modification reactions (such as polymerization, ligation, or digestion), and/or reagents for one or more sequencing platforms (e.g.,A kind of electronic device) Reagents for template preparation (e.g., tag fragmentation). Such species may include one or more enzymes described herein, including but not limited to polymerases, reverse transcriptases, restriction enzymes (e.g., endonucleases), transposases, ligases, proteases K, DNA enzymes, and the like. Such species may include one or more agents (e.g., lysing agents, inhibitors, inactivating agents, chelating agents, stimulating agents) described elsewhere herein. The capture of such species may be controlled by the density of the polymer network generated during the precursor polymerization, the control of the ionic charge within the gel beads (e.g., via ionic species attached to the polymeric species), or by the release of other species. The encapsulated species may be released from the beads upon degradation of the beads and/or by application of a stimulus that enables release of the species from the beads. Alternatively or in addition, the species may be partitioned in the partition (e.g., droplet) during or after partition formation. Such species may include, but are not limited to, the above-described species that may also be encapsulated in beads.

The degradable beads may contain one or more species with labile bonds such that when the beads/species are exposed to an appropriate stimulus, the bonds are broken and the beads degrade. The labile bond may be a chemical bond (e.g., covalent bond, ionic bond), or may be another type of physical interaction (e.g., van der Waals interactions, dipole-dipole interactions, etc.). In some cases, the cross-linking agent used to generate the beads may contain labile bonds. Upon exposure to appropriate conditions, the labile bonds may be broken and the beads degraded. For example, when polyacrylamide gel beads containing a cystamine crosslinker are exposed to a reducing agent, the disulfide bonds of cystamine can be broken and the beads degraded.

Degradable beads can be used to release attached substances (e.g., nucleic acid molecules, barcode sequences, primers, etc.) from the beads more quickly when appropriate stimuli are applied to the beads than non-degradable beads. For example, for a substance that binds to the inner surface of a porous bead or in the case of an encapsulated substance, the substance may have higher mobility when the bead is degraded and higher accessibility to other substances in solution. In some cases, the species may also be attached to the degradable beads through degradable linkers (e.g., disulfide linkers). The degradable linker may be responsive to the same stimulus as the degradable bead, or the two degradable species may be responsive to different stimuli. For example, the barcode sequence may be attached to a polyacrylamide bead comprising cystamine via disulfide bonds. Upon exposure of the barcoded beads to the reducing agent, the beads degrade and the barcode sequence is released upon cleavage of disulfide bonds between the barcode sequence and the beads and disulfide bonds of cystamine in the beads.

It will be appreciated from the above disclosure that, although referred to as degradation of the beads, in many of the cases mentioned above, this degradation may refer to dissociation of bound or entrained species from the beads, with and without concomitant structural degradation of the physical beads themselves. For example, entrained species may be released from the beads by, for example, osmotic pressure differences due to chemical environmental changes. For example, changes in bead pore size due to osmotic pressure differences may typically occur without structural degradation of the beads themselves. In some cases, an increase in pore size due to osmotic swelling of the beads may allow release of the species entrained within the beads. In other cases, the osmotic shrinkage of the beads may allow the beads to better retain entrained species due to the reduced pore size.

Where degradable beads are provided, it may be advantageous to avoid exposing such beads to one or more stimuli that lead to such degradation prior to a given time, for example, to avoid premature degradation of the beads and problems caused by such degradation, including, for example, poor flow characteristics and aggregation. For example, where the beads contain reducible crosslinking groups such as disulfide groups, it would be desirable to avoid contacting such beads with a reducing agent (e.g., DTT or other disulfide cleavage reagent). In such cases, treatment of the beads described herein will in some cases be provided in the absence of a reducing agent (such as DTT). Since reducing agents are typically provided in commercial enzyme preparations, it may be desirable to provide an enzyme preparation that is free of reducing agents (or DTT-free) when handling the beads described herein. Examples of such enzymes include, for example, polymerase preparations, reverse transcriptase preparations, ligase preparations, and many others that may be used to treat the beads described herein. The term "reducing agent-free" or "DTT-free" formulation may refer to a formulation having a lower limit range of such materials used in degrading the beads of less than about 1/10, less than about 1/50, or even less than about 1/100. For example, for DTT, the formulation without reducing agent may have less than about 0.01 millimoles (mM), 0.005mM, 0.001mM DTT, 0.0005mM DTT, or even less than about 0.0001mM DTT. In many cases, the amount of DTT may not be detectable.

A number of chemical triggers can be used to trigger the degradation of the beads. Examples of such chemical changes may include pH-mediated changes in the integrity of the components within the beads, degradation of the bead components via cleavage of cross-links, and depolymerization of the bead components.

In some embodiments, the beads may be formed from a material comprising a degradable chemical cross-linking agent (such as BAC or cystamine). Degradation of such degradable crosslinkers can be achieved by a variety of mechanisms. In some examples, the beads may be contacted with a chemical degradation agent that induces oxidation, reduction, or other chemical change. For example, the chemical degradation agent may be a reducing agent, such as Dithiothreitol (DTT). Additional examples of reducing agents may include beta-mercaptoethanol, (2S) -2-amino-1, 4-dimercaptobutane (dithiobutylamine or DTBA), tris (2-carboxyethyl) phosphine (TCEP), or combinations thereof. The reducing agent may degrade the disulfide bonds formed between the gel precursors forming the beads, thus degrading the beads. In other cases, a change in the pH of the solution (such as an increase in pH) may trigger degradation of the beads. In other cases, exposure to an aqueous solution (such as water) may trigger hydrolytic degradation, thus degrading the beads. In some cases, any combination of stimuli may trigger degradation of the beads. For example, a change in pH may enable a chemical agent (e.g., DTT) to be an effective reducing agent.

The beads may also be induced to release their contents when a thermal stimulus is applied. The change in temperature can cause a variety of changes in the beads. For example, heat may cause the solid beads to liquefy. The change in heat may cause melting of the beads, degrading a portion of the beads. In other cases, the heat may increase the internal pressure of the bead component, causing the bead to rupture or explode. Heat may also be applied to the heat sensitive polymer used as a material for constructing the beads.

Any suitable agent can degrade the beads. In some embodiments, a change in temperature or pH can be used to degrade heat-sensitive or pH-sensitive bonds within the beads. In some embodiments, chemical degradation agents may be used to degrade chemical bonds within the beads by oxidation, reduction, or other chemical changes. For example, the chemical degradation agent may be a reducing agent, such as DTT, wherein the DTT may degrade disulfide bonds formed between the crosslinking agent and the gel precursor, thereby degrading the beads. In some embodiments, a reducing agent may be added to degrade the beads, which may or may not cause the beads to release their contents. Examples of reducing agents may include Dithiothreitol (DTT), beta-mercaptoethanol, (2S) -2-amino-1, 4-dimercaptobutane (dithiobutylamine or DTBA), tris (2-carboxyethyl) phosphine (TCEP), or combinations thereof. The reducing agent may be present at a concentration of about 0.1mM, 0.5mM, 1mM, 5mM, 10 mM. The reducing agent may be present at a concentration of at least about 0.1mM, 0.5mM, 1mM, 5mM, 10mM, or greater than 10 mM. The reducing agent may be present at a concentration up to about 10mM, 5mM, 1mM, 0.5mM, 0.1mM, or less.

Any suitable number of molecular tag molecules (e.g., primers, barcoded oligonucleotides) may be associated with the beads such that upon release from the beads, the molecular tag molecules (e.g., primers, e.g., barcoded oligonucleotides) are present in the partitions at a predefined concentration. Such predefined concentrations may be selected to facilitate certain reactions, such as amplification, for generating sequencing libraries within a partition. In some cases, the predefined concentration of the primer may be limited by the process of generating the oligonucleotide-bearing bead.

In some examples, one of the plurality of partitions may include a single biological particle (e.g., a single cell or a single nucleus of a cell). In some examples, one of the plurality of partitions may include a plurality of biological particles. Such partitions may be referred to as multiple occupied partitions, and may comprise, for example, two, three, four, or more cells and/or supports (e.g., beads) comprising barcoded nucleic acid molecules (e.g., oligonucleotides) within a single partition. Thus, as described above, the flow characteristics of the fluid containing the biological particles and/or beads and the spacer fluid can be controlled to provide such multiple occupied zones. In particular, the flow parameters may be controlled to provide a given occupancy greater than about 50%, greater than about 75%, and in some cases greater than about 80%, 90%, 95% or higher of the partitions.

In some cases, additional reagents may be delivered to the partition using additional supports (e.g., beads). In such cases, it may be advantageous to introduce different beads into a common channel or droplet-generating connection from different bead sources (e.g., containing different associated reagents) through different channel inlets into such a common channel or droplet-generating connection. In such cases, the flow and frequency of different beads into the channel or junction can be controlled to provide a specific ratio of support from each source while ensuring that a given pairing and combination of such beads enter a partition with a given number of biological particles (e.g., one biological particle and one bead per partition).

The partitions described herein may have a small volume, for example, less than about 10 microliters (L), 5L, 1L, 900 picoliters (pL), 800pL, 700pL, 600pL, 500pL, 400pL, 300pL, 200pL, 100pL, 50pL, 20pL, 10pL, 1pL, 500 nanoliters (nL), 100nL, 50nL, or less.

For example, in the case of drop-based partitioning, the total volume of the drop may be less than about 1000pL, 900pL, 800pL, 700pL, 600pL, 500pL, 400pL, 300pL, 200pL, 100pL, 50pL, 20pL, 10pL, 1pL or less. In the case of co-partitioning with a support, it is to be understood that the sample fluid volume within the partition (e.g., including co-partitioned biological particles and/or beads) may be less than about 90% of the aforementioned volume, less than about 80% of the aforementioned volume, less than about 70% of the aforementioned volume, less than about 60% of the aforementioned volume, less than about 50% of the aforementioned volume, less than about 40% of the aforementioned volume, less than about 30% of the aforementioned volume, less than about 20% of the aforementioned volume, or less than about 10% of the aforementioned volume.

As described elsewhere herein, a partition class may produce a partitioned population or multiple partitions. In such cases, any suitable number of partitions may be generated or otherwise provided. For example, at least about 1,000 partitions, at least about 5,000 partitions, at least about 10,000 partitions, at least about 50,000 partitions, at least about 100,000 partitions, at least about 500,000 partitions, at least about 1,000,000 partitions, at least about 5,000,000 partitions, at least about 10,000,000 partitions, at least about 50,000,000 partitions, at least about 100,000,000 partitions, at least about 500,000,000 partitions, at least about 1,000,000 partitions, or more partitions may be generated or otherwise provided. Further, the plurality of partitions may include unoccupied partitions (e.g., empty partitions) and occupied partitions.

Multiplex analysis

The present disclosure provides methods and systems for multiplex analysis and otherwise increasing the throughput of the analysis. For example, a single or integrated processing workflow may allow for the processing, identification, and/or analysis of more or multiple analytes, more or multiple types of analytes, and/or more or multiple types of analyte characterization. For example, in the methods and systems described herein, one or more labeling agents capable of binding to or otherwise coupling to one or more cellular features may be used to characterize a biological particle and/or cellular feature. In some cases, the cell characteristic comprises a cell surface characteristic. Cell surface features may include, but are not limited to, receptors, antigens, surface proteins, transmembrane proteins, clusters of differentiated proteins, protein channels, protein pumps, carrier proteins, phospholipids, glycoproteins, glycolipids, cell-cell interactions, protein complexes, antigen presenting complexes, major histocompatibility complexes, engineered T cell receptors, B cell receptors, chimeric antigen receptors, gap junctions and adhesion junctions, or any combination thereof. In some cases, the cellular features may include intracellular analytes, such as proteins, protein modifications (e.g., phosphorylation states or other post-translational modifications), nuclear proteins, nuclear membrane proteins, or any combination thereof. The labeling agent may include, but is not limited to, proteins, peptides, antibodies (or epitope-binding fragments thereof), lipophilic moieties (such as cholesterol), cell surface receptor binding molecules, receptor ligands, small molecules, bispecific antibodies, bispecific T cell adaptors, T cell receptor adaptors, B cell receptor adaptors, antibody prodrugs, aptamers, monoclonal antibodies, affimer, darpin, and protein scaffolds, or any combination thereof. The labeling agent may include (e.g., be linked to) a reporter oligonucleotide that indicates the cell surface characteristics to which the binding group binds. For example, the reporter oligonucleotide may comprise a barcode sequence that allows identification of the marker agent. For example, a labeling agent specific for one type of cell feature (e.g., a first cell surface feature) may have a first reporter oligonucleotide coupled thereto, while a labeling agent specific for a different cell feature (e.g., a second cell surface feature) may have a different reporter oligonucleotide coupled thereto. For a description of example labeling agents, reporter oligonucleotides, and methods of use, see, e.g., U.S. patent 10,550,429, U.S. patent publication 20190177800, and U.S. patent publication 20190367969, each of which is incorporated by reference herein in its entirety for all purposes.

In particular examples, a library of potential cellular feature labeling agents or binding groups may be provided, wherein the respective cellular feature labeling agent is associated with a nucleic acid reporter (or reporter oligonucleotide) such that a different reporter oligonucleotide sequence is associated with each labeling agent capable of binding to a particular cellular feature. In some aspects, different members of the library can be characterized by the presence of different oligonucleotide sequence tags. For example, an antibody capable of binding to a first protein may have a first reporter oligonucleotide sequence associated therewith, while an antibody capable of binding to a second protein may have a different reporter oligonucleotide sequence associated therewith. The presence of a particular oligonucleotide sequence may be indicative of the presence of a particular antibody or a cellular feature that may be recognized or bound by a particular antibody.

A labeling agent capable of binding or otherwise coupling to one or more biological particles may be used to characterize the biological particles as belonging to a particular group of biological particles. For example, the labeling agent may be used to label a sample or a group of cells, nuclei or cell beads. In this way, one set of cells may be labeled differently than another set of cells (or nuclei or beads). In one example, the first set of cells may be derived from a first sample and the second set of cells may be derived from a second sample. The tagging agent may allow the first and second sets to have different tagging agents (or reporter oligonucleotides associated with the tagging agents). This may, for example, facilitate multiplexing, wherein the cells of the first and second groups may be labeled separately and then pooled together for downstream analysis. Downstream detection of the tag may indicate that the analyte belongs to a particular group.

For example, the reporter oligonucleotide may be linked to an antibody or epitope-binding fragment thereof, and labeling the biological particle may include subjecting the antibody-linked barcode molecule or epitope-binding fragment-linked barcode molecule to conditions suitable for binding the antibody to a molecule present on the surface of the biological particle. The binding affinity between the antibody or epitope-binding fragment thereof and the molecule present on the surface may be within a useful range to ensure that the antibody or epitope-binding fragment thereof remains bound to the molecule. For example, the binding affinity may be within a useful range to ensure that the antibody or epitope-binding fragment thereof remains bound to the molecule during various sample processing steps (e.g., partitioning and/or nucleic acid amplification or extension). The dissociation constant (Kd) between the antibody or epitope-binding fragment thereof and the molecule to which it binds may be less than about 100μM,90μM,80μM,70μM,60μM,50μM,40μM,30μM,20μM,10μM,9μM,8μM,7μM,6μM,5μM,4μM,3μM,2μM,1μM,900nM,800nM,700nM,600nM,500nM,400nM,300nM,200nM,100nM,90nM,80nM,70nM,60nM,50nM,40nM,30nM,20nM,10nM,9nM,8nM,7nM,6nM,5nM,4nM,3nM,2nM,1nM,900pM,800pM,700pM,600pM,500pM,400pM,300pM,200pM,100pM,90pM,80pM,70pM,60pM,50pM,40pM,30pM,20pM,10pM,9pM,8pM,7pM,6pM,5pM,4pM,3pM,2pM or 1pM. For example, the dissociation constant may be less than about 10 μm.

In another example, the reporter oligonucleotide may be coupled to a Cell Penetrating Peptide (CPP), and labeling the cell may include delivering the CPP-coupled reporter oligonucleotide into a biological particle. Labeling the biological particle may include delivering the CPP-conjugated oligonucleotide into the cell and/or cell bead by a cell penetrating peptide. The cell penetrating peptide that may be used in the methods provided herein may comprise at least one nonfunctional cysteine residue, which may be free or derivatized, for disulfide bond formation with an oligonucleotide that has been modified for such bond formation. Non-limiting examples of cell penetrating peptides that may be used in embodiments herein include permeabilizing agents, transporters, plsl, TAT (48-60), pVEC, MTS, and MAP. Cell penetrating peptides useful in the methods provided herein may have the ability to induce cell penetration of at least about 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95%, 96%, 97%, 98%, 99% or 100% of cells of a cell population. The cell penetrating peptide may be an arginine-rich peptide transporter. The cell penetrating peptide may be a permeant or Tat peptide.

In another example, the reporter oligonucleotide may be coupled to a fluorophore or dye, and labeling the cell (or nucleus or cell bead) may include subjecting the fluorophore-linked barcode molecule to conditions suitable for binding the fluorophore to the surface of the biological particle. In some cases, the fluorophore can interact strongly with the lipid bilayer, and labeling the biological particle can include subjecting the fluorophore-linked barcode molecule to conditions such that the fluorophore binds to or intercalates into the membrane of the biological particle. In some cases, the fluorophore is a water-soluble organic fluorophore. In some cases, the fluorophore is Alexa532 maleimide, tetramethylrhodamine-5-maleimide (TMR maleimide), BODIPY-TMR maleimide, sulfo-Cy 3 maleimide, alexa 546 carboxylic acid/succinimidyl ester, atto 550 maleimide, cy3 carboxylic acid/succinimidyl ester, cy3B carboxylic acid/succinimidyl ester, atto 565 biotin, sulforhodamine B, alexa 594 maleimide, texas Red maleimide, alexa 633 maleimide, abberior STAR 635P azide, atto647N maleimide, atto647 SE, or sulfo-Cy 5 maleimide. See, for example, hughes L D et al, PLoS one.2014, month 2, 4; 9 (2): e87649, which is hereby incorporated by reference in its entirety for all purposes, for the purpose of illustrating organic fluorophores.

The reporter oligonucleotide may be coupled to a lipophilic molecule and labeling the biological particle may include delivering the nucleic acid barcode molecule from the lipophilic molecule to a membrane or nuclear membrane of the biological particle. The lipophilic molecules may be associated with and/or intercalated into lipid membranes, such as cell membranes and nuclear membranes. In some cases, the insertion may be reversible. In some cases, the binding between the lipophilic molecule and the biological particle may be such that the biological particle retains the lipophilic molecule (e.g., and its bound components, such as a nucleic acid barcode molecule) during subsequent processing (e.g., partitioning, cell permeabilization, amplification, pooling, etc.). The reporter nucleotide may enter the intracellular space and/or nucleus.

The reporter oligonucleotide may be part of a nucleic acid molecule comprising any number of functional sequences as described elsewhere herein, such as a target capture sequence, a random primer sequence, etc., and coupled to another nucleic acid molecule that is or is derived from an analyte.

Cells (or nuclei or beads) may be incubated with a library of labeling agents, either before, during or after partitioning, which may be labeling agents for a wide variety of different cellular features (e.g., receptors, proteins, etc.), and include their associated reporter oligonucleotides. Unbound labeling agent can be washed from cells, and these cells (or nuclei or cell beads) can then be co-partitioned (e.g., co-partitioned into droplets or wells) as described elsewhere herein along with the partition-specific barcode oligonucleotides (e.g., attached to a support such as a bead or gel bead). Thus, a partition may include one or more cells as well as bound labeling agents and their known associated reporter oligonucleotides.

In other cases, for example to facilitate sample multiplexing, a labeling agent specific for a particular cellular feature may have a first plurality of labeling agents (e.g., antibodies or lipophilic moieties) coupled to a first reporter oligonucleotide and a second plurality of labeling agents coupled to a second reporter oligonucleotide. For example, the first plurality of labeling agents and the second plurality of labeling agents may interact with different cells, cell populations, or samples, thereby allowing a particular reporter oligonucleotide to indicate a particular cell population (or cell or sample) and cell characteristics. In this way, different samples or groups may be processed independently and then combined together for pooled analysis (e.g., partition-based barcoding as described elsewhere herein). See, for example, U.S. patent publication 20190323088, which is hereby incorporated by reference in its entirety for all purposes.

As described elsewhere herein, a library of markers can be associated with a particular cellular feature, and can also be used to identify that an analyte originates from a particular biological particle, population, or sample. The biological particles may be incubated with multiple libraries, and a given biological particle may contain multiple labeling agents. For example, the cells may comprise a lipophilic labelling agent and an antibody coupled thereto. The lipophilic labelling agent may indicate that the cell is a member of a particular cell sample, and the antibody may indicate that the cell contains a particular analyte. In this way, the reporter oligonucleotide and the labeling agent may allow for multiplex analysis of multiple analytes.

In some cases, these reporter oligonucleotides may comprise nucleic acid barcode sequences that allow identification of the labeling agent to which the reporter oligonucleotide is coupled. The use of oligonucleotides as reporters can provide the advantage of being able to generate significant diversity in sequence while also being readily attachable to most biomolecules (e.g., antibodies, etc.), and being readily detectable (e.g., using sequencing or array techniques).

The attachment (coupling) of the reporter oligonucleotide to the labeling agent may be accomplished by any of a variety of direct or indirect, covalent or non-covalent associations or linkages. For example, oligonucleotides can be conjugated using chemical conjugation techniques (e.g., available from Innova BiosciencesAntibody labeling kit) to a portion of a labeling agent (such as a protein, e.g., an antibody or antibody fragment), and using other non-covalent attachment mechanisms, e.g., using biotinylated antibodies and oligonucleotides with avidin or streptavidin linkers (or beads comprising one or more biotinylated linkers coupled to the oligonucleotides). Antibodies and oligonucleotide biotinylation techniques are available. See, for example, fang et al, ,"Fluoride-Cleavable Biotinylation Phosphoramidite for5′-end-Labelling and Affinity Purification of Synthetic Oligonucleotides,"Nucleic Acids Res.2003, 1, 15, 31 (2): 708-715, which is incorporated herein by reference in its entirety for all purposes. Also, protein and peptide biotinylation techniques have been developed and are ready for use. See, for example, U.S. patent No. 6,265,552, incorporated by reference herein in its entirety for all purposes. In addition, click chemistry such as methyltetrazine-PEG 5-NHS ester reaction, TCO-PEG4-NHS ester reaction, and the like can be used to couple the reporter oligonucleotide to the labeling agent. Commercially available kits (such as those from Thunderlink and Abcam) may be used to couple the reporter oligonucleotide to the labeling agent as appropriate. In another example, the labeling agent is coupled indirectly (e.g., via hybridization) to a reporter oligonucleotide that comprises a barcode sequence that identifies the labeling agent. For example, the labeling agent can be directly coupled (e.g., covalently bound) to a hybridization oligonucleotide comprising a sequence that hybridizes to a sequence of the reporter oligonucleotide. Hybridization of the hybridization oligonucleotide to the reporter oligonucleotide couples the labeling agent to the reporter oligonucleotide. In some embodiments, the reporter oligonucleotide may be released from the tagging agent, such as upon application of a stimulus. For example, the reporter oligonucleotide may be linked to the labeling agent by an labile bond (e.g., chemically labile, photolabile, thermally labile, etc.), as generally described elsewhere herein for release of molecules from the support. In some cases, the reporter oligonucleotides described herein may include one or more functional sequences useful for subsequent processing, such as an adapter sequence, a Unique Molecular Identifier (UMI) sequence, a sequencer-specific flow cell ligation sequence (such as a P5, P7 or partial P5 or P7 sequence), a primer or primer binding sequence, a sequencing primer or primer binding sequence (such as an R1, R2 or partial R1 or R2 sequence).

In some cases, the labeling agent may comprise a reporter oligonucleotide and a tag. The label may be a fluorophore, a radioisotope, a molecule capable of undergoing a colorimetric reaction, a magnetic particle, or any other suitable molecule or compound capable of detection. The tag may be conjugated directly or indirectly to a labeling agent (or reporter oligonucleotide) (or the tag may be conjugated to a molecule that can bind to a labeling agent or reporter oligonucleotide). In some cases, the tag is conjugated to an oligonucleotide that is complementary to the sequence of the reporter oligonucleotide, and the oligonucleotide may be allowed to hybridize to the reporter oligonucleotide.

FIG. 11 depicts an exemplary labeling agent (1110, 1120, 1130) comprising a reporter oligonucleotide (1140) attached thereto. The labeling agent 1110 (e.g., any of the labeling agents described herein) is attached (either directly (e.g., covalently) or indirectly) to the reporter oligonucleotide 1140. Reporter oligonucleotide 1140 may comprise barcode sequence 1142 identifying marker 1110. Reporter oligonucleotide 1140 may also comprise one or more functional sequences useful for subsequent processing, such as an adapter sequence, a Unique Molecular Identifier (UMI) sequence, a sequencer-specific flow cell attachment sequence (such as P5, P7 or a portion of P5 or P7 sequences), a primer or primer binding sequence, a sequencing primer or primer binding sequence (such as R1, R2 or a portion of R1 or R2 sequences).

Referring to fig. 11, in some cases, reporter oligonucleotide 1140 conjugated to a labeling agent (e.g., 1110, 1120, 1130) comprises a primer sequence 1141, a barcode sequence identifying the labeling agent (e.g., 1110, 1120, 1130), and a functional sequence 1143. Functional sequence 1143 may be configured to hybridize to complementary sequences, such as those present on nucleic acid barcode molecule 1190 (not shown), such as those described elsewhere herein. In some cases, nucleic acid barcode molecules 1190 are attached to a support (e.g., a bead, such as a gel bead), such as those described elsewhere herein. For example, nucleic acid barcode molecule 1190 may be attached to a support via releasable bonds (e.g., including labile bonds), such as those described elsewhere herein. In some cases, reporter oligonucleotide 1140 comprises one or more additional functional sequences, such as those described above.

In some cases, the tagging agent 1110 is a protein or polypeptide (e.g., an antigen or a desired antigen) comprising a reporter oligonucleotide 1140. Reporter oligonucleotide 1140 comprises a barcode sequence 1142 that identifies polypeptide 1110 and can be used to infer the presence of an analyte (e.g., a binding partner of polypeptide 1110 (e.g., a molecule or compound to which polypeptide 1110 can bind)). In some cases, the tagging agent 1110 is a lipophilic moiety (e.g., cholesterol) comprising the reporter oligonucleotide 1140, wherein the lipophilic moiety is selected such that the tagging agent 1110 is integrated into a cell membrane or nucleus. Reporter oligonucleotide 1140 comprises a barcode sequence 1142 identifying a lipophilic moiety 1110, which in some cases is used to tag cells (e.g., clustered cells, cell samples, etc.) and can be used to perform multiplex assays as described elsewhere herein. In some cases, the labeling agent is an antibody 1120 (or epitope-binding fragment thereof) comprising a reporter oligonucleotide 1140. Reporter oligonucleotide 1140 comprises a barcode sequence 1142 that identifies antibody 1120 and can be used to infer, for example, the presence of a target of antibody 1120 (e.g., a molecule or compound to which antibody 1120 binds). In other embodiments, the labeling agent 1130 comprises an MHC molecule 1131 with a peptide 1132 and a reporter oligonucleotide 1140 that identifies the peptide 1132. In some cases, MHC molecules are coupled to support 1133. In some cases, support 1133 may be a polypeptide (such as streptavidin), or a polysaccharide (such as dextran). In some cases, reporter oligonucleotide 1140 can be coupled directly or indirectly to MHC-labeling agent 1130 in any suitable manner. For example, reporter oligonucleotide 1140 may be coupled to MHC molecule 1131, support 1133, or peptide 1132. In some embodiments, the labeling agent 1130 comprises a plurality of MHC molecules (e.g., MHC multimers, which may be coupled to a support (e.g., 1133)). There are many possible configurations of class I and/or class II MHC multimers that can be used with the compositions, methods, and systems disclosed herein, e.g., MHC tetramers, MHC pentamers (MHC assembled via coiled-coil domains, e.g., MHCMHC class I pentamer (promimune, ltd.)), MHC octamers, MHC dodecamers, MHC-decorated dextran molecules (e.g., MHC)(Immudex)) and the like. For a description of example labeling agents (including antibody and MHC-based labeling agents), reporter oligonucleotides, and methods of use, see, e.g., U.S. patent 10,550,429 and U.S. patent publication 20190367969, each of which is incorporated by reference herein in its entirety for all purposes.

Fig. 13 shows another example of a bead carrying a bar code. In some embodiments, the analysis of multiple analytes (e.g., RNA and one or more analytes, using the labeling agents described herein) can include a nucleic acid barcode molecule, as generally depicted in fig. 13. In some embodiments, nucleic acid barcode molecules 1310 and 1320 are attached to support 1330 via releasable bonds 1340 (e.g., including labile bonds) as described elsewhere herein. The nucleic acid barcode molecule 1310 may comprise an adaptor sequence 1311, a barcode sequence 1312, and an adaptor sequence 1313. The nucleic acid barcode molecule 1320 may comprise an adapter sequence 1321, a barcode sequence 1312, and an adapter sequence 1323, wherein the adapter sequence 1323 comprises a different sequence than the adapter sequence 1313. In some cases, the adapter 1311 and the adapter 1321 comprise the same sequence. In some cases, the adapter 1311 and the adapter 1321 comprise different sequences. Although support 1330 is shown to contain nucleic acid barcode molecules 1310 and 1320, any suitable number of barcode molecules that contain a common barcode sequence 1312 are contemplated herein. For example, in some embodiments, support 1330 further comprises a nucleic acid barcode molecule 1350. The nucleic acid barcode molecule 1350 can comprise an adaptor sequence 1351, a barcode sequence 1312, and an adaptor sequence 1353, wherein the adaptor sequence 1353 comprises a different sequence than the adaptor sequences 1313 and 1323. In some cases, the nucleic acid barcode molecule (e.g., 1310, 1320, 1350) comprises one or more additional functional sequences, such as UMI or other sequences described herein. The nucleic acid barcode molecules 1310, 1320, or 1350 may interact with an analyte as described elsewhere herein, e.g., as depicted in fig. 12A-C.

Referring to FIG. 12A, in the case of labeling cells with a labeling agent, sequence 1223 may be complementary to the adaptor sequence of the reporter oligonucleotide. The cells (or nuclei or cell beads) may be contacted with one or more reporter oligonucleotide 1220 conjugated tagging agents 1210 (e.g., polypeptides, antibodies, or other tagging agents described elsewhere herein). In some cases, the cells (or nuclei or cell beads) may be further processed prior to bar coding. For example, such treatments may include one or more washing and/or cell sorting operations. In some cases, cells bound to a labeling agent 1210 conjugated to an oligonucleotide 1220 and a support 1230 (e.g., a bead, such as a gel bead) comprising nucleic acid barcode molecules 1290 are partitioned into one of a plurality of partitions (e.g., a droplet of a droplet emulsion or a well of a microwell array). In some cases, the partition comprises at most a single cell bound to the labeling agent 1210. In some cases, the reporter oligonucleotide 1220 conjugated to a labeling agent 1210 (e.g., a polypeptide, antibody, pMHC molecule such as MHC multimer, etc.) comprises a first adapter sequence 1211 (e.g., a primer sequence), a barcode sequence 1212 identifying the labeling agent 1210 (e.g., a peptide of a polypeptide, antibody, or pMHC molecule or complex), and an adapter sequence 1213. The adaptor sequence 1213 may be configured to hybridize to a complementary sequence, such as sequence 1223 present on the nucleic acid barcode molecule 1290. In some cases, oligonucleotide 1220 comprises one or more additional functional sequences, such as those described elsewhere herein.

The barcoded nucleic acids may be generated from the constructs described in fig. 12A-C (e.g., via a nucleic acid reaction, such as nucleic acid extension or ligation). For example, sequence 1213 can then be hybridized with complementary sequence 1223 to produce (e.g., via a nucleic acid reaction, such as nucleic acid extension or ligation) a barcoded nucleic acid molecule comprising cellular (e.g., partition-specific) barcode sequence 1222 (or its reverse complement) and reporter barcode sequence 1212 (or its reverse complement). The barcoded nucleic acid molecules can then optionally be treated as described elsewhere herein, for example, to amplify the molecules and/or to supplement the sequencing platform specific sequences to the fragments. See, for example, U.S. patent publication 2018/0105808, which is hereby incorporated by reference in its entirety for all purposes. The barcoded nucleic acid molecules or derivatives generated therefrom can then be sequenced on a suitable sequencing platform.

In some embodiments, multiple analytes (e.g., a nucleic acid and one or more analytes, using a labeling agent as described herein) can be analyzed. For example, the workflow may include the workflow generally depicted in any of fig. 12A-12C, or a combination of workflows for individual analytes as described elsewhere herein. For example, multiple analytes may be analyzed using a combination of the workflows generally depicted in fig. 12A-12C.

In some cases, analysis of analytes (e.g., nucleic acids, polypeptides, carbohydrates, lipids, etc.) includes the workflow generally depicted in fig. 12A. Nucleic acid barcode molecule 1290 may be co-partitioned with one or more analytes. In some cases, nucleic acid barcode molecules 1290 are attached to a support 1230 (e.g., a bead, such as a gel bead), such as those described elsewhere herein. For example, nucleic acid barcode molecules 1290 can be attached to support 1230 via releasable bonds 1240 (e.g., including labile bonds), such as those described elsewhere herein. Nucleic acid barcode molecule 1290 may comprise barcode sequence 1221 and optionally other additional sequences, such as UMI sequence 1222 (or other functional sequences described elsewhere herein). Nucleic acid barcode molecule 1290 may comprise sequence 1223, which may be complementary to another nucleic acid sequence such that it may hybridize to a particular sequence.

For example, sequence 1223 may comprise a poly-T sequence and may be used to hybridize to mRNA. Referring to fig. 12C, in some embodiments, nucleic acid barcode molecule 1290 comprises a sequence 1223 that is complementary to the sequence of RNA molecule 1260 from the cell. In some cases, sequence 1223 comprises a sequence specific for an RNA molecule. Sequence 1223 may comprise a known sequence or a targeting sequence, or a random sequence. In some cases, a nucleic acid extension reaction can be performed to produce a barcoded nucleic acid product comprising sequence 1223, barcode sequence 1221, UMI sequence 1222, any other functional sequences, and a sequence corresponding to RNA molecule 1260.

In another example, sequence 1223 may be complementary to an overhang sequence or an adapter sequence that has been added to the analyte. For example, referring to fig. 12B diagram 1201, in some embodiments, primer 1250 comprises a sequence that is complementary to a sequence of a nucleic acid molecule 1260 (such as an RNA encoding a BCR sequence) from a biological particle. In some cases, primer 1250 comprises one or more sequences 1251 that are not complementary to RNA molecule 1260. Sequence 1251 may be a functional sequence as described elsewhere herein, e.g., an adapter sequence, a sequencing primer sequence, or a sequence that facilitates coupling to a flow cell of a sequencer. In some cases, primer 1250 comprises a poly-T sequence. In some cases, primer 1250 comprises a sequence that is complementary to a target sequence in an RNA molecule. In some cases, primer 1250 comprises a sequence that is complementary to a region of an immune molecule (such as a constant region of a TCR or BCR sequence). Primer 1250 hybridizes to nucleic acid molecule 1260 and generates complementary molecule 1270 (see panel 1202). For example, complementary molecule 1270 may be a cDNA produced in a reverse transcription reaction. In some cases, additional sequences may be added to the complementary molecule 1270. For example, reverse transcriptase may be selected such that several non-template bases 1280 (e.g., poly-C sequences) are added to the cDNA. In another example, terminal transferases may also be used to supplement the additional sequence. Nucleic acid barcode molecule 1290 comprises a sequence 1224 complementary to a non-template base, and reverse transcriptase performs a template switching reaction on nucleic acid barcode molecule 1290 to produce a barcoded nucleic acid molecule comprising a cellular (e.g., partition specific) barcode sequence 1222 (or reverse complement thereof) and a complementary molecule 1270 sequence (or portion thereof). In some cases, sequence 1223 comprises a sequence complementary to a region of an immune molecule (such as a constant region of a TCR or BCR sequence). Sequence 1223 hybridizes to nucleic acid molecule 1260, producing complementary molecule 1270. For example, complementary molecule 1270 may be generated in a reverse transcription reaction that generates a barcoded nucleic acid molecule comprising a cell (e.g., partition specific) barcode sequence 1222 (or reverse complement thereof) and a complementary molecule 1270 sequence (or a portion thereof). Additional methods and compositions suitable for barcoding cdnas generated from mRNA transcripts, including those encoding the V (D) J region of immune cell receptors, and/or including template switching oligonucleotides are described in international patent application WO2018/075693, U.S. patent publication No. 2018/0105808, U.S. patent publication No. 2015/0376609 filed at 26 months 2015, and U.S. patent publication No. 2019/0367969, each of which is incorporated herein by reference in its entirety for all purposes.

Reagent(s)

According to certain aspects, the biological particles may be partitioned along with the lysing agent to release the contents of the biological particles within the partition. In such cases, the lysing agent may be contacted with the biological particle suspension at the same time as or immediately prior to introduction of the biological particles into the separation junction/droplet generation zone (e.g., junction 210), such as through one or more additional channels upstream of the channel junction. According to other aspects, additionally or alternatively, the biological particles may be separated along with other reagents, as will be described further below.

Methods and systems of the present disclosure may include microfluidic devices that may be used to co-separate biological particles or biological particles from reagents and methods of use thereof. Such systems and methods are described in U.S. patent publication No. US/20190367997, which is incorporated by reference herein in its entirety for all purposes.

Advantageously, when the lysing agent and the biological particles are co-partitioned, the lysing agent may facilitate release of the contents of the biological particles within the partition. The contents released in a partition may remain discrete from the contents of other partitions.

It should be understood that the channel segments of the microfluidic devices described elsewhere herein may be coupled to any of a variety of different fluid sources or receiving components, including reservoirs, tubes, manifolds, or other system fluid components. It should be appreciated that the microfluidic channel structure may have a variety of geometries and/or configurations. For example, a microfluidic channel structure may have more than two channel connections. For example, a microfluidic channel structure may have 2,3, 4,5 or more channel segments each carrying the same or different types of beads, reagents and/or biological particles, which meet at a channel junction. The fluid flow in each channel segment can be controlled to control the separation of different elements into droplets. Fluid may be directed to flow along one or more channels or reservoirs via one or more fluid flow units. The fluid flow unit may include a compressor (e.g., providing positive pressure), a pump (e.g., providing negative pressure), an actuator, etc., to control the flow of fluid. The fluid may also or alternatively be controlled via an applied pressure differential, centrifugal force, electric pumping, vacuum, capillary or gravity flow, or the like.

Examples of lysing agents include bioactive agents, such as, for example, lysing enzymes for lysing different cell types (e.g., gram positive or negative bacteria, plants, yeast, mammals, etc.), such as lysozyme, leucopeptidase, lysostaphin, labiase, rhizoctonia solani lyase (kitalase), lywallase, and a variety of other lysing enzymes available from, for example, sigma-Aldrich, inc. (St Louis, MO), as well as other commercially available lysing enzymes. Other lysing agents may additionally or alternatively be co-partitioned with the biological particles to cause the contents of the biological particles to be released into the partition. For example, in some cases, cells may be lysed using surfactant-based lysis solutions, but these solutions may be less desirable for emulsion-based systems where surfactants may interfere with stable emulsions. In some cases, the lysis solution may contain nonionic surfactants, such as Triton X-100 and Tween 20. In some cases, the lysis solution may contain ionic surfactants such as sodium dodecyl sarcosinate and Sodium Dodecyl Sulfate (SDS). Electroporation, thermal, acoustic or mechanical cell disruption may also be used in certain situations, for example non-emulsion based partitioning, such as encapsulation of biological particles, which may be in addition to or instead of droplet partitioning, wherein any pore size of the encapsulate is sufficiently small to retain a nucleic acid fragment of a given size after cell disruption.

Alternatively or in addition to the lysis agent co-separated from the biological particles described above, other agents may also be co-separated from the biological particles, including, for example, dnase and rnase inactivating agents or inhibitors, e.g., proteinase K, chelating agents such as EDTA, and other agents for removing or otherwise reducing the negative activity or impact of different cell lysate components on subsequent nucleic acid treatment. Furthermore, in the case of encapsulated biological particles (e.g., cells or nuclei in a polymer matrix), the biological particles may be exposed to an appropriate stimulus to release the biological particles or their contents from a co-partitioned support (e.g., beads). For example, in some cases, chemical stimuli may be co-compartmentalized with encapsulated biological particles to allow for support degradation and release of cells or their contents into a larger compartment. In some cases, the stimulus may be the same as the stimulus described elsewhere herein for releasing nucleic acid molecules (e.g., oligonucleotides) from their respective supports (e.g., beads). In alternative examples, this may be a different and non-overlapping stimulus, so as to allow the encapsulated biological particles to be released into the partition at a different time than the nucleic acid molecules are released into the same partition. For a description of methods, compositions, and systems for encapsulating cells (also referred to as "cell beads"), see, e.g., U.S. patent 10,428,326 and U.S. patent publication 20190100632, each of which is incorporated by reference in its entirety.

Additional reagents such as endonucleases can also be co-partitioned with the biological particles to fragment DNA of the biological particles, DNA polymerase and dntps used to amplify nucleic acid fragments of the biological particles, and to attach barcode molecular tags to amplified fragments. Other enzymes may be co-partitioned, including, but not limited to, polymerases, transposases, ligases, proteases K, DNA enzymes, and the like. Additional reagents may also include reverse transcriptase (including enzymes having terminal transferase activity), primers and oligonucleotides, and switch oligonucleotides (also referred to herein as "switch oligonucleotides" or "template switch oligonucleotides") that may be used for template switching. In some cases, template switching may be used to increase the length of the cDNA. In some cases, template switching may be used to supplement a predefined nucleic acid sequence to the cDNA. In the example of template switching, the cDNA may be produced by reverse transcription of a template (e.g., cellular mRNA), where a reverse transcriptase having terminal transferase activity may add additional nucleotides, such as poly-C, to the cDNA in a template-independent manner. The transition oligonucleotide may comprise a sequence complementary to an additional nucleotide, such as poly-G. An additional nucleotide on the cDNA (e.g., polyC) may hybridize to an additional nucleotide on the switch oligonucleotide (e.g., polyG), whereby the reverse transcriptase may use the switch oligonucleotide as a template to further extend the cDNA. The template switching oligonucleotide may comprise a hybridization region and a template region. The hybridization region may comprise any sequence capable of hybridizing to a target. In some cases, as previously described, the hybridization region comprises a series of G bases to complement the overhanging C base at the 3' end of the cDNA molecule. The series of G bases can include 1G base, 2G bases, 3G bases, 4G bases, 5G bases, or more than 5G bases. The template sequence may comprise any sequence to be incorporated into the cDNA. In some cases, the template region comprises at least 1 (e.g., at least 2,3,4, 5, or more) tag sequences and/or functional sequences. The transition oligonucleotide may comprise deoxyribonucleic acid, ribonucleic acid, modified nucleic acids including 2-aminopurine, 2, 6-diaminopurine (2-amino-dA), inverted dT, 5-methyl dC, 2' -deoxyinosine, super T (5-hydroxybutyrine-2 ' -deoxyuridine), super G (8-aza-7-deazaguanosine), locked Nucleic Acid (LNA), unlocked nucleic acid (UNA, e.g., UNA-A, UNA-U, UNA-C, UNA-G), iso-dG, iso-dC, 2' fluoro bases (e.g., fluoro C, fluoro U, fluoro A, and fluoro G), or any combination.

In some cases, the transition oligonucleotide may be at least about 2,3,4,5,6,7,8,9,10,11,12,13,14,15,16,17,18,19,20,21,22,23,24,25,26,27,28,29,30,31,32,33,34,35,36,37,38,39,40,41,42,43,44,45,46,47,48,49,50,51,52,53,54,55,56,57,58,59,60,61,62,63,64,65,66,67,68,69,70,71,72,73,74,75,76,77,78,79,80,81,82,83,84,85,86,87,88,89,90,91,92,93,94,95,96,97,98,99,100,101,102,103,104,105,106,107,108,109,110,111,112,113,114,115,116,117,118,119,120,121,122,123,124,125,126,127,128,129,130,131,132,133,134,135,136,137,138,139,140,141,142,143,144,145,146,147,148,149,150,151,152,153,154,155,156,157,158,159,160,161,162,163,164,165,166,167,168,169,170,171,172,173,174,175,176,177,178,179,180,181,182,183,184,185,186,187,188,189,190,191,192,193,194,195,196,197,198,199,200,201,202,203,204,205,206,207,208,209,210,211,212,213,214,215,216,217,218,219,220,221,222,223,224,225,226,227,228,229,230,231,232,233,234,235,236,237,238,239,240,241,242,243,244,245,246,247,248,249 or 250 nucleotides in length or more.

In some cases, the transition oligonucleotide may be up to about 2,3,4,5,6,7,8,9,10,11,12,13,14,15,16,17,18,19,20,21,22,23,24,25,26,27,28,29,30,31,32,33,34,35,36,37,38,39,40,41,42,43,44,45,46,47,48,49,50,51,52,53,54,55,56,57,58,59,60,61,62,63,64,65,66,67,68,69,70,71,72,73,74,75,76,77,78,79,80,81,82,83,84,85,86,87,88,89,90,91,92,93,94,95,96,97,98,99,100,101,102,103,104,105,106,107,108,109,110,111,112,113,114,115,116,117,118,119,120,121,122,123,124,125,126,127,128,129,130,131,132,133,134,135,136,137,138,139,140,141,142,143,144,145,146,147,148,149,150,151,152,153,154,155,156,157,158,159,160,161,162,163,164,165,166,167,168,169,170,171,172,173,174,175,176,177,178,179,180,181,182,183,184,185,186,187,188,189,190,191,192,193,194,195,196,197,198,199,200,201,202,203,204,205,206,207,208,209,210,211,212,213,214,215,216,217,218,219,220,221,222,223,224,225,226,227,228,229,230,231,232,233,234,235,236,237,238,239,240,241,242,243,244,245,246,247,248,249 or 250 nucleotides in length.

Once the contents of the cells (or nuclei or cell beads) are released into their respective partitions, the macromolecular components contained therein (e.g., macromolecular components of the biological particles such as RNA, DNA, or proteins) may be further processed within the partitions. According to the methods and systems described herein, the macromolecular component contents of individual biological particles may be provided with unique identifiers such that when characterizing those macromolecular components, they may be attributed as having been derived from one or more identical biological particles. The ability to attribute features to individual biological particles or groups of biological particles is provided by assigning unique identifiers Fu Te to individual biological particles or groups of biological particles. Individual biological particles or groups of biological particles may be assigned or associated with a unique identifier, for example in the form of a nucleic acid barcode, in order to tag or label the macromolecular components of the biological particles (and thus their characteristics) with the unique identifier. These unique identifiers can then be used to attribute the components and characteristics of the biological particles to individual biological particles or groups of biological particles.

In some aspects, this is performed by co-segregating individual biological particles or groups of biological particles from a unique identifier, such as described above (with reference to fig. 2). In some aspects, the unique identifier is provided in the form of a nucleic acid molecule (e.g., an oligonucleotide) comprising a nucleic acid barcode sequence that can be attached to or otherwise associated with the nucleic acid content of a separate biological particle or to other components of the biological particle, particularly to fragments of such nucleic acids. The nucleic acid molecules are partitioned such that, when between nucleic acid molecules in a given partition, the nucleic acid barcode sequences contained therein are identical, but when between different partitions, the nucleic acid molecules may and do have different barcode sequences, or at least represent a large number of different barcode sequences in all partitions in a given analysis. In some aspects, only one nucleic acid barcode sequence may be associated with a given partition, but in some cases, there may be two or more different barcode sequences.

The nucleic acid barcode sequence may comprise about 6 to about 20 or more nucleotides within the sequence of a nucleic acid molecule (e.g., an oligonucleotide). The nucleic acid barcode sequence may comprise about 6 to about 20,30,40,50,60,70,80,90,100 or more nucleotides. In some cases, the barcode sequence may be about 6,7,8,9,10,11,12,13,14,15,16,17,18,19,20 nucleotides in length or longer. In some cases, the barcode sequence may be at least about 6,7,8,9,10,11,12,13,14,15,16,17,18,19,20 nucleotides in length or longer. In some cases, the barcode sequence may be up to about 6,7,8,9,10,11,12,13,14,15,16,17,18,19,20 nucleotides in length or less. These nucleotides may be completely contiguous, e.g., in a single stretch of adjacent nucleotides, or they may be divided into two or more separate subsequences separated by 1 or more nucleotides. In some cases, the separate barcode sequences may be about 4 to about 16 nucleotides in length. In some cases, the barcode sequence may be about 4,5,6,7,8,9,10,11,12,13,14,15,16 nucleotides or longer. In some cases, the barcode sequence may be at least about 4,5,6,7,8,9,10,11,12,13,14,15,16 nucleotides or longer. In some cases, the barcode sequence may be up to about 4,5,6,7,8,9,10,11,12,13,14,15,16 nucleotides or less.

The co-partitioned nucleic acid molecules may also comprise other functional sequences that may be used in the processing of nucleic acids from the co-partitioned biological particles. These sequences include, for example, targeting or random/universal amplification primer sequences (for amplifying nucleic acids (e.g., mRNA, genomic DNA) from individual biological particles within a partition, while ligating associated barcode sequences), sequencing primers or primer recognition sites, hybridization or detection sequences (e.g., nucleic acids for identifying the presence of sequences or for pulling down a barcode), or any of a number of other potential functional sequences. Other mechanisms of co-partitioning oligonucleotides may also be employed, including, for example, coalescence of two or more droplets, one of which contains the oligonucleotide, or microdispensing the oligonucleotide (e.g., attached to a bead) into a partition, such as a droplet within a microfluidic system.

In one example, a support, such as a bead, is provided that each includes a plurality of the above-described barcoded nucleic acid molecules (e.g., barcoded oligonucleotides) releasably attached to the bead, wherein all nucleic acid molecules attached to a particular bead will include the same nucleic acid barcode sequence, but exhibit a plurality of diverse barcode sequences among the population of beads used. In some embodiments, hydrogel beads comprising, for example, a polyacrylamide polymer matrix are used as solid support and delivery vehicles for nucleic acid molecules into the partitions, as they are capable of carrying a large number of nucleic acid molecules, and may be configured to release these nucleic acid molecules upon exposure to a specific stimulus, as described elsewhere herein. In some cases, the bead population provides a diverse barcode sequence library comprising at least about 1,000 different barcode sequences, at least about 5,000 different barcode sequences, at least about 10,000 different barcode sequences, at least about 50,000 different barcode sequences, at least about 100,000 different barcode sequences, at least about 1,000,000 different barcode sequences, at least about 5,000,000 different barcode sequences, or at least about 10,000,000 different barcode sequences or more. In addition, each bead may be provided with a large number of nucleic acid (e.g., oligonucleotide) molecules attached. In particular, the number of molecules of a nucleic acid molecule comprising a barcode sequence on an individual bead can be at least about 1,000 nucleic acid molecules, at least about 5,000 nucleic acid molecules, at least about 10,000 nucleic acid molecules, at least about 50,000 nucleic acid molecules, at least about 100,000 nucleic acid molecules, at least about 500,000 nucleic acid molecules, at least about 1,000,000 nucleic acid molecules, at least about 5,000,000 nucleic acid molecules, at least about 10,000,000 nucleic acid molecules, at least about 50,000,000 nucleic acid molecules, at least about 100,000,000 nucleic acid molecules, at least about 250,000,000 nucleic acid molecules, and in some cases at least about 10 hundred million nucleic acid molecules or more. The nucleic acid molecules of a given bead may comprise the same (or a common) barcode sequence, different barcode sequences, or a combination of both. The nucleic acid molecules of a given bead may include multiple sets of nucleic acid molecules. A given set of nucleic acid molecules may include identical barcode sequences. The same barcode sequence may be different from the barcode sequence of another set of nucleic acid molecules.

In addition, when partitioning a population of beads, the resulting partitioned population can also include a diverse barcode library including at least about 1,000 different barcode sequences, at least about 5,000 different barcode sequences, at least about 10,000 different barcode sequences, at least about 50,000 different barcode sequences, at least about 100,000 different barcode sequences, at least about 1,000,000 different barcode sequences, at least about 5,000,000 different barcode sequences, or at least about 10,000,000 different barcode sequences. Further, each partition of the population may include at least about 1,000 nucleic acid molecules, at least about 5,000 nucleic acid molecules, at least about 10,000 nucleic acid molecules, at least about 50,000 nucleic acid molecules, at least about 100,000 nucleic acid molecules, at least about 500,000 nucleic acids, at least about 1,000,000 nucleic acid molecules, at least about 5,000,000 nucleic acid molecules, at least about 10,000,000 nucleic acid molecules, at least about 50,000,000 nucleic acid molecules, at least about 100,000,000 nucleic acid molecules, at least about 250,000,000 nucleic acid molecules, and in some cases, at least about 10 hundred million nucleic acid molecules.

In some cases, it may be desirable to incorporate multiple different barcodes into a given partition, the barcodes being attached to a single or multiple beads within the partition. For example, in some cases, mixed but known sets of barcode sequences may provide greater assurance of authentication in subsequent processing, e.g., by providing a stronger address or home of the barcode to a given partition, as a duplicate acknowledgement or independent acknowledgement of the output of the given partition.

Upon application of a specific stimulus to the bead, the nucleic acid molecule (e.g., oligonucleotide) may be released from the bead. In some cases, the stimulus may be a light stimulus, for example by cleavage of a photolabile bond, thereby releasing the nucleic acid molecule. In other cases, thermal stimulation may be used, wherein an increase in the temperature of the bead environment will cause cleavage or other release of the bond from the bead. In other cases, chemical stimulus may be used that cleaves the bond of the nucleic acid molecule to the bead, or otherwise causes release of the nucleic acid molecule from the bead. In one instance, such compositions include the polyacrylamide matrices described above for encapsulating biological particles, and can be degraded by exposure to a reducing agent (such as DTT) to release the attached nucleic acid molecules.

In some aspects, systems and methods for controlled separation are provided. The droplet size may be controlled by adjusting certain geometric features in the channel architecture (e.g., microfluidic channel architecture). For example, the spread angle, width, and/or length of the channel may be adjusted to control droplet size.

Fig. 2 shows an example of a microfluidic channel structure for controlled separation of beads into discrete droplets. The channel structure 200 may include a channel segment 202 that communicates with a reservoir 204 at a channel connection 206 (or intersection). The reservoir 204 may be a chamber. As used herein, any reference to a "reservoir" may also refer to a "chamber. In operation, the aqueous fluid 208 containing the suspended beads 212 may be transported along the channel segment 202 into the connection 206 to encounter the second fluid 210 that is immiscible with the aqueous fluid 208 in the reservoir 204, thereby producing droplets 216, 218 of the aqueous fluid 208 flowing into the reservoir 204. At the junction 206 where the aqueous fluid 208 and the second fluid 210 meet, droplets may be formed based on certain geometric parameters (e.g., w, h ₀, α, etc.) such as the hydrodynamic forces at the junction 206, the flow rates of the two fluids 208, 210, the fluid properties, and the channel structure 200. By continuously injecting the aqueous fluid 208 from the channel segment 202 through the connection 206, a plurality of droplets may be collected in the reservoir 204.

The discrete droplets generated may include beads (e.g., as in occupied droplets 216). Alternatively, the discrete droplets generated may comprise more than one bead. Alternatively, the discrete droplets produced may not include any beads (e.g., as in unoccupied droplets 218). In some cases, the discrete droplets produced may contain one or more biological particles, as described elsewhere herein. In some cases, the discrete droplets produced may comprise one or more reagents, as described elsewhere herein.

In some cases, the aqueous fluid 208 may have a substantially uniform concentration or frequency of beads 212. Beads 212 may be introduced into channel segment 202 from a separate channel (not shown in fig. 2). The frequency of the beads 212 in the channel section 202 may be controlled by controlling the frequency of introduction of the beads 212 into the channel section 202 and/or the relative flow rates of the fluids in the channel section 202 and the individual channels. In some cases, beads may be introduced into channel segment 202 from a plurality of different channels, and the frequencies controlled accordingly.

In some cases, the aqueous fluid 208 in the channel segment 202 may contain biological particles. In some cases, the aqueous fluid 208 may have a substantially uniform concentration or frequency of biological particles. As with the beads, the biological particles may be introduced into the channel segment 202 from a separate channel. The frequency or concentration of biological particles in the aqueous fluid 208 in the channel section 202 may be controlled by controlling the frequency of introduction of biological particles into the channel section 202 and/or the relative flow rates of the fluid in the channel section 202 and the separate channel. In some cases, biological particles may be introduced into channel segment 202 from a plurality of different channels, and the frequency controlled accordingly. In some cases, a first individual channel may introduce beads into channel segment 202, and a second individual channel may introduce biological particles into the channel segment. The first separate channel into which the beads are introduced may be upstream or downstream of the second separate channel into which the biological particles are introduced.

The second fluid 210 may comprise an oil, such as a fluorinated oil, that includes a fluorosurfactant for stabilizing the resulting droplets (e.g., inhibiting subsequent coalescence of the resulting droplets).

In some cases, the second fluid 210 may not experience and/or be directed to any flow into or out of the reservoir 204. For example, the second fluid 210 may be substantially stationary in the reservoir 204. In some cases, the second fluid 210 may be subject to flow within the reservoir 204, but not flow into or out of the reservoir 204, such as by applying pressure to the reservoir 204 and/or being affected by an incoming flow of aqueous fluid 208 at the connection 206. Alternatively, the second fluid 210 may be subjected to and/or directed to flow into or out of the reservoir 204. For example, reservoir 204 may be a channel that directs second fluid 210 from upstream to downstream, transporting the generated droplets.

The channel structure 200 at or near the connection 206 may have certain geometric features that at least partially determine the size of the droplets formed by the channel structure 200. The channel segment 202 may have a height h ₀ and a width w at or near the connection 206. For example, the channel segment 202 may have a rectangular cross-section that leads to a reservoir 204 having a wider cross-section (such as in width or diameter). Alternatively, the cross-section of the channel segment 202 may be other shapes, such as a circular shape, a trapezoidal shape, a polygonal shape, or any other shape. The top and bottom walls of the reservoir 204 at or near the connection 206 may be sloped at an expansion angle α. The spread angle α allows the tongue (the portion of the aqueous fluid 208 that exits the channel segment 202 at the junction 206 and enters the reservoir 204 prior to droplet formation) to increase in depth and facilitate reducing the curvature of the intermediately formed droplets. The droplet size may decrease with increasing spread angle. The final drop radius R _d can be predicted by the following equations for the h ₀, w, and α geometric parameters described above:

For example, for channel structures w=21 μm, h=21 μm, and α=3°, the predicted droplet size is 121 μm. In another example, for a channel structure w=25 μm, h=25 μm, and α=5°, the predicted droplet size is 123 μm. In another example, for a channel structure of w=28 μm, h=28 μm, and α=7°, the predicted droplet size is 124 μm.

In some cases, the spread angle α may be in the range of about 0.5 ° to about 4 °, about 0.1 ° to about 10 °, or about 0 ° to about 90 °. For example, the spread angle may be at least about 0.01°,0.1°,0.2°,0.3°,0.4°,0.5°,0.6°,0.7°,0.8°,0.9°,1°,2°,3°,4°,5°,6°,7°,8°,9°,10°,15°,20°,25°,30°,35°,40°,45°,50°,55°,60°,65°,70°,75°,80°,85° or greater. In some cases, the spread angle may be up to about 89°,88°,87°,86°,85°,84°,83°,82°,81°,80°,75°,70°,65°,60°,55°,50°,45°,40°,35°,30°,25°,20°,15°,10°,9°,8°,7°,6°,5°,4°,3°,2°,1°,0.1°,0.01° or less. In some cases, the width w may be in the range of about 100 micrometers (μm) to about 500 μm. In some cases, the width w may be in the range of about 10 μm to about 200 μm. Alternatively, the width may be less than about 10 μm. Alternatively, the width may be greater than about 500 μm. In some cases, the flow rate of the aqueous fluid 208 entering the connection 206 may be between about 0.04 microliters (μl)/minute (min) and about 40 μl/min. In some cases, the flow rate of the aqueous fluid 208 entering the connection 206 may be between about 0.01 microliters (μl)/minute (min) and about 100 μl/min. Alternatively, the flow rate of the aqueous fluid 208 entering the connection 206 may be less than about 0.01 μl/min. Alternatively, the flow rate of the aqueous fluid 208 entering the connection 206 may be greater than about 40 μl/min, such as 45μL/min、50μL/min、55μL/min、60μL/min、65μL/min、70μL/min、75μL/min、80μL/min、85μL/min、90μL/min、95μL/min、100μL/min、110μL/min、120μL/min、130μL/min、140μL/min、150μL/min or greater. At lower flow rates (such as flow rates less than or equal to about 10 microliters/minute), the droplet radius may not depend on the flow rate of the aqueous fluid 208 entering the connection 206.

In some cases, at least about 50% of the droplets generated may have a uniform size. In some cases, at least about 55%,60%,65%,70%,75%,80%,85%,90%,95%,96%,97%,98%,99%, or larger droplets produced may be of uniform size. Alternatively, less than about 50% of the droplets generated may have a uniform size.

The flux of droplet generation may be increased by increasing the point of generation, for example, increasing the number of connections (e.g., connection 206) between the channel segments (e.g., channel segments 202) of the aqueous fluid 208 and the reservoir 204. Alternatively or in addition, the flux of droplet generation may be increased by increasing the flow rate of the aqueous fluid 208 in the channel section 202.

The methods and systems described herein may be used to greatly improve the efficiency of single cell applications and/or other applications that receive droplet-based inputs. For example, subsequent operations that may be performed after sorting the occupied cells and/or cells of an appropriate size may include generating amplification products, purification (e.g., via Solid Phase Reversible Immobilization (SPRI)), further processing (e.g., cleavage, ligation, and subsequent amplification (e.g., via PCR)) of the functional sequences. These operations may occur in the ontology (e.g., outside the partition). In the case where the partition is a droplet in an emulsion, the emulsion may be broken and the contents of the droplet combined for additional operations. Additional reagents that may be co-partitioned with the barcoded beads may include oligonucleotides for blocking ribosomal RNA (rRNA) and nucleases for digesting genomic DNA in cells. Alternatively, rRNA removers may be applied during additional processing operations. The configuration of the constructs generated by this method can help minimize (or avoid) sequencing of the poly-T sequence and/or sequence the 5' end of the polynucleotide sequence during sequencing. The amplification products (e.g., the first amplification product and/or the second amplification product) can be sequenced for sequence analysis. In some cases, amplification may be performed using a partial hairpin sequencing amplification (PHASE) method.

A variety of applications require assessment of the presence and quantification of different biological particles or organism types within a population of biological particles, including, for example, microbiome analysis and characterization, environmental testing, food safety testing, epidemiological analysis (e.g., in tracking contamination), and the like.

Fig. 40 shows a flow chart of a method 4000 for analyzing nucleic acid molecules of an embedded, immobilized tissue, according to some embodiments. In operation 4010, method 4000 can include providing an embedded fixed tissue. The embedded fixed tissue may comprise a plurality of cells. One cell of the plurality of cells may comprise a nucleic acid molecule. The embedded fixed tissue may be embedded in a solid medium. Nucleic acid molecules can be as described elsewhere herein. For example, the nucleic acid molecule may comprise a ribonucleic acid (RNA) molecule. In another example, the nucleic acid molecule may comprise a messenger RNA molecule.

The fixed tissue may be fixed for at least about 1 day (d), 2d, 3d, 4d, 5d, 6d, 1 week (w), 2w, 3w, 1 month (m), 2m, 3m, 4m, 5m, 6m, 7m, 8m, 9m, 10m, 11m, 1 year (y), 2y, 3y, 4y, 5y, 10y, 15y, 20y, or more prior to operation 4010. The fixed tissue may be fixed up to about 20y, 15y, 10y, 5y, 4y, 3y, 2y, 1y, 11m, 10m, 9m, 8m, 7m, 6m, 5m, 4m, 3m, 2m, 1m, 3w, 2w, 1w, 6d, 5d, 4d, 3d, 2d, 1d, or less prior to operation 4010. The fixed tissue may be fixed at a time defined by the range of any two in-progress values.

The fixed tissue may include, for example, one or more of connective tissue, epithelial tissue, organ tissue, muscle tissue, ligament, tendon, skin tissue, breast tissue, bladder, kidney tissue, liver tissue, colon tissue, thyroid tissue, cervical tissue, prostate tissue, lung tissue, heart tissue, muscle tissue, pancreatic tissue, anal tissue, bile duct tissue, bone marrow, uterine tissue, ovarian tissue, endometrial tissue, vaginal tissue, vulval tissue, stomach tissue, ocular tissue, nasal tissue, sinus tissue, penile tissue, salivary gland tissue, intestinal tissue, gall bladder tissue, gastrointestinal tissue, bladder tissue, brain tissue, spinal tissue, neurons, blood representative of the blood brain barrier, hair, nails, keratin, collagen, or any combination thereof. For example, the fixed tissue may comprise fixed human tissue. For example, the fixed tissue may include fixed cancerous tissue (e.g., fixed breast cancer tissue).

In some cases, the solid medium comprises paraffin. The solid medium may include, for example, other long chain hydrophobic compounds (e.g., waxes, etc.), glycerin, gels, epoxies, resins, etc., or any combination thereof. The fixed tissue may be fixed using, for example, formaldehyde (e.g., formalin), ethanol fixative, polyethylene glycol, glycerol, acetic acid, zinc-based fixative, khandsari, honey, and the like, or any combination thereof.

In another operation 4020, method 4000 can include removing at least a portion of the solid medium from the embedded fixed tissue, thereby obtaining the fixed tissue comprising the plurality of cells.

In some cases, operation 4020 may include adding a first solvent to the embedded fixed tissue to dissolve the solid medium to obtain the fixed tissue. In some cases, the first solvent comprises a non-polar solvent. Examples of nonpolar solvents include, but are not limited to, alkanes (e.g., butane, pentane, hexane, etc.), aromatics (e.g., benzene, toluene, xylenes, etc.), xylene substitute compounds (e.g.,Xylene substitutes, etc.), or any combination thereof. The nonpolar solvent may have no or substantially no dipole moment on the solvent molecule. The non-polar solvent may be configured to dissolve at least a portion of the solid medium. For example, the solid medium may comprise a non-polar moiety that is soluble by the first solvent.

Operation 4020 may include removing the first solvent from the fixed tissue. The removing may include removing without damaging the fixed tissue. For example, the first solvent may be removed without physically damaging the fixed tissue. In this example, the fixed tissue may remain as a tissue roll after the solid medium and the first solvent are removed. The removing may include using a pipette. For example, the first solvent may be removed via pipetting the first solvent without damaging the fixed tissue. The removing may include decanting the first solvent from the fixed tissue.

Operation 4020 may further include adding a second solvent to the fixed tissue. The second solvent may include a polar solvent. Examples of polar solvents include, but are not limited to, alcohols (e.g., methanol, ethanol, propanol, isopropanol, etc.), carbonyl-containing compounds (e.g., acetone, ethyl acetate, etc.), haloforms (e.g., chloroform, etc.), dimethyl sulfoxide (DMSO), N-methyl formamide (NMF), etc., or any combination thereof. In one example, the polar solvent is ethanol. Operation 4020 may include removing the second solvent from the fixed tissue. The removal may be as described elsewhere herein. For example, the removing may include not destroying the fixed tissue.

Operation 4020 may include adding a rehydrating agent to the fixed tissue. The rehydration agent may comprise water. In one example, the rehydrating agent may be water. In another example, the rehydration agent can comprise a buffer solution. The rehydrating agent may comprise one or more of buffers, salts, ions, etc., or any combination thereof. Operation 4020 may include removing the rehydration agent from the tissue as described elsewhere herein. Operation 4020 may include adding a buffer to the fixed tissue. Examples of buffers include, but are not limited to, tris buffer, phosphate buffer, glycine buffer, boric acid buffer, carboxylic acid buffer (e.g., malic acid, formic acid, lactic acid, etc.), and the like, or any combination thereof. In some cases, the buffer may be removed from the fixed tissue. The removal may be as described elsewhere herein.

In another operation 4030, method 4000 may include dissociating the fixed tissue into cells in a plurality of cells. Cells may include cells containing nucleic acid molecules.

Operation 4030 may include dissociating the tissue. Dissociating the tissue may include using a manual disruptor (e.g., manual disruptor, mortar and pestle, manual chopper, etc.), an automatic disruptor (e.g., GENTLEMACS ^TM Octo disruptor,PCT pulverizer,CryoPREP automatic dry pulverizer, etc.), or any combination thereof.

Operation 4030 may include resuspending the tissue. Resuspension may include adding a resuspension solution to the tissue. The resuspension solution may comprise, for example, water, a surfactant, a buffer, a salt, or the like, or any combination thereof. The resuspension solution can be configured to solubilize at least a portion of the tissue and/or a component of the tissue (e.g., a biomolecule, a nucleic acid, etc.). Resuspension can make the contents of the tissue available for use in the methods and systems described elsewhere herein. The resuspension may provide organization for additional processing operations.

Operation 4030 may include washing the tissue. The washing may remove non-predetermined molecules (e.g., waste molecules, non-target molecules, etc.). Washing may include the use of one or more solvents (e.g., water, non-polar solvents, etc.) as described elsewhere herein. The tissue may be washed at least about 1,2,3,4,5,6,7,8,9,10 or more times. The tissue may be washed up to about 10,9,8,7,6,5,4,3,2,1 or less times. After washing the tissue, the tissue may be resuspended. For example, the tissue may be resuspended as described elsewhere herein.

Method 4000 can provide a cell yield of at least about 1x 10 ³ (1E 3), 5E3,1E4,5E4,1E4,5E5,1E5,5E6,1E6,5E7,1E7,5E8,1E8,5E9,1E9,5E10 cells or more from two 25 micron sections of the fixed tissue. Method 4000 can provide cell yields of up to about 5E10,1E10,5E10,1E9,5E9,1E8,5E8,1E7,5E7,1E6,5E6,1E5,5E5,1E4,5E4,1E4,5E3,1E3 cells or less from two 25 micron sections of fixed tissue. Non-limiting examples of cell yields per two 50 micron sections achieved by experimentation include human breast tissue (0.6E6 cells), human lymph node (2.33E6 cells), human lung cancer (5.05E6 cells), human brain (9.36E6 cells), human pancreas (5.45E6 cells), human heart (1.11E6 cells), human testis (1.99E6 cells), and human tonsils (5.6E6 cells). Non-limiting examples of cell yields per two 25 micron sections achieved by experimentation include normal human pancreas (0.6E6 cells), human brain glioblastoma multiforme (0.14E6 cells), human tonsil chronic tonsillitis with lymphoid hyperplasia (5.6E6 cells), human lung cancer (0.38E6 cells), human liver-hepatocellular carcinoma (0.52E6 cells), normal human thymus (2.56E6 cells), human normal adjacent kidney 2.49E6 cells, human ovarian cancer (0.33E6 cells), normal human spleen (1.89E6 cells), human cardiac myxoma (0.34E6 cells), human skin melanoma (0.51E6 cells), human diseased prostate (0.3E6 cells), human lymph node adenocarcinoma (0.46E6 cells), and human colorectal cancer (1.34E6 cells).

The nucleic acid molecules may be partitioned into partitions. The separation may occur prior to operation 4040. The partition may be one of a plurality of holes. For example, partitioning may include placing nucleic acid molecules into wells of a 96-well plate. As described elsewhere herein, a partition may be one of a plurality of droplets. For example, the droplet may be one of a plurality of oil emulsion droplets. The droplets may be a phase that is immiscible with the surrounding phase. The droplets may be droplets surrounded by a gas (e.g., air). In some cases, the partition may include a support. The support may be as described elsewhere herein. For example, the support may comprise a solid support. In another example, the support may comprise beads. The barcode sequence may be attached to a support.

In another operation 4040, the method 4000 can include using the barcode sequence in a cell to generate a barcoded nucleic acid molecule comprising the barcode sequence and a sequence corresponding to the nucleic acid molecule.

In some cases, operation 4040 may include hybridizing a barcode sequence to the nucleic acid molecule. Hybridization may be as described elsewhere herein. The barcode may be extended to form a barcoded nucleic acid molecule as described elsewhere herein. For example, extending may include using enzymes. The extending may include using a connection. For example, extending may include ligating a barcode sequence to a probe as described elsewhere herein. The probe may comprise a moiety that is at least partially complementary to the nucleic acid molecule.

FIG. 41 illustrates an example of a nucleic acid profiling workflow according to some embodiments. Nucleic acid profiling workflows can be combined with other portions of the disclosure, such as fixed tissue preparation methods. The nucleic acid profiling workflow of FIG. 41 may be a single-fold workflow. Nucleic acid profiling workflow for one or more samples (e.g., cells, nucleic acids) may in some cases include staining operations.

After the staining procedure, the workflow may include fixing and/or permeabilizing the cells. The securing may include securing as described elsewhere herein. Fixation may include embedding. For example, the immobilized cells may be immobilized and then embedded in a matrix. Permeabilizing the cell can include the use of one or more permeabilizing reagents (e.g., methanol, ethanol, acetone, detergents, etc.). Permeabilization may include at least partial degradation of the cell wall of the cells of the sample. Permeabilization can be configured to make the contents of a cell (e.g., nucleic acid) accessible. After fixation and/or permeabilization of the sample, the sample can be stored. The storage may last for at least about 1 day (d), 2d, 3d, 4d, 5d, 6d, 1 week (w), 2w, 3w, 1 month (m), 2m, 3m, 4m, 5m, 6m, 7m, 8m, 9m, 10m, 11m,1 year (y), 2y, 3y, 4y, 5y,10y,15 y, 20y, or longer. The storage may last up to about 20y,15y,10y,5y,4y,3y,2y,1y,11m,10m,9m,8m,7m,6m,5m,4m,3m,2m,1m,3w,2w,1w,6d,5d,4d,3d,2d,1d or less.

After immobilization and/or permeabilization (and optional storage, if performed), the workflow may include hybridizing one or more probes to a sample as described elsewhere herein. Hybridization may include the use of probe molecules, such as the probe molecules of FIG. 43. In the workflow of fig. 41, hybridization may not include multiplex analysis. For example, each sample may be run separately through a workflow. Each sample run through the workflow may contain a probe barcode. In some cases, the singleplex format may be compatible with a cell labeling protocol (e.g., cell surface protein labeling).

After hybridization, the barcoded nucleic acids may be partitioned as described elsewhere herein. For example, the barcoded nucleic acid may be separated into a plurality of droplets. In another example, the barcoded nucleic acid may be partitioned into multiple wells. After separation, the connection and extension may be performed as described elsewhere herein. Library construction may include one or more of construction of a cell surface protein library and a gene expression library as described elsewhere herein. The separation may be as described in fig. 43. FIG. 43 illustrates an example of droplet partition generation according to some embodiments. A plurality of bar-coded beads 4301 (e.g., bar-coded gel beads) may be provided. The washed immobilized sample may be provided to the illustrated barcoded beads and separated into a plurality of droplets suspended in oil. In this way, the samples may be partitioned such that, on average, there is less than one sample in each partition shown.

After library construction, nucleic acid molecules from the sample may be sequenced as described elsewhere herein. For example, the sequencing may be sequencing-by-synthesis. After sequencing, the data generated by sequencing can be analyzed as described elsewhere herein.

FIG. 42 shows an example of a nucleic acid profiling workflow according to some embodiments. The nucleic acid profiling workflow of fig. 42 may be a multiplex workflow (e.g., multiple samples may be processed simultaneously). In some cases, multiple workflows may be incompatible with cell labeling. The operation of the nucleic acid profiling workflow may be as described elsewhere herein. For example, cells may be immobilized in formaldehyde, and nucleic acid molecules from the cells may be hybridized to the probe set. Each probe set may include a unique probe barcode to allow sample multiplexing and read-level demultiplexing. Samples may be hybridized overnight, pooled, washed and partitioned. The probe may be attached together with an additional bar code. Library construction, sequencing and data analysis can be performed after ligation. The nucleic acid profiling workflow may include the use of a sample as described elsewhere herein. For example, the sample may comprise nucleic acids. For example, the immobilization and/or permeabilization operations may be as described elsewhere herein. As described elsewhere herein, the sample may be stored after immobilization and/or permeabilization. After immobilization, the probe may be hybridized to a nucleic acid molecule of the sample. For example, different probes may be hybridized to different nucleic acid molecules in the sample. In some cases, each sample used in the workflow may be associated with a different barcode probe. Multiplex assays may include simultaneous hybridization of at least about 2,3,4,5,6,7,8,9,10,11,12,13,14,15,16,17,18,19,20, individual or more samples. Multiplex assays may include simultaneous hybridization of up to about 20,19,18,17,16,15,14,13,12,11,10,9,8,7,6,5,4,3 or fewer samples. As described elsewhere herein, the nucleic acid molecules may be stored after hybridization of the probes.

After hybridization, the nucleic acid molecules may be combined and separated as described elsewhere herein (e.g., fig. 43). The barcode molecules may be linked and/or extended as described elsewhere herein. A library (e.g., a gene expression library) can be constructed, barcode molecules can be sequenced, and sequencing data can be analyzed.

In some cases, the combining operation described elsewhere herein may include one or more washing operations. Washing may include removing supernatant from the centrifuged solution. In some cases, the removal of the supernatant may be complete (e.g., all of the supernatant may be removed). In some cases, removing the supernatant may be a partial removal (e.g., a portion of the supernatant may be left behind). For example, up to about 100,90,80,70,60,50,40,30,20,10 or less microliters of supernatant may be left during the washing operation. Leaving the supernatant can provide benefits for processing samples having low cell numbers (e.g., less than about 500,000 cells in the sample). Incomplete removal of the supernatant may enhance cell recovery without significantly affecting the performance of the assays of the present disclosure.

Fig. 43 illustrates an example of a plurality of probe molecules 4301 and 4302, according to some embodiments. In some cases, the probe may be as described elsewhere herein. The probe may be configured to analyze a gene. In some cases, the probe may be configured with a barcode portion 4303. The barcode portion may be a barcode as described elsewhere herein. The barcode may be attached to the probe prior to the user's analysis of the nucleic acid molecule using the probe. For example, the bar code may be pre-attached to the probe. In this example, the probe may be provided to the user, the barcode already being part of the probe. Within a partition (e.g., a droplet suspended in oil), the probe molecules may bind to the analyte nucleic acid 4304. After binding, the probes may be ligated together. The connection may include sealing a slit (e.g., connecting adjacent probes without filling a gap). Ligation may include gap-filling ligation reactions (e.g., filling gaps of one or more nucleotides between probes during ligation). After ligation, the single unified probe 4305 can dissociate from the analyte nucleic acid and be captured by the capture region 4306 of nucleic acid associated with the bead 4307. The probes may then be extended to incorporate one or more barcode molecules, primers, etc. as described elsewhere herein. The probes may then be amplified as described elsewhere herein.

In some cases, the methods and systems of the present disclosure may provide improvements in analytical efficacy of a fixed sample. Fixation may increase the mechanical elasticity of the cell sample and lock in transient cellular states. In this way, analysis of the sample cells may provide information about the state of the cells at the time of fixation, which may provide additional insight into the short term cell state. The improvements of the present disclosure can also enable greater longitudinal research. For example, high quality data generated from a fixed sample allows the sample to be studied over a longer period of time. In addition, modern technology will be used with historical samples, releasing new information that was not previously available. The methods and systems of the present disclosure may also reduce transportation and/or storage constraints on the sample. For example, fixed and embedded samples are more compatible with harsh storage conditions than non-fixed samples. This may enable a new paradigm of sample analysis in which samples are transported a greater distance for analysis (e.g., worldwide). This may allow all samples under study to be transported to a single laboratory for analysis, which may reduce errors associated with processing samples in different laboratories. In addition, sample-to-sample variability can be reduced by matching samples, and samples can be multiplexed with unique sample barcodes. The immobilization of the sample may also reduce or eliminate infectious agents present within the sample, which may increase the safety of such sample analysis.

Computer system

The present disclosure provides a computer system programmed to implement the methods of the present disclosure. FIG. 14 illustrates a computer system 1401 that is programmed or otherwise configured to process or analyze sequencing reads. The computer system 1401 may adjust various aspects of the disclosure, e.g., align sequencing reads, index sequencing reads to cells, partitions, etc. The computer system 1401 may be a user's electronic device or a computer system located at a remote location relative to the electronic device. The electronic device may be a mobile electronic device.

The computer system 1401 includes a central processing unit (CPU, also referred to herein as a "processor" and a "computer processor") 1405, which may be a single-core processor or a multi-core processor, or a plurality of processors for parallel processing. The computer system 1401 also includes memory or memory locations 1410 (e.g., random access memory, read only memory, flash memory), an electronic storage unit 1415 (e.g., a hard disk), a communication interface 1420 (e.g., a network adapter) for communicating with one or more other systems, and peripheral devices 1425 such as cache, other memory, data storage, and/or electronic display adapters. The memory 1410, the storage unit 1415, the interface 1420, and the peripheral 1425 communicate with the CPU 1405 through a communication bus (solid line) such as a motherboard. The storage unit 1415 may be a data storage unit (or data repository) for storing data. The computer system 1401 may be operatively coupled to a computer network ("network") 1430 by means of a communication interface 1420. The network 1430 may be the internet, and/or an extranet, or an intranet and/or extranet in communication with the internet. Network 1430 is in some cases a telecommunications network and/or a data network. Network 1430 may include one or more computer servers that may support distributed computing, such as cloud computing. The network 1430 may in some cases implement a point-to-point network with the aid of the computer system 1401, which may enable devices coupled to the computer system 1401 to function as clients or servers.

The CPU 1405 may execute a sequence of machine-readable instructions, which may be embodied in a program or software. The instructions may be stored in a memory location, such as memory 1410. Instructions may be directed to CPU 1405, which may then program or otherwise configure CPU 1405 to implement the methods of the present disclosure. Examples of operations performed by CPU 1405 may include fetching, decoding, executing, and writing back.

CPU 1405 may be part of a circuit such as an integrated circuit. One or more other components of system 1401 may be included in the circuit. In some cases, the circuit is an Application Specific Integrated Circuit (ASIC).

The storage unit 1415 may store files such as drivers, libraries, and saved programs. The storage unit 1415 may store user data such as user preferences and user programs. In some cases, the computer system 1401 may include one or more additional data storage units located outside the computer system 1401, such as on a remote server in communication with the computer system 1401 via an intranet or the Internet.

The computer system 1401 may communicate with one or more remote computer systems over a network 1430. For example, the computer system 1401 may communicate with a remote computer system of a user (e.g., an operator). Examples of remote computer systems include personal computers (e.g., portable PCs), tablet PCs or tablet PCs (e.g.,iPad、Galaxy Tab), phone, smart phone (e.g.,IPhone, android supporting device,) Or a personal digital assistant. A user may access the computer system 1401 via the network 1430.

The methods as described herein may be implemented by way of machine (e.g., computer processor) executable code stored on an electronic storage location of the computer system 1401, such as on the memory 1410 or the electronic storage unit 1415. The machine executable code or machine readable code may be provided in the form of software. During use, the code may be executed by the processor 1405. In some cases, the code may be retrieved from the storage unit 1415 and stored on the memory 1410 for ready access by the processor 1405. In some cases, electronic storage 1415 may be eliminated and machine executable instructions stored on memory 1410.

The code may be pre-compiled and configured for use with a machine having a processor adapted to execute the code, or may be compiled during runtime. The code may be provided in a programming language that is selectable to enable execution of the code in a pre-compiled or compiled manner.

Aspects of the systems and methods provided herein, such as the computer system 1401, may be embodied in programming. Aspects of the technology may be considered an "article of manufacture" or "article of manufacture" which is typically in the form of machine-executable code and/or associated data carried on or embodied in one type of machine-readable medium. The machine executable code may be stored on an electronic storage unit, such as a memory (e.g., read only memory, random access memory, flash memory) or a hard disk. A "storage" medium may include any or all of the tangible memory of a computer, processor, etc., or its associated modules, such as various semiconductor memories, tape drives, disk drives, etc., which may provide non-transitory storage for software programming at any time. All or part of the software may sometimes communicate over the internet or various other telecommunications networks. Such communication may enable, for example, software to be loaded from one computer or processor into another computer or processor, for example, from a management server or host into a computer platform of an application server. Thus, another type of medium that may carry software elements includes light waves, electric waves, and electromagnetic waves, such as those used across physical interfaces between local devices, through wired and optical landline networks, and through various air links. Physical elements carrying such waves, such as wired or wireless links, optical links, etc., may also be considered as media carrying software. As used herein, unless limited to a non-transitory tangible "storage" medium, terms, such as computer or machine "readable medium," refer to any medium that participates in providing instructions to a processor for execution.

Thus, a machine-readable medium, such as computer-executable code, may take many forms, including but not limited to, tangible storage media, carrier wave media, or physical transmission media. Nonvolatile storage media includes, for example, optical or magnetic disks, any storage devices, etc., such as in any computer, such as those shown in the accompanying drawings, which may be used to implement a database. Volatile storage media include dynamic memory, such as the main memory of such a computer platform. Tangible transmission media include coaxial cables, copper wire and fiber optics, including the wires that comprise a bus within a computer system. Carrier wave transmission media can take the form of electrical or electromagnetic signals, or acoustic or light waves, such as those generated during Radio Frequency (RF) and Infrared (IR) data communications. Thus, common forms of computer-readable media include, for example, a floppy disk, a flexible disk, hard disk, magnetic tape, any other magnetic medium, a CD-ROM, DVD, or DVD-ROM, any other optical medium, punch cards, paper tape, any other physical storage medium with patterns of holes, RAM, ROM, PROM and EPROMs, FLASH-EPROMs, any other memory chip or cartridge, a carrier wave transporting data or instructions, a cable or link transporting such a carrier wave, or any other medium from which a computer can read programming code and/or data. Many of these forms of computer-readable media may be involved in carrying one or more sequences of one or more instructions to a processor for execution.

The computer system 1401 may include or communicate with an electronic display 1435 that includes a User Interface (UI) 1440 for providing, for example, sequencing analysis results, and the like. Examples of UIs include, but are not limited to, graphical User Interfaces (GUIs) and Web-based user interfaces.

The methods and systems of the present disclosure may be implemented by one or more algorithms. The algorithm may be implemented by software when executed by the central processing unit 1405. For example, the algorithm may perform sequencing.

The devices, systems, compositions, and methods of the present disclosure can be used in a variety of applications, such as processing a single analyte (e.g., RNA, DNA, or protein) or multiple analytes (e.g., DNA and RNA, DNA and protein, RNA and protein, or RNA, DNA and protein) from a single cell. For example, a biological particle (e.g., a cell or cell bead) is partitioned in a partition (e.g., a droplet) and multiple analytes from the biological particle are processed for subsequent processing. The plurality of analytes may be from a single cell. This can allow, for example, simultaneous proteomics, transcriptomics and genomic analysis of cells.

Examples

Prophetic example 1-RNA templated ligation and barcoding

The generation of one or more barcoded molecules (e.g., within or on a cell or cell bead) can be performed sequentially within one or more sets of partitions. For example, a cell or cell bead may comprise a target RNA molecule for barcoding and/or a feature that may have a feature binding group comprising a reporter oligonucleotide (comprising a reporter sequence) coupled thereto. The target RNA molecule can be hybridized to the first probe and the second probe, e.g., the target RNA molecule can have a first target region and a second target region that are complementary to the first probe sequence of the first probe and the second probe sequence of the second probe. In some cases, a probe-linked molecule may be produced, for example, via ligation of the probe upon hybridization to an RNA molecule, or using one or more nucleic acid reactions, for example, via an extension reaction and/or enzymatic or chemical ligation. The probe-linked molecules may be barcoded in one or more sets of partitions.

In one example, cells (or nuclei or cell beads) can be separated in a first set of partitions (e.g., microwells or other containers) and contacted with hybridization buffer comprising first probes, second probes, probe binding molecules (e.g., splint oligonucleotides), and barcode molecules. The hybridization buffer can include reagents (e.g., formamide, ethylene carbonate, salts, etc.) to facilitate hybridization of the first and second probes to the target nucleic acid molecule. Cells (or nuclei or cell beads) from multiple partitions can then be pooled and washed, e.g., to remove unhybridized probes. Cells (or nuclei or cell beads) may then be counted and re-partitioned into a second set of partitions (e.g., droplets). Within the droplet or within the cell or cell bead within the droplet, ligation and extension reactions can be performed to produce a barcoded nucleic acid molecule. In some cases, the droplet may additionally comprise a capture molecule comprising an additional barcode sequence. Thus, a barcoded nucleic acid molecule within a droplet may comprise two barcode sequences.

In another example, cells (or nuclei or cell beads) can be partitioned in a first set of partitions (e.g., microwells) and contacted with a hybridization buffer comprising a first probe, a second probe, a probe binding molecule (e.g., a splint oligonucleotide), and a barcode molecule. Cells (or nuclei or cell beads) within the partition may then be washed, for example, to remove unhybridized probes, and subsequently pooled together. Cells (or nuclei or cell beads) may then be counted and re-partitioned into a second set of partitions (e.g., droplets). Within the droplet, ligation and extension reactions can be performed to produce a barcoded nucleic acid molecule. In some cases, the droplet may additionally comprise a capture molecule comprising an additional barcode sequence. Thus, a barcoded nucleic acid molecule within a droplet may comprise two barcode sequences.

In yet another example, cells (or nuclei or cell beads) can be partitioned in a first set of partitions (e.g., microwells) and contacted with a hybridization buffer comprising a first probe, a second probe, a probe binding molecule (e.g., a splint oligonucleotide), and a barcode molecule. Cells (or nuclei or cell beads) from multiple partitions can then be pooled and washed, e.g., to remove unhybridized probes. Cells (or nuclei or cell beads) can then be counted and subjected to conditions sufficient to ligate the barcode molecule to the nucleic acid molecule to which the probe hybridizes. The linked molecules can then be separated into, for example, droplets. Within the droplet, an extension reaction may be performed to produce a barcoded nucleic acid molecule. In some cases, the droplet may additionally comprise a capture molecule comprising an additional barcode sequence. Thus, a barcoded nucleic acid molecule within a droplet may comprise two barcode sequences.

In another example, cells (or nuclei or cell beads) can be partitioned in a first set of partitions (e.g., microwells) and contacted with a hybridization buffer comprising a first probe, a second probe, a probe binding molecule (e.g., a splint oligonucleotide), and a barcode molecule. Cells (or nuclei or cell beads) within the partition may then be washed, for example, to remove unhybridized probes, and subsequently pooled together. Cells (or nuclei or cell beads) can then be counted and subjected to conditions sufficient to ligate the barcode molecule to the nucleic acid molecule to which the probe hybridizes. The linked molecules can then be separated into, for example, droplets. Within the droplet, an extension reaction may be performed to produce a barcoded nucleic acid molecule. In some cases, the droplet may additionally comprise a capture molecule comprising an additional barcode sequence. Thus, a barcoded nucleic acid molecule within a droplet may comprise two barcode sequences.

After each instance, the barcoded nucleic acid molecules may be subjected to additional barcoding operations in additional partitions, e.g., in droplets. Alternatively or in addition, the contents of the droplets may be combined and processed downstream for analysis (e.g., via sequencing).

In some cases, several operations may be performed in a different order. For example, cells, nuclei, or cell beads that may optionally be immobilized and permeabilized may first be hybridized to a set of probes, followed by barcoding (e.g., in a partition).

In one example, the cells (or nuclei or cell beads) may be contacted with a hybridization buffer comprising a first probe and a second probe, e.g., in a bulk solution. The cells (or nuclei or cell beads) may then be washed, e.g., to remove unhybridized probes, and subsequently partitioned into a first set of partitions (e.g., microwells). The first set of partitions may each include a probe-binding molecule and a barcode molecule. The cells (or nuclei or cell beads) in the first set of partitions can be subjected to conditions sufficient to hybridize the probe-binding molecules and barcode molecules to the target nucleic acid molecules, probe molecules, or derivatives thereof (e.g., extended, probe-associated molecules, etc.).

In some examples, the contents of the partitions may then be combined together and optionally washed. Cells (or nuclei or cell beads) can then be counted and partitioned into a second set of partitions and subjected to conditions sufficient to extend and/or ligate the barcode molecule to the nucleic acid molecules hybridized to the probes, thereby producing the barcode-added nucleic acid molecules. In some cases, the droplet may additionally comprise a capture molecule comprising an additional barcode sequence. Thus, a barcoded nucleic acid molecule within a droplet may comprise two barcode sequences.

In other examples, the partitions may be washed and then merged together. Cells (or nuclei or cell beads) can then be counted and partitioned into a second set of partitions and subjected to conditions sufficient to extend and/or ligate the barcode molecule to the nucleic acid molecules hybridized to the probes, thereby producing the barcode-added nucleic acid molecules. In some cases, the droplet may additionally comprise a capture molecule comprising an additional barcode sequence. Thus, a barcoded nucleic acid molecule within a droplet may comprise two barcode sequences.

In other examples, the contents of the partitions may then be combined together and optionally washed. The cells (or nuclei or cell beads) may then be subjected to conditions sufficient to ligate the barcode molecule to the nucleic acid molecule to which the probe hybridizes. The linked molecules can be separated in a second set of partitions (e.g., droplets) and subjected to conditions sufficient to extend the linked molecules to produce the barcoded nucleic acid molecules. In some cases, the droplet may additionally comprise a capture molecule comprising an additional barcode sequence. Thus, a barcoded nucleic acid molecule within a droplet may comprise two barcode sequences.

In other examples, the partitions may be washed first, and then the contents of the partitions may be combined together. The cells (or nuclei or cell beads) may then be subjected to conditions sufficient to ligate the barcode molecule to the nucleic acid molecule to which the probe hybridizes. The linked molecules can be separated in a second set of partitions (e.g., droplets) and subjected to conditions sufficient to extend the linked molecules to produce the barcoded nucleic acid molecules. In some cases, the droplet may additionally comprise a capture molecule comprising an additional barcode sequence. Thus, a barcoded nucleic acid molecule within a droplet may comprise two barcode sequences.

In some cases, cells, nuclei, or cell beads may first be hybridized to a set of probes, washed, counted, subjected to conditions sufficient to ligate the probes or create probe-ligated nucleic acid molecules, washed again, and then partitioned. In one example, cells (or nuclei or cell beads) can be partitioned in a first set of partitions (e.g., microwells) along with probe-binding molecules and barcode molecules. Within the first set of partitions, the probe-binding molecules and barcode molecules can hybridize to the molecules associated with the probes (or probe-linked molecules), pooled, washed (alternatively, washed in partitions, then pooled), counted, and then loaded into the second set of partitions (e.g., droplets). The cells (or nuclei or cell beads) may then be subjected to conditions sufficient to extend and/or ligate the barcode molecule to the probe-associated or probe-linked nucleic acid molecule, thereby producing a barcode-added nucleic acid molecule. Alternatively, cells (or nuclei or cell beads) may be linked in the bulk and extend within the second component region. In some cases, the droplet may additionally comprise a capture molecule comprising an additional barcode sequence. Thus, a barcoded nucleic acid molecule within a droplet may comprise two barcode sequences.

Prophetic example 2-multiplex assay barcoding of RNA templated ligation products and probe associated reporter oligonucleotides

As described herein, it may be advantageous to determine multiple analytes in a cell, nucleus, or population of cell beads. As described herein, a cell or cell bead can be contacted with a characteristic binding group that comprises or is coupled to a reporter oligonucleotide (comprising a reporter sequence). The feature binding group may be coupled to one or more features (e.g., proteins) of the cell. The cell may also comprise a target nucleic acid molecule (e.g., an RNA molecule) for the assay.

Example protocol 1 in one example protocol, cells, nuclei, or cell beads having characteristic binding groups coupled thereto may be partitioned in a first set of partitions. Each partition of the first set of partitions may include, for example, about 50,000 cells in a 50 microliter volume. Each of these partitions may include a set of probes (e.g., a first probe, a second probe, and a third probe), which may be provided at a concentration of 2 micromolar. Each partition may additionally include 5 micromolar splint oligonucleotides (probe binding molecules) and 7.5 micromolar barcode molecules. The barcode molecules may vary from partition to partition. Within the first set of partitions, the probe molecules can hybridize to (i) target nucleic acid (e.g., RNA) molecules (e.g., via the first and second probes) and (ii) feature binding groups (e.g., via the third probe). The contents of the first set of partitions may then be combined, washed, and analyzed, for example, using optical methods (such as absorbance, fluorescence, etc.), gel electrophoresis, or via sequencing.

Example protocol 2 in another example protocol, a cell, nucleus, or cell bead having a characteristic binding group coupled thereto may be hybridized in bulk with a first set of probes (e.g., a first probe, a second probe, and a third probe), which may be provided at a2 micromolar concentration. The cell, nucleus or cell bead may be subjected to conditions sufficient to hybridize the probe molecule to the target nucleic acid and/or the characteristic binding moiety. The cells, nuclei or cell beads may then be washed and subsequently partitioned in a first set of partitions. Each partition of the first set of partitions may include, for example, about 50,000 cells in a 50 microliter volume. Each of these partitions may include 1 micromolar splint oligonucleotide (probe binding molecule) and 2 micromolar barcode molecule. The barcode molecules may differ between the first set of partitions. Within the first set of partitions, the barcode molecules may hybridize to molecules associated with the probes. The contents of the first set of partitions may then be combined, washed, and analyzed, for example, using optical methods (such as absorbance, fluorescence, etc.), gel electrophoresis, or via sequencing.

Fig. 18 shows example data for the bar code addition scheme (example scheme 1) described herein. FIG. 18 shows two graphs of fluorescence intensity as a function of sequence length (in base pairs). Each line represents one sample of the barcoded cells, with a different barcode sequence attached to each sample (n=8 samples). The left panel shows the bar codes performed in Peripheral Blood Mononuclear Cells (PBMCs) and the right panel shows the bar codes performed in various cell lines. Two peaks, 230bp and 270 base pair sequences, can be identified. The 270 base pair sequence corresponds to a first target region and a second target region of the target RNA molecule of the first probe and the second probe.

FIG. 19 shows exemplary data for DNA gel electrophoresis of the barcoded molecules described herein. The left plot indicates an annealing temperature of 67 degrees celsius and the right plot indicates an annealing temperature of 63 degrees celsius. The upper diagram indicates a method of verifying bar code products using LEDs. The lower panel indicates another method of verifying barcode products using V2.

The first lane in each gel electrophoresis pattern ("lane 0") is a nucleic acid standard ladder. Lane 1 is PBMC without barcode (negative control), lane 2 is a cell line sample without barcode (negative control), lane 3 is PBMC with synthetic barcode (positive control), lane 4 is a cell line with synthetic barcode (positive control), lane 5 is PBMC with splint molecule according to example scheme 1, lane 6 is a cell line with splint molecule according to example scheme 1, lane 7 is PBMC with splint molecule according to example scheme 2, and lane 8 is a cell line with splint according to example scheme 2. It can be seen that a 63 degree anneal temperature gives higher throughput (darker bands).

Fig. 20 illustrates additional example data for the barcoding schemes described herein. The left panel shows fluorescence intensity as a function of sequence length for PBMC cells using example protocol 1 plus bar codes. The right plot shows fluorescence intensity as a function of sequence length for cell lines using example protocol 1 plus bar codes. In summary, the results demonstrate that relatively complete barcoding was achieved with the dual probe barcoding using example scheme 1 and example scheme 2.

EXAMPLE 3 fixed RNA profiling of PBMC samples

PBMC samples can be paraformaldehyde fixed and then stored at 4 ℃ for 7 days. The immobilized cells (or nuclei or beads) can be treated according to the protocols described herein. Sequencing libraries can be prepared, enriched using 2000 gene immunooncology combinations and analyzed. Fig. 21A-C show example data comparing fixed cells to non-fixed control samples. Figure 21A shows a bar graph and shows that fixed cells exhibited stable cell type annotation in seven days of storage when compared to day 0 non-fixed control samples. Figure 21B shows a line graph of the combined reads per cell as a function of detected UMI. This data illustrates the comparable median of gene and UMI counts per cell. Fig. 21C shows a log plot of single gene UMI counts between day 0 and day 7 samples. A good correlation can be seen between single gene UMI counts between day 0 control and day 7. The results show that the fixed samples are effectively stabilized for 7 days, which allows further various manipulations after fixation, e.g. sample collection, storage, transportation, batching with other samples, etc.

In addition, fixed PBMCs can be treated according to the protocols described herein and the resulting library compared to fresh PBMCs treated with 3' single cell gene expression solutions (10 x Genomics). UMI detection on the 2000 genome illustrates comparable sensitivity between fresh and fixed workflows. In addition, cell type annotation was also similar between the two samples. The major PBMC cell types could be detected in both samples.

Example 4 multiplex assay barcoding of RNA templated ligation products and Probe associated reporter oligonucleotides

As described herein, it may be advantageous to determine multiple analytes in a cell, nucleus, or population of cell beads. As described herein, a cell, nucleus, or cell bead may be contacted with a characteristic binding group that comprises or is coupled to a reporter oligonucleotide (comprising a reporter sequence). The feature binding group may be coupled to one or more features (e.g., proteins) of the cell. The cell may also comprise a target nucleic acid molecule (e.g., an RNA molecule) for the assay.

In one example, cells are contacted with two sets of antibodies, as schematically depicted in fig. 22. The first set of antibodies ("antibody a") 2252 comprises reporter oligonucleotides having two target sequences. A second set of antibodies ("antibody B") 2253 comprises reporter oligonucleotides with capture sequences. The cells (e.g., 2200) are then contacted with a pair of probes. The pair of probes ("probe 1"2206 and "probe 2" 2216) are configured to hybridize to a first target region 2202 and a second target region 2204 of a nucleic acid molecule 2201 (e.g., mRNA) in a cell, thereby producing a probe-associated molecule 2230. At least one probe of the pair of probes may comprise a capture sequence (e.g., 2210 and/or 2218). Furthermore, in some cases, the probe pair 2206, 2216 is configured to hybridize to two target sequences of antibody a. In other cases, additional pairs of probes different from probes 1 and 2 (probe 3 and probe 4, not shown) may be provided, which may comprise complementary sequences to the target sequence of antibody A, and may hybridize to the reporter oligonucleotide of antibody A.

Subsequent bar coding (e.g., operation 2280) may be performed in the body or in a partition (e.g., a well or droplet). The barcode molecule 2220 comprising the first barcode sequence may hybridize directly or via a splint molecule to (i) the probe-associated molecule 2230 or a derivative thereof (e.g., a complement or amplification product thereof), (ii) an antibody a-probe pair complex comprising a probe pair that hybridizes to a reporter oligonucleotide of antibody a 2252 (e.g., probe 1 2206 and probe 2 2216 or probe 3 and probe 4 (not shown)), and/or (iii) antibody B2253 (e.g., via a capture sequence of the reporter oligonucleotide). The barcode molecule 2220 may optionally be coupled to a bead. In fig. 22, the barcode molecule is shown as hybridizing directly to the probe or capture sequence, but the barcode molecule hybridization may be mediated via a splint molecule (e.g., as shown in fig. 16A). Additional bar code adding operations (not shown) may also be performed. The barcoded molecules or derivatives thereof are then sequenced.

FIG. 23 shows example data resulting from such a bar code addition operation as described in FIG. 22. Fig. 23 shows gene expression profiles of four biomarkers (CD 4, CD8, CD3, and CD 14) obtained from sequencing of a barcoded RNA product (e.g., a barcoded, probe-associated molecule 2230). The intensity of the spots indicates the overlap between gene expression detected using the double probe and protein expression detected using the first set of antibodies ("antibody a") or the second set of antibodies ("antibody B"). In general, these figures demonstrate that the use of either set of antibodies (antibody a or antibody B) has similar coverage in detecting an analyte of interest. Thus, either or both sets of antibodies can be used to detect a protein analyte (e.g., CD4, CD8, CD3, CD 14). In some cases, it may be advantageous to use a first set of antibodies ("antibody a") because the dual probes used to barcode the reporter oligonucleotide allow for additional multiplex analysis or combined barcode, which allows for improved indexing and determination of the cell, sample, or partition source. Alternatively or in addition, the use of one or more probes to add a barcode to a reporter oligonucleotide may be used to supplement additional functional sequences (e.g., primers, capture sequences, UMI, barcode sequences, etc.) to the reporter oligonucleotide or derivative thereof.

In some cases, the data shown in fig. 23 may be generated without barcoding any RNA molecules. For example, it may be useful to compare the efficacy of using two different methods to characterize binding groups plus barcodes. In one example, referring again to fig. 22, a cell can be contacted with (i) a first set of antibodies ("antibody a") 2252 comprising reporter oligonucleotides having two target sequences and (ii) a second set of antibodies ("antibody B") 2253 comprising capture sequences. The cell (e.g., 2200) may then be contacted with a pair of probes. The pair of probes ("probe 1"2206 and "probe 2" 2216) are configured to hybridize to a target region of a reporter oligonucleotide of antibody a. At least one probe of the pair of probes may comprise a capture sequence (e.g., 2210 and/or 2218). The barcoding described above may be performed to produce two barcoded products, (i) an antibody A-probe pair comprising a probe pair (e.g., probe 12206 and probe 22216) that hybridizes to a reporter oligonucleotide of antibody A2252, and (ii) antibody B2253 (e.g., via a capture sequence). The barcoded products or derivatives thereof can then be sequenced and sequence reads can be overlapped to generate the map of FIG. 23. By comparing the barcoded products of antibody a and antibody B, it can be inferred that the barcoding efficiency using either method (antibody a versus antibody B) is similar and that either or both methods are viable when detecting an analyte (e.g., a protein). As described above, in some cases, it may be advantageous to use a first set of antibodies ("antibody a") because the dual probes used to barcode the reporter oligonucleotide allow for additional multiplex analysis or combined barcode, which allows for improved indexing and determination of cell, sample or partition sources. Alternatively or in addition, the use of one or more probes to add a barcode to a reporter oligonucleotide may be used to supplement additional functional sequences (e.g., primers, capture sequences, UMI, barcode sequences, etc.) to the reporter oligonucleotide or derivative thereof.

Example 5 multiplex assay barcoding of reporter oligonucleotides for RNA templated ligation products and characteristic binding groups

In one example, referring to fig. 22, a cell may be contacted with a set of antibodies ("antibody B") 2253, wherein the set of antibodies comprises a capture sequence. In some cases, the cells may be treated, e.g., subjected to fixation and/or permeabilization, which may occur before, after, or both before and after contacting the cells with the antibody. The cells (or immobilized and/or permeabilized cells) (e.g., 2200) are then contacted with a pair of probes. The pair of probes ("probe 1"2206 and "probe 2" 2216) are configured to hybridize to a first target region 2202 and a second target region 2204 of a nucleic acid molecule 2201 (e.g., mRNA) in a cell, thereby producing a probe-associated molecule 2230. Probes may optionally be attached to each other (e.g., using an extension reaction, a ligation reaction, and/or chemical ligation). At least one probe of the pair of probes may comprise a capture sequence (e.g., 2210 and/or 2218).

Subsequent bar coding (e.g., operation 2280) may be performed in the body or in a partition (e.g., a well or droplet). The barcode molecule 2220 comprising the first barcode sequence may hybridize directly or via a splint molecule to (i) the probe-associated molecule 2230 or a derivative thereof (e.g., a complement thereof or an amplification product), and/or (ii) the antibody B2253 (e.g., via a capture sequence). The barcode molecule 2220 may optionally be coupled to a bead. The barcoded molecules or derivatives thereof are then sequenced.

FIG. 24 shows exemplary data of gene expression and protein analysis data generated by the barcoding scheme described above. Each figure shows a plot of biomarkers (CD 14, CD8a, CD19, and CD 3) obtained from the sequencing of (i) a barcoded RNA product (e.g., a barcoded, probe-associated molecule 2230) and (ii) a barcoded antibody B (e.g., a barcoded reporter oligonucleotide). The intensity of the spots indicates the relative level of expression (e.g., gene expression or protein expression) detected from the barcoded product. In general, these figures demonstrate that a barcoded antibody product (e.g., a barcoded reporter oligonucleotide) has similar coverage as a barcoded RNA product (e.g., a barcoded, probe-associated molecule) when detecting a particular analyte or biomarker. Thus, the biomarker profile may be determined by assaying the biomarker protein (e.g., a reporter oligonucleotide plus a barcode that is a characteristic binding group that binds the biomarker protein), or may be determined by assaying biomarker gene expression (e.g., RNA plus a barcode using a dual probe for gene expression profiling). In some cases, both gene expression profiling and protein mass spectrometry can be used to characterize cells, for example, to determine the correlation between gene expression and protein expression.

EXAMPLE 6 overload of cells in partitions

The cells (or nuclei or cell beads) can be contacted with a characteristic binding group comprising a reporter oligonucleotide that identifies the characteristic or characteristic binding group and one or more probes (e.g., for hybridization to a target region of a target nucleic acid molecule (e.g., mRNA)).

As described elsewhere herein, the reporter oligonucleotide and/or one or more probes (or probe-associated molecules) may be barcoded in multiple partitions. The partitions may be overloaded such that fewer of the plurality of partitions are unoccupied. In one non-limiting example, a population of about 100,000 cells may be loaded into about 80,000 partitions.

If a partition is overloaded, there may still be many partitions containing single cells. Single cell partitions can be identified or filtered. For example, multiple partitions may be filtered (e.g., using 10x Genomics CellPlex) such that only singly occupied partitions are analyzed. Protein information and RNA information can be obtained from the singly occupied partition.

For multiplex partitioning (comprising more than one cell), protein information (from the reporter oligonucleotide) may be inferred, for example, using gene expression and protein profile (e.g., obtained from a single cell analysis) for cells with similar profiles. Examples of such cell overload can be used to reduce reagent waste while providing useful, multiplexed analytical data regarding gene expression and protein mass spectrometry in individual cells.

EXAMPLE 7 immobilization of cells, nuclei and/or cell beads

Cells, nuclei and/or cell beads may be immobilized. In some cases, immobilization may be performed prior to hybridization of the probe molecules described herein. Exemplary protocols and reagent lists for immobilizing samples containing cells, nuclei or cell beads are listed below.

Preparation of buffer

Reagent and consumable

Example protocol

A. up to 1000 ten thousand cells were thawed with warm medium.

B. centrifuge at 300g for 5min at 4 ℃.

C. The supernatant was removed without disturbing the cell pellet and the cell pellet was resuspended with 1mL of cold cell resuspension buffer.

D. transfer to 1.5mL tubes and measure concentration and viability. If the cell suspension had visible clumps of debris, it was filtered with Flowmi and counted again.

E. Centrifuge at 300g for 5min at 4 ℃.

F. The supernatant was removed without disturbing the cell pellet.

G. Using a conventional calibre pipette tip, 1mL of the fixing buffer was added to the cell pellet, and the mixture was gently pipetted 15 times.

H. incubate for 1 hour at room temperature.

I. At the end of the fixation, 1mL aliquots of quench buffer were prepared. Cooled on wet ice.

J. Centrifuge at 850g for 5min at room temperature.

K. The supernatant was removed without touching the bottom of the tube to avoid sucking away the pellet.

Cell pellet was resuspended in 1mL ice-cold quench buffer. Stored on ice.

M. store the immobilized cells.

EXAMPLE 8 multiplex analysis of RNA and proteins in double immobilized Single cells

The methods disclosed herein can be used to determine a variety of analytes in a single cell. In some cases, two analytes of a plurality of cells can be determined (i) using a pair of probes (e.g., comprising sequences complementary to the target region of the RNA) to determine the RNA, and (ii) using a characteristic binding group (e.g., an antibody) comprising a reporter oligonucleotide to determine the peptide, polypeptide, or protein. For example, by measuring gene and protein expression within a cell, RNA and protein data can be correlated to better understand transcriptome and proteomic profiles within a single cell.

In one example, a plurality of cells can be immobilized and permeabilized and contacted with (i) a plurality of probes comprising a first probe and a second probe and (ii) an antibody comprising a reporter oligonucleotide. The first probe and the second probe can hybridize to a first target region and a second target region of an intracellular RNA molecule to produce a probe-associated molecule, and the antibody can bind to a target protein on or within the cell. Barcoding can then be performed, for example, in a partition, to barcode the probe-associated molecules and reporter oligonucleotides. The barcoded molecules (e.g., barcoded, probe-associated molecules or derivatives thereof, and barcoded reporter oligonucleotides or derivatives thereof) can be sequenced and assigned to single cells based on the barcode sequence.

Various parameters for preparing RNA and protein molecules to be barcoded in cells can be tested. In some cases, it may be advantageous to provide additional immobilization operations, for example, after contacting the antibody with the target protein (also referred to herein as "antibody staining"), which may aid in immobilizing the antibody to the target protein during downstream processing (e.g., bar-coding). In some cases, antibody staining may be performed before or after hybridization of the first and second probes (also collectively referred to as "probes"). In some cases, the fixation or permeabilization of the cells can be performed using different fixation and permeabilization methods. In some cases, it may be advantageous to quench the antibody, for example, in a blocking buffer. Such example parameters may be tested by experimentation.

For example, a variety of cell fixation protocols may be performed (e.g., as shown in fig. 29). A plurality of experimental groups 1. Negative control groups were used in which cells were contacted with the reporter oligonucleotide conjugated antibodies, immobilized and permeabilized, quenched, and then contacted with the first and second probes. Group A comprising immobilizing and permeabilizing and optionally quenching cells (e.g., in a blocking buffer comprising bovine serum albumin (0.5%) and tween (0.01%) followed by contacting with antibodies, group 3. Group B comprising immobilizing and permeabilizing and optionally quenching cells (e.g., in a blocking buffer comprising bovine serum albumin (0.5%) and tween (0.01%) followed by contacting with antibodies, re-immobilizing, re-quenching, followed by contacting with probes, group 4. Group C comprising immobilizing and permeabilizing and optionally quenching cells (e.g., in a blocking buffer comprising bovine serum albumin (0.5%) and tween (0.01%) followed by contacting with antibodies, re-immobilizing and permeabilizing, quenching, followed by contacting with probes, group 5.D comprising immobilizing and permeabilizing and optionally quenching cells (e.g., in a blocking buffer comprising bovine serum albumin (0.5%) and tween (0.01%) followed by contacting with probes, followed by contacting with antibodies, group 6.E comprising immobilizing and permeabilizing cells (e.g., in a blocking buffer comprising bovine serum albumin (0.5%) and tween (0.01%) followed by contacting with antibodies, group 5% and optionally contacting with a blocking buffer (e.g., 0.01%) with a commercial solution comprising bovine serum albumin and 0.5%)Reagent immobilization and permeabilization of cells, usePermwash washing the cells, contacting the cells with antibody, followed by contact with a probe, group 8.G, usingReagent immobilization and permeabilization of cells, usePermwash washing the cells, contacting the cells with the antibody, quenching, re-fixing and permeabilizing, quenching, and then contacting with the probe. All groups can then be barcoded (e.g., for probes or probe-associated molecules or derivatives thereof and antibody reporter oligonucleotides or derivatives thereof) sequenced, and RNA and reporter oligonucleotides (indicating the presence of target proteins in cells) can be associated with single cells.

In one experimental setup, PBMC cells were used. The cells were contacted with a reporter oligonucleotide conjugated antibody (dG 9) (ab 270703) against perforin and a granzyme B (QA 18a 28) antibody.

Fig. 30A shows example data from the experimental set listed above. From left to right, the samples indicate the fraction of antibody reads that are "useful" (e.g., can be ascribed back to a single cell from a barcode sequence) for the negative control, group a, group B, group C, group D, group E, group F, group G. As can be seen in fig. 30A, performing antibody staining prior to probe hybridization (e.g., groups a, B, and C) resulted in a higher percentage of available antibody reads, and performing additional immobilization operations (groups B, C, and G) after antibody staining further increased the fraction of antibody reads (compared to groups a, D, E, F without second immobilization). As expected, the negative control group (fixed and permeabilized after antibody staining) produced a low fraction of available antibody reads. Interestingly, samples that were antibody stained after probe hybridization and not re-immobilized (groups D and E) produced low fractions of available antibodies, indicating that in some cases it may be useful to perform antibody staining prior to probe hybridization to obtain higher available read counts.

Fig. 30B shows example data from the same experiment indicating the results of a second fixation operation after antibody staining. These figures indicate the antibody densities detected in terms of unit cell density using sequencing and bar code identification for two proteins, perforin (left) and granzyme (right). Three conditions were divided into antibody post-staining immobilization (e.g., group B, group C, and group G), negative control, and no secondary immobilization.

For perforin, one initial peak was shown in all three conditions, which may be due to background signal. No substantial differences between the groups were observed. For granzyme, the negative control (labeled as negative control, 2) had the first two peaks (which may be due to background signal) and the third peak (which may be due to non-specific binding of the antibody). For no second fixation condition (labeled none, 3), a single peak was observed. For the conditions of fixation after antibody staining (labeled as fixation after antibody addition, 1), two peaks are observed, which may indicate two cell populations, which may be one negative population (e.g., monocytes with background signal or non-specific staining) and one positive population (e.g., specific staining for cells with higher signal for the second fixation, e.g., natural killer cells and/or cytotoxic T cells) or possibly two positive populations. Further studies may attempt to elucidate specific populations, for example, by running isotype controls.

FIG. 31 shows example gene expression data from the experimental groups listed above. From left to right, the samples indicate the median number of genes detected (e.g., by sequencing the probe-associated molecules or derivatives thereof) in the negative control, group a, group B, group C, group D, group E, group F, group G. As can be seen in FIG. 31, additional fixation operations (panels B, C and G) following antibody staining can reduce the number of genes (e.g., sensitivity) detected by the assay. Thus, when using a second fixation procedure, a compromise between antibody sensitivity and gene expression sensitivity can be observed. The negative control group (fixation and permeabilization after antibody staining) allows the detection of relatively high numbers of genes.

FIGS. 32-37 show exemplary data for gene expression and antibody staining results for some of the experimental groups described above. FIGS. 32A-C show t-SNE plots for negative control groups (cells stained with antibody prior to fixation and permeabilization). Fig. 32A shows a graph of different immune cell clusters, wherein ellipses indicate natural killer cells and cytotoxic T cell types, fig. 32B shows a gene expression profile of GZMB in immune cells (e.g., produced by barcoding molecules associated with probes or probes targeting GZMB), and fig. 32C shows an antibody staining profile in immune cells. GZMB gene expression profile indicates that GZMB is expressed in natural killer cells and cytotoxic T cells. Antibody staining showed some non-specific staining on monocytes and B cells and limited staining on natural killer cells.

FIGS. 33A-C show t-SNE plots for group D (immobilized and permeabilized, cells contacted with probe, then stained with antibody). FIG. 33A shows a graph of different immune cell clusters, where ellipses indicate natural killer cells and cytotoxic T cell types, FIG. 33B shows the gene expression profile of GZMB in immune cells (e.g., produced by barcoding molecules associated with probes or probes targeting GZMB), and FIG. 33C shows antibody staining in immune cells. For group D cells, GZMB gene expression profile indicates expression of GZMB in natural killer cells and cytotoxic T cells. Antibody staining showed some non-specific staining on monocytes and B cells and some specific staining on natural killer cells.

FIGS. 34A-C show t-SNE plots for group B (cells immobilized and permeabilized, contacted with antibody, re-immobilized, and then contacted with probe). Fig. 34A shows a graph of different immune cell clusters, wherein ellipses indicate natural killer cells and cytotoxic T cell types, fig. 34B shows a gene expression profile of GZMB in immune cells (e.g., produced by barcoding molecules associated with probes or probes targeting GZMB), and fig. 34C shows an antibody staining profile in immune cells. For group B cells, GZMB gene expression profile indicates expression of GZMB in natural killer cells and cytotoxic T cells. Antibody staining showed some non-specific staining on monocytes and more intense specific staining on natural killer cells compared to group D cells.

FIGS. 35A-C show t-SNE plots for group E (immobilized and permeabilized, cells contacted with probe, then stained with antibody in blocking solution). Fig. 35A shows a graph of different immune cell clusters, wherein ellipses indicate natural killer cells and cytotoxic T cell types, fig. 35B shows a gene expression profile of GZMB in immune cells (e.g., produced by barcoding molecules associated with probes or probes targeting GZMB), and fig. 35C shows an antibody staining profile in immune cells. For group E cells, GZMB gene expression profile indicates expression of GZMB in natural killer cells and cytotoxic T cells. Antibody staining showed some nonspecific staining.

Fig. 36A-C show group F (using commercially availableThe cells that the kit immobilizes and permeabilizes, stains with antibodies, and then contacts with the probe). Fig. 36A shows a graph of different immune cell clusters, wherein ellipses indicate natural killer cells and cytotoxic T cell types, fig. 36B shows a gene expression profile of GZMB in immune cells (e.g., produced by barcoding molecules associated with probes or probes targeting GZMB), and fig. 36C shows an antibody staining profile in immune cells. For group F cells, GZMB gene expression profile indicates expression of GZMB in natural killer cells and cytotoxic T cells. Antibody staining showed some nonspecific staining.

Fig. 37A-C show group G (using commercially availableCells that were fixed and permeabilized, stained with antibody, re-fixed, and then contacted with probes) in a kit. Figure 37A shows a graph of different immune cell clusters, where ellipses indicate natural killer cells and cytotoxic T cell types, figure 37B shows the gene expression profile of GZMB in immune cells (e.g., produced by barcoding molecules associated with probes or probes targeting GZMB), and figure 37C shows antibody staining profile in immune cells. For group F cells, GZMB gene expression profile indicates expression of GZMB in natural killer cells and cytotoxic T cells. Antibody staining showed some nonspecific staining in monocytes and other cells, but preferential staining on natural killer cells. A more specific staining was observed at the second fixation compared to the absence of the second fixation.

Taken together, these results indicate that under certain conditions, some specific staining occurs on natural killer cells and cytotoxic T cells. The greatest specificity was observed in the samples, where the cells were fixed and permeabilized, stained (contacted with antibody), re-fixed, and then contacted with the probes. Since some nonspecific staining of monocytes was observed, the specificity of antibody staining could be assessed or observed by depletion of monocytes. Overall, these results indicate that the second immobilization process may help improve protein expression signals when detecting multiple analytes (e.g., protein and RNA).

Example 9 multiplex assay barcoding of reporter oligonucleotides for RNA templated ligation products and characteristic binding groups

The methods described herein can be used to determine a variety of analytes in a cell, nucleus, or population of cell beads. As described herein, a cell, nucleus, or cell bead may be contacted with a characteristic binding group that comprises or is coupled to a reporter oligonucleotide (comprising a reporter sequence). The feature binding group may be coupled to one or more features (e.g., proteins) of the cell. The cell may also comprise a target nucleic acid molecule (e.g., an RNA molecule) for the assay.

FIG. 38 illustrates another exemplary multiplex workflow for determining cellular characteristics (e.g., proteins) and target nucleic acid molecules (e.g., RNA molecules). May be prepared, for example, in 4% formaldehyde and 0.01% tween-20 or a commercially available immobilization and permeabilization buffer (e.g., commercially availableFixation and permeabilization buffer), the cells, nuclei or beads are immobilized and permeabilized. In some cases, the cells, nuclei, or cell beads may be contacted with one or more characteristic binding groups comprising a reporter oligonucleotide, either before or after immobilization and permeabilization. One or more characteristic binding groups may specifically bind to a cellular characteristic (e.g., a specific protein), if present on or within a cell, nucleus, or cell bead. In some cases, the reporter oligonucleotide can be used to identify a characteristic binding group, thereby identifying the presence or absence of a target cell characteristic (e.g., a specific protein). For example, a plurality of cells may be contacted with a plurality of feature binding groups, which may be the same or different, and which may comprise the same or different reporter oligonucleotides. In one non-limiting example, one cell of the plurality of cells can be contacted with different feature binding groups, which can bind to different cellular features (e.g., different surface or intracellular proteins). Since each feature binding group comprises a reporter oligonucleotide having a barcode sequence identifying the feature binding group, the presence of such different cellular features (e.g., different surface or intracellular proteins) can be assessed (e.g., via sequencing) by the presence of the barcode sequence.

Alternatively or in addition to contacting the cell, nucleus, or cell bead with one or more characteristic binding groups, the cell, nucleus, or cell bead may be contacted with a first probe and a second probe to produce a probe-associated molecule (e.g., a probe-associated RNA molecule), as described herein. For example, a cell, nucleus, or cell bead that can optionally be immobilized and permeabilized can comprise a target nucleic acid molecule (e.g., an RNA molecule) having a first target region and a second target region. The first probe may comprise a first probe sequence at least partially complementary to the first target region and the second probe may comprise a second probe sequence at least partially complementary to the second target region. Hybridization of the first probe sequence to the first target region and hybridization of the second probe sequence to the second target region may be sufficient to produce a probe-associated molecule.

In some cases, a cell, nucleus, or cell bead may be contacted with a plurality of different probes. If present, a plurality of different probes can specifically hybridize to a target region of a target nucleic acid molecule. In some cases, the probe sequence may comprise a probe barcode sequence that may be used to identify the probe. For example, a plurality of cells may be contacted with a plurality of probes, which may be the same or different, and may comprise the same or different sequences (e.g., barcode sequences, probe sequences, adapter sequences). In one non-limiting example, one cell of the plurality of cells can be contacted with a different probe that can hybridize to a different target region of a target nucleic acid molecule (e.g., an RNA molecule). Each probe may comprise a probe barcode sequence identifying the probe, and the presence of such different target sequences may be assessed (e.g., via sequencing) by the presence of the probe barcode sequence or probe sequence. In some cases, the probe barcode sequences may be used to identify the starting sample or deconvolute the sequence and identify the sequence as being derived from a cell, nucleus, or cell bead (e.g., as shown in fig. 10).

After the cells, nuclei, or cell beads are contacted with the probes (e.g., the first probe and the second probe), the cells, nuclei, or cell beads can be washed to remove any unbound or unhybridized probes. The cells, nuclei, or cell beads may then be partitioned (e.g., in a droplet or well) for bar coding, as described herein. In one non-limiting example, cells, nuclei, or cell beads may be separated along with nucleic acid barcode molecules (shown coupled to the beads in fig. 38). The nucleic acid barcode molecule may comprise a barcode sequence and a capture sequence complementary to the sequence of one of the probes (e.g., the first probe or the second probe). The nucleic acid barcode molecule may comprise additional sequences, e.g., UMI, primer sequences, sequencing primer sequences (e.g., P5, P7, R1, R2 sequences). The capture sequence of the nucleic acid barcode molecule may anneal to the complementary sequence of one of the probes (e.g., the first probe or the second probe), and optionally an extension reaction may be performed to produce a barcode-loaded nucleic acid molecule comprising a barcode sequence or its complementary sequence and a sequence of at least one probe or its complementary sequence.

In some cases, the nucleic acid barcode molecule capture sequence may also anneal to the sequence of the reporter oligonucleotide (not shown in fig. 38) if the cell, nucleus, or cell bead contains a characteristic binding group coupled thereto. In some cases, an extension reaction can be performed to produce an additional barcoded nucleic acid molecule comprising the sequence of the reporter oligonucleotide or its complement and the barcode sequence or its complement.

After barcoding, the barcoded nucleic acid molecules and the additional barcoded nucleic acid molecules can be removed from the partition and subjected to conditions sufficient for sequencing, such as amplification, purification, sample index PCR, and the like. Such an example workflow can be used to obtain multiple information about cellular features (e.g., proteins) and correlate those features with nucleic acid information (e.g., presence or genotype of a target nucleic acid molecule (e.g., RNA)).

It should be appreciated that the processes described herein may be performed in any useful or convenient order. For example, for a cell, nucleus or cell bead, immobilization, permeabilization, contact with a characteristic binding moiety, and contact with a first probe and a second probe can occur in any useful order, and can be repeated any number of times. Any of these processes (e.g., immobilization, permeabilization, contact with a feature binding group, and contact with a first probe and a second probe) can be performed in bulk or in a partition.

EXAMPLE 10 RNA templated ligation for Whole transcriptome analysis in tissue samples

The methods described herein can be used to determine nucleic acid molecules (e.g., mRNA) in a tissue sample, e.g., a fresh tissue sample, a frozen (e.g., flash frozen) tissue sample, and the like. In some cases, whole transcriptome analysis may be performed in a tissue sample. In one such example, the tissue sample may comprise an mRNA molecule that may be contacted with a plurality of first probes and second probes. The plurality of first probes and second probes may comprise a set of full transcriptome analysis probes such that hundreds, thousands, or millions of RNA targets may be analyzed. For example, the plurality of first and second probes may comprise thousands of different first and second probes that may hybridize to different target sequences (e.g., coding or non-coding) of mRNA. In summary, the plurality of first probes and second probes may have sufficient sequence diversity and coverage to analyze the entire transcriptome of the sample. The plurality of first probes and second probes can comprise a gene-specific sequence that can be species-specific (e.g., capable of distinguishing between different animal cell types, such as human and mouse).

In some cases, using a dual probe (e.g., using a first probe and a second probe that hybridize to a first target region and a second target region, respectively, of an mRNA molecule) approach for mRNA analysis may be advantageous in providing higher analyte sensitivity, improving efficiency of barcoding, and/or discriminating a greater number of barcodes, UMIs, or both, than using a single probe (e.g., 3' single cell gene expression solution (10 xGenomics)). Table 1 shows example data for comparison of the amount of UMI detected in flash frozen human and mouse tissue samples subjected to whole transcriptome analysis using either (i) the single probe method (e.g., labeled as single cell 3' ("SC 3P") in Table 1) as shown and described in FIG. 12B, or (ii) the dual probe method (e.g., labeled as RNA templated ligation ("RTL") in Table 1) as shown in the nucleic acid analysis in FIGS. 16A-16B. Five different human samples from liver, colon, jejunum, ileum, testis, and one mouse sample from brain were tested. All samples were flash frozen. Each column of the numerical columns of table 1 shows the number of UMIs detected at 5,000 combined reads per cell ("PRPC") or 10,000 PRPCs in the RTL (dual probe full transcriptome analysis) and SC3P (single probe full transcriptome analysis) methods. As can be seen from table 1, the RTL workflow enabled the detection of higher amounts of UMI in all the different quick frozen tissue samples.

Table 1. Comparison of the amount of UMI detected when performing whole transcriptome analysis in quick frozen tissue samples using either single probe or double probe methods.

Similarly, table 2 shows example data for comparison of the amount of UMI detected in fresh mouse tissue samples using either (i) single probe method ("SC 3P") or (ii) double probe method ("RTL") for whole transcriptome analysis. Five different mouse samples from brain, colon, kidney, lung and liver were tested. All samples were fresh. Each column of the numerical columns of table 2 shows the number of UMIs detected at 5,000 combined reads per cell ("PRPC") or 10,000 PRPCs in the RTL (dual probe full transcriptome analysis) and SC3P (single probe full transcriptome analysis) methods. It can also be seen from table 2 that the RTL workflow enabled the detection of higher amounts of UMI in all fresh tissue samples.

Table 2. Comparison of the amount of UMI detected when whole transcriptome analysis is performed in fresh tissue samples using either the single probe or the double probe approach.

Taken together, these data demonstrate that analysis of mRNA using the dual probe method provides a sensitive method for determining total transcriptome in tissue samples.

EXAMPLE 11 preparation of samples from fixed tissue

The methods described herein can be used to prepare a fixed tissue sample for analysis. The first workflow of preparing the sample may include fixing and sectioning at least a portion of a fixed tissue (e.g., a formalin-fixed paraffin-embedded tissue sample). Tissue may be sectioned into one or more 10 to 50 micron rolls, which may then be placed into a sample tube. 3 milliliters (mL) of a non-polar solvent (e.g., xylene, neo-clear, orange, etc.) may be added to one or more rolls such that one or more rolls are fully submerged and allowed to stand for 5 minutes. The non-polar solvent may then be removed and discarded and the process repeated two more times. After addition of the non-polar solvent, 3mL of 100% ethanol may be added to the sample tube and incubated with one or more rolls for 30 seconds at ambient temperature. Ethanol may be aspirated and discarded using a pasteur pipette. This process can be repeated with 1mL of 70% ethanol, 1mL of 50% ethanol, and 1mL of double distilled water.

Subsequently, 1mL of Phosphate Buffered Saline (PBS) may be added to the sample tube at ambient temperature and then removed without disturbing the one or more rolls. Optionally, 1mL of citrate pH 8.0 cross-linking buffer may be added to the sample tube. If an optional citrate buffer is added, the sample tube may be incubated at 70 degrees celsius for 30 minutes to 1 hour. The sample may then be centrifuged at 850-2000 relative centrifugal force (rcf). Whether or not the optional addition of citrate is performed, the dissociation solution comprising, for example, a release enzyme with low pyrolysis (TL), a release enzyme with medium pyrolysis (TM), a release enzyme with high pyrolysis (TH), collagenase, etc., or any combination thereof, may be heated at 37 degrees celsius for 10 minutes. 2mL of the preheated dissociation solution may be added to the sample and incubated at 37 degrees celsius for 20 minutes to 1 hour while intermittently shaking the sample tube. The roll may then be dispersed 15 to 20 times (e.g., until the solution begins to become cloudy) using a silanized glass pipette to obtain a cell suspension.

Dissociated tissue may pass through the 70 micron filter and any tissue pieces may be pushed through the filter with the rear of the syringe plunger to maximize cell yield and remove debris and undissociated tissue pieces. Additional washing of the 70 micron filter can be performed by adding 2mL of PBS to the filter. Filtrate from the additional wash may be added to filtrate from the first filtration operation. The combined filtrate may be centrifuged at 850-2000rcf for 5 minutes, the supernatant may be removed without disturbing the pellet formed by centrifugation, and the pellet may be resuspended in 1mL of cooled PBS.

The cell concentration of the resuspended solution can then be determined using a fluorescent dye (e.g., ethidium homodimer-1, etc.) or an AO/PI staining solution, etc., using a Countess II FL automated cell counter, cellaca MX high throughput automated cell counter, etc. The resuspended solution can then be used as a sample for the methods and systems described elsewhere herein (e.g., RNA profiling, etc.).

The second workflow may include fixing and sectioning at least a portion of a fixed tissue (e.g., a formalin-fixed paraffin-embedded tissue sample). Tissue may be sectioned into one or more 10 to 50 micron rolls, which may then be placed onto a slide. 1 milliliter (mL) of a non-polar solvent (e.g., xylene, neo-clear, orange, etc.) may be added to one or more rolls such that one or more rolls are fully submerged and allowed to stand for 5 minutes. The non-polar solvent may then be removed and discarded and the process repeated two more times. After addition and removal of the non-polar solvent, 1mL of 100% ethanol may be added to the slide and incubated with one or more rolls for 30 seconds at ambient temperature. Ethanol may be aspirated and discarded using a pasteur pipette. Ethanol addition may be configured to ethanol hydration (e.g., the amount of water mixed with ethanol may increase with each addition. The process may be repeated with 1mL of 70% ethanol, 1mL of 50% ethanol, and 1mL of double distilled water.

Subsequently, 1mL of Phosphate Buffered Saline (PBS) may be added to the sample tube at ambient temperature and then removed without disturbing the one or more rolls. Subsequently, one or more tissue rolls may be finely minced on a slide and placed into a sample tube. The dissociation solution comprising, for example, a release enzyme with low pyrolysis (TL), a release enzyme with medium pyrolysis (TM), a release enzyme with high pyrolysis (TH), collagenase, etc., or any combination thereof, may be heated at 37 degrees celsius for 10 minutes. 2mL of the preheated dissociation solution may be added to the sample and incubated at 37 degrees celsius for 20 minutes to 1 hour while intermittently shaking the sample tube. The roll may then be dispersed 15 to 20 times (e.g., until the solution begins to become cloudy) using a silanized glass pipette to obtain a cell suspension.

EXAMPLE 12 time series analysis of T cell responses

Stimulation of T cells with anti-CD 3 and anti-CD 28 can be used to study antigen-specific T cell responses. The use of single cell transcriptomics may reveal the activation of gene level drives and/or association with cell surface protein activation markers. Measuring single cell gene expression (such as T cell stimulation) on a microscale timeframe can be difficult due to the logistic effort of sampling, counting, and processing cells. Single cell level analysis of these rapid processes can be achieved by freezing the sample on a short time scale using fixatives.

For example, two peripheral blood mononuclear cell samples can be cultured and stimulated using anti-CD 3 and anti-CD 28, and the resulting stimulated cells can be sampled at 6 different time scales, as shown in fig. 39A. Samples may be fixed at different times to freeze the samples in the state of those time frames, multiplexed into the analysis chip, and sequenced. As seen in fig. 39B, each fixed time sample can be analyzed and cells can be prepared as described elsewhere herein (e.g., example 11). The overlay of fig. 39B may show the clustering of monocytes, T cells and B cells. Figure 39C shows that FOS expression of T cell populations can be widely observed after T cell activation. Similarly, an up-regulation of CD69 expression can be observed and detected at the RNA level within 15 minutes of stimulation. This may be faster than protein level detection of CD69, which may take an hour or more to fully translate CD69 for detection. Similarly, in fig. 39D, transcripts of interleukin 2 (IL 2) and interferon gamma (IFN- γ) can be detected at elevated levels after stimulation. FIG. 39E shows the expression of VISR genes, which can encode a T cell activated V domain Ig inhibitor (VISTA) protein. The protein can be produced at high levels in activated T cells and tumor-infiltrating T cells. The VISR gene may be transcribed at higher levels after activation in activated T cells for an hour or more.

Example 13 sample immobilization and Probe hybridization

Immobilized samples can be generated using 4% formaldehyde as an immobilization reagent and a 10X Genomics immobilized RNA sample preparation kit. The sample may be fixed at ambient temperature for 1 hour or at 4 degrees celsius for 16 to 24 hours. For samples fixed at low temperatures (e.g., 4 degrees celsius), the use of the same fixing time can reduce the batch effect observed between samples. For example, each sample may be fixed for 16 hours to reduce batch effects between samples. After fixation, the samples may be stored at-80 degrees celsius until further processing is performed. For example, the sample may be maintained at-80 degrees celsius for years prior to analysis without significantly degrading the performance of the nucleic acid sequencing process performed on the sample.

EXAMPLE 14 fixed tissue preparation method

Fig. 44-46 illustrate examples of methods of preparing a fixed sample according to some embodiments. The method of this example can be run on two 25 micron rolls per sample. In this embodiment, both the expert and novice user verify the methods.

For the various methods of this embodiment, the tissue section transfer and rehydration sections may be similar. In other words, the tissue rehydration process may not be related to the dissociation method used on the tissue sample. Tissue rehydration may be as described elsewhere herein, e.g., example 11. Dissociation may include the use of an automatic dissociator (e.g., GENTLEMACS OCTO dissociator) (fig. 44-45) or a manual dissociator (fig. 46). The difference between fig. 44 and 45 may be that fig. 44 uses MILTENYI FFPE kits, while fig. 45 uses the methods and systems of the present disclosure.

Automatic dissociation may include adding 2mL of the dissociating enzyme mixture (e.g., 1 mg/mL of the releasing enzyme TH mixture) to the rehydrated sample stored in the dissociation tube that is rehydrated as described elsewhere herein. Once the tube is sealed with the lid, the sample can be dissociated using an automatic dissociator, followed by centrifugation at 300 rcf. After dissociation, the pellet may be resuspended in supernatant as described elsewhere herein and filtered through a 70 micron filter. The filters may be washed with Phosphate Buffered Saline (PBS) to increase yield and then centrifuged at 850rcf for an additional 5 minutes. The supernatant may be removed from the resulting pellet and the pellet may be resuspended in 0.5mL of tissue resuspension buffer. Samples may be counted and analyzed as described elsewhere herein.

Manual dissociation may include adding 100 microliters of 1mg/mL of the release enzyme TH and using a manual dissociation pestle to dissociate the rolls. 900 microliters of the dissociating enzyme mixture can be used to clean the top of the pestle. All 1mL of the solution may be incubated at 37 degrees celsius for 45 minutes (e.g., in a water bath, in a hot mixer at 800rpm, etc.). After incubation, the sample may be titrated by pipette mixing ten to twenty times, then through a 70 micron filter. The filter may be washed with 2mL of buffer to increase yield, then the filtered solution is centrifuged at 850rcf for five minutes, the supernatant discarded, 0.5mL of resuspension buffer added, and the sample prepared for further analysis.

In these embodiments, the validation provides a fraction of available reads of at least about 40%, a fraction of reads of at least about 80% of the mapped filtered probe set, a fraction of reads in cells on at least about 60% of the high quality preparation, and at least about 90% of the effective barcode. For a 2 25 micron section, the yield of the examples is at least about 400,000 cells per sample, with an observed recovery of at least about 50% of the cells to the expected cells. In addition, the ratio of pseudoglobal gene expression to cytassist assay is at least about 0.7, and the median observed recovery of targeted gene cells-UMI is at least about 0.8.

Fig. 47 provides a table of the pass percentages and other characteristics of the various procedures of the present example, according to some embodiments. The table shows enhanced performance of the methods of the present disclosure relative to other sample preparation methods, and this data is further visible in fig. 48, which shows examples of various organizations (labeled 4801, 4802, and 4803, respectively) processed by the workflows of fig. 44-46, according to some embodiments. The table shown in fig. 47 provides Design Input Requirements (DIRs) that may correspond to measured success thresholds or verification thresholds. For example, the validation threshold for cell yield from two rolls may be greater than or equal to 400,000 cells.

EXAMPLE 14 Single cell Whole transcriptome analysis

Formalin Fixed Paraffin Embedded (FFPE) tissue samples may be a large reservoir of human tumor specimens. Analysis of gene expression profiles of FFPE samples can provide insight into the etiology of the disease. The methods described herein can be used to perform profiling of gene expression in FFPE samples at single cell resolution without RNA degradation or with significantly reduced RNA degradation. The methods described herein can allow single cell FFPE sequencing for detection of whole transcriptome gene expression in dissociated FFPE tissue via in situ hybridization, ligation, and barcoding of RNA-specific probe pairs.

FFPE sections from nine different human cancer tissue pieces can be subjected to spectral analysis using the methods described herein. FFPE section tissue samples (e.g., rolls) of each cancer mass can be dissociated into single cell suspensions using a combination of enzymatic digestion and mechanical dissociation. Dissociated cells may be processed using the workflow described elsewhere herein, e.g., as shown in fig. 42. The gene and UMI per cell can demonstrate sensitivity to single cell assays of FFPE tissue at single cell resolution. The number of cells after dissociation of two 25 μm sections of each tissue type may be sufficient for hybridization and separation, e.g. into droplets. Table 3 shows example gene expression metrics for tissue analyzed by profiling. Gene and UMI metrics can be measured at 10,000 original reads per cell. The cell yield after dissociation of tissue sections can be listed along with the tissue cross-sectional dimensions.

Table 3. Gene expression metrics for each tissue analyzed by the profile.

Fig. 49A-49D illustrate example cell type clusters in single cell FFPE. Fig. 49A shows an example Uniform Manifold Approximation and Projection (UMAP) projection of single cells taken from prostate cancer FFPE. Cell types can be annotated manually. FIG. 49B shows MYH11 gene expression, which shows higher expression in perivascular-like (PVL) cell clusters. Fig. 49C shows an example UMAP projection of single cells taken from colorectal cancer FFPE. FIG. 49D shows MZB1 gene expression, which shows higher expression in plasma cell clusters.

Example 15 isolation of cells from FFPE tissue sections

Cells can be isolated from formaldehyde-fixed and paraffin-embedded (FFPE) tissue sections for use in the methods described elsewhere herein. Cell separation may include deparaffinization and rehydration of fixed tissue sections. The dewaxed and rehydrated tissue slices can be dissociated using a dissociating enzyme mixture and a lump pestle or GENTLEMACS OCTO dissociator. Dissociated cells may be resuspended in tissue resuspension buffer or quenching buffer prior to cell counting. At least 400,000 cells can be used for single cell analysis. At least 100,000 cells can be used for multiplex cell analysis. Prior to large multiplex studies, 10,000 cells per probe barcode can be used to run pilot multiplex assays for single or four sample multiplex assays. For multiplex analysis of sixteen samples, each probe barcode can be loaded with 8,000 cells. Cell separation can be demonstrated using FFPE tissue samples in the range of one to ten years of age. Each tissue sample may produce a different amount of material and data quality depending on age, tissue type, pre-fixation tissue mass, tissue density, and size and area of tissue in the roll.

Cell separation can be demonstrated using two or more 25 micrometer (μm) or 50 μm tissue sections from different tissue type spectra. For human tissue, two or more 25 μm slices may be used. For mouse tissue, two or more 50 μm sections may be used. Some tissue types may use more than two sections to generate enough cells for further processing. FFPE block characteristics may also affect yield. In some cases, a single 25 μm or 50 μm tissue section may produce enough cells for further processing.

The tissue may be sectioned and dissociated using a pestle dissociator or GENTLEMACS OCTO dissociator protocol. For pestle dissociation, tissue sections (e.g., rolls) may be transferred to a 1.5 milliliter (mL) tube for dewaxing. Dewaxed samples may be dissociated using a pestle. Alternatively, tissue slices (e.g., rolls) may be transferred into GENTLEMACS C tubes for dewaxing. The dewaxed sample may then be processed by Octo dissociators.

The degree of dehydration of FFPE blocks can be checked prior to cell separation. The tissue may be left to black to remove a portion of the paraffin and expose the tissue. The facing tissue mass may be placed back into ice water for a time sufficient to allow the mass to face the water to rehydrate the tissue mass. Using a microtome, the first 5 μm tissue can be removed and disposed of. Microtomes can be used to slice fixed tissue into 25 μm or 50 μm sections (e.g., rolls). After discarding the first section, additional sections of the rehydrated FFPE block can be transferred to a 1.5mL tube or GENTLEMACS C tube while leaving the rolls intact. The rolls may be stored at 4 ℃ for up to one week. 1mL of xylene may be added to a 1.5mL tube, or 3mL of xylene may be added to a GENTLEMACS tube. The tube may be incubated for 10 minutes at room temperature. With a Pasteur pipette, liquid can be removed from the tube without braking or otherwise damaging the roll. The xylene wash may be repeated for a total of three washes. 1mL of 100% ethanol may be added to the 1.5mL tube, or 3mL of 100% ethanol may be added to the GENTLEMACS C tube. The tube may be incubated for 30 seconds at room temperature. A pasteur pipette may be used to remove the liquid from the tube. 1mL of 70% ethanol may be added to the 1.5mL tube, or 3mL of 70% ethanol may be added to the GENTLEMACS C tube. The tube may be incubated for 30 seconds at room temperature. The liquid can be removed from the tube using a Pasteur pipette without damaging or damaging the roll. 1mL of 50% ethanol may be added to a 1.5mL tube, or 3mL of 50% ethanol may be added to a GENTLEMACS C tube. The tube may be incubated for 30 seconds at room temperature. The liquid can be removed from the tube using a Pasteur pipette without damaging or damaging the roll. 1mL of nuclease-free water may be added to the tube. The tube may be incubated for 30 seconds at room temperature. 1mL of Phosphate Buffered Saline (PBS) may be added to the tube, and the tube may be kept on ice. The sample may then be subjected to a lump-pestle dissociation or via GENTLEMACS OCTO disruptors.

EXAMPLE 16 pestle dissociation of FFPE tissue

PBS buffer can be removed from the 1.5mL tube using a pasteur pipette without damaging the rolls. As shown in FIG. 50A, 100. Mu.L of the dissociating enzyme mixture may be added to the tube. As shown in fig. 50B, the tissue roll can be dissociated using a 1.5mL lump pestle to break the roll into smaller pieces. 900 μl of the dissociating enzyme mixture can be added to the tube while the top of the pestle is rinsed into the tube to collect any additional pieces of tissue that adhere to the pestle. A pipette may be used to mix the tissue pieces with the dissociating enzyme mixture. The tube may be incubated at 37℃for 45 minutes at 800rpm in a hot mixer. Alternatively, the tube may be incubated for 45 minutes at 37 ℃. The samples may be mixed at 15 minute intervals via inversion. Using a1,000 μl pipette, the tissue pieces can be dispersed by pipetting 10 to 20 times. Optionally, the sample may be aspirated by pushing the tissue sheet and solution through a 23 gauge needle five or more times to increase cell recovery. The suspension can pass through a preseparation filter (30 μm) placed on a 5 or 15mL tube placed on ice. The initial tube and filter may each be rinsed with 1mL of chilled PBS to minimize cell loss. The filtrate may be collected in 5 or 15mL tubes. Tubes containing cell suspensions may be centrifuged at 850rcf for 5 minutes at 4 ℃. The supernatant can be removed without disturbing the pellet. The pellet may be resuspended in 0.5mL of cooled tissue resuspension buffer or quench buffer. The mixture may be pipetted five or more times to mix and remain on ice. An automatic cell counter (e.g., countess II or Cellaca MX) or a cytometer may be used to determine the cell concentration of the fixed tissue sample. The sample can be immediately used for analysis as described elsewhere herein.

EXAMPLE 17 GENTLEMACS OCTO A dissociator for FFPE tissue

PBS can be removed from GENTLEMACS C tubes without damaging the rolls. 2mL of the dissociating enzyme mixture may be added to the tube and the tube may be securely closed. As shown in fig. 51, the tube can be placed in GENTLEMACS OCTO a dissociator, heting units applied and GENTLEMACS PROGRAM 37c_ffpe_1 run. The program may run for about 48 minutes. At the end of the procedure, the tube may be removed from GENTLEMACS OCTO detachers and visually inspected to ensure that the rolls have detached. The tube may be centrifuged at about 300rcf for 1min, and the resulting pellet may be resuspended in supernatant. The suspension was passed through a preseparation filter (30 μm) placed on a 15mL tube on ice. The original GENTLEMACS tubes and filters can be rinsed with 2mL of chilled PBS to minimize cell loss. The filtrate may be collected in a tube. The cell suspension may be centrifuged at 850rcf for 5min at 4 ℃. The supernatant can be removed without disturbing the pellet. The pellet may be resuspended in 0.5mL of cooled tissue resuspension buffer or quench buffer. The pellet may be mixed with the buffer by pipetting the mixture five or more times while on ice. An automatic cell counter (e.g., countess II or Cellaca MX) or a cytometer may be used to determine the cell concentration of the fixed tissue sample. The sample can be immediately used for analysis as described elsewhere herein.

EXAMPLE 18 counting Using ethidium homodimer-1

Samples can be counted using ethidium homodimer-1 and a Countess II FL automated cell counter (red fluorescent protein (RFP) light cube) to accurately quantify even in the presence of subcellular fragments. The cell concentration of Countess may be between 1,000 and 4,000 cells/μl. The sample may be vortexed, briefly centrifuged, and diluted to about 1:100. mu.L of ethidium homodimer-1 can be aliquoted into tubes. The samples may be gently mixed and 10 μl may be added to the tube containing ethidium homodimer-1. 10 μl of the sample can be transferred to a Countess II cell count slide chamber. Slides may be inserted into the Countess II FL cell counter. The sample may be an image of RFP setup using fluorescent illumination and filtering. A bright field mode may be used to confirm that no large agglomerates are present. The RFP positive concentration can be measured and the cell concentration can be determined by multiplying the measured concentration by two. Alternatively, a Cellaca counter may be used to count samples stained with ethidium homodimer-1.

EXAMPLE 19 counting cells Using Propidium Iodide (PI) staining solution

Propidium Iodide (PI) may be used with Cellaca counters to count cells in a fixed tissue sample in the presence of subcellular debris. The cell concentration of Cellaca counter may be 100 to 10,000 cells/μl may be added to the mixed row of Cellaca plates with 25 μl of PI staining solution. The fixed tissue samples may be gently mixed. Depending on the concentration, sample 1:1 may be diluted with PBS, e.g., 15. Mu.L of PBS may be added to 15. Mu.L of sample. 25. Mu.L of sample can be added to the mix row containing the position of the PI staining solution. The staining solution and sample may be gently mixed 8 or more times. The stained samples may be transferred to a loading row of Cellaca plates. PI channels can be used to count cell numbers. Alternatively, a Countess II FL automatic cell counter may be used to count samples stained with PI. Fig. 52A shows an example image of the cells after dissociation. After dissociation, the cell sample may contain some debris. Further treatment, such as post-hybridization washing, may reduce fragmentation, as shown in FIG. 52B.

FIG. 53 shows exemplary cell yield data for various samples treated with a pestle dissociation and samples treated with GENTLEMACS OCTO dissociators.

EXAMPLE 20 fixed sample storage

Fixed samples, such as dissociated cells resuspended in a quenching buffer or tissue resuspension buffer, may be stored for short or long periods of time. Short term storage may include storage at 4 ℃. The frozen enhancer may be thawed at 65 ℃ for 10 minutes. The thawed enhancer may be vortexed and briefly centrifuged. The enhancer may be added to the immobilized sample at a dilution factor of 1:10, for example, by adding 50 μl of enhancer to 500 μl of quench buffer or tissue resuspension buffer solution of the immobilized sample. The samples can be stored at 4 ℃ for up to one week.

For long term storage, enhancers may be added to the sample at the same 1:10 dilution factor. 50% glycerol can be added to the sample with a final glycerol concentration of 10%. For example, 137.5 μl of 50% glycerol can be added to 550 μl of quenching buffer for the immobilized sample or tissue resuspension buffer solution containing the enhancer. The samples may be stored at-80 ℃ for up to six months.

The sample may undergo a color change (e.g., black, light gray, or green) during storage. Samples stored at-80 ℃ can be thawed at room temperature until no ice is present. The thawed samples may be centrifuged at 850rcf for 5 minutes at room temperature. The supernatant can be removed without disturbing the pellet. The cell pellet may be resuspended in 0.5ml of 0.5x PBS containing 0.02% Bovine Serum Albumin (BSA). In one example, 0.2U/. Mu.L of RNase inhibitor may be used. The sample may be stored on ice. The cell concentration of the immobilized sample can be determined as described elsewhere herein, and the sample can be immediately used for analysis as described elsewhere herein.

The methods described herein can be used to analyze single cell FFPE gene expression to detect biology preserved in obtained FFPE tumor samples. The method can be used for promoting the development of disease progression and therapeutic targets. For example, the sample preparation described herein may produce dissociated cells that may be sufficient for determining the transcriptome of tissue that has been preserved as at least FFPE tissue, thereby allowing analysis of the preserved tissue.

While preferred embodiments of the present invention have been shown and described herein, it will be obvious to those skilled in the art that such embodiments are provided by way of example only. It is not intended that the invention be limited to the specific examples provided within the specification. While the invention has been described with reference to the foregoing specification, the descriptions and illustrations of the embodiments herein are not intended to be construed in a limiting sense. Numerous variations, changes, and substitutions will now occur to those skilled in the art without departing from the invention. Furthermore, it should be understood that all aspects of the invention are not limited to the specific depictions, configurations or relative proportions set forth herein which depend upon a variety of conditions and variables. It should be understood that various alternatives to the embodiments of the invention described herein may be employed in practicing the invention. It is therefore contemplated that the present invention shall also cover any such alternatives, modifications, variations or equivalents. It is intended that the following claims define the scope of the invention and that methods and structures within the scope of these claims and their equivalents be covered thereby.

Claims

1. A method for analyzing embedded immobilized tissue nucleic acid molecules, comprising:

(a) Providing the embedded fixed tissue, wherein:

(i) The embedded fixed tissue comprises a plurality of cells,

(Ii) One cell of the plurality of cells comprises the nucleic acid molecule, and

(Iii) Embedding the embedded fixed tissue in a solid medium;

(b) Removing at least a portion of the solid medium from the embedded fixed tissue, thereby obtaining a fixed tissue comprising the plurality of cells;

(c) Dissociating the immobilized tissue into cells of the plurality of cells, wherein the cells comprise cells comprising the nucleic acid molecule, and

(D) In the cell, a barcode molecule is used to generate a barcoded nucleic acid molecule comprising a barcode sequence and a sequence corresponding to the nucleic acid molecule.

2. The method of claim 1, wherein the fixed tissue is fixed at least about 1 year prior to (a).

3. The method of claim 2, wherein the fixed tissue is fixed at least about 5 years prior to (a).

4. A method according to any one of claims 1 to 3, further comprising adding a first solvent to the embedded fixed tissue in (b) to dissolve the solid medium to obtain tissue.

5. The method of claim 4, wherein the first solvent is a non-polar solvent.

6. The method of claim 5, wherein the non-polar solvent comprises xylene.

7. A process as set forth in claim 5 wherein said nonpolar solvent comprises neo-waterXylene substitutes.

8. The method of claim 4, further comprising removing the first solvent from the tissue.

9. The method of claim 8, further comprising adding a second solvent to the tissue.

10. The method of claim 9, wherein the second solvent is a polar solvent.

11. The method of claim 10, wherein the polar solvent comprises ethanol.

12. The method of claim 9, further comprising removing the second solvent from the tissue.

13. The method of claim 12, further comprising adding a rehydrating agent to the tissue.

14. The method of claim 13, wherein the rehydrating agent comprises water.

15. The method of claim 13, further comprising removing the rehydration agent from the tissue.

16. The method of claim 15, further comprising adding a buffer to the tissue.

17. The method of claim 16, further comprising removing the buffer from the tissue.

18. The method of claim 17, further comprising dissociating the tissue in (c).

19. The method of claim 18, further comprising resuspending the tissue in a supernatant.

20. The method of claim 19, further comprising filtering the tissue.

21. The method of claim 20, further comprising washing the tissue.

22. The method of claim 21, further comprising resuspending the tissue.

23. The method of claim 18, wherein said dissociating the tissue is performed using an automated dissociator.

24. The method of claim 23, wherein the automatic ionizer is a GENTLEMACS ^TM Octo ionizer.

25. The method of claim 18, wherein said dissociating the tissue is performed using a manual morcellator.

26. The method of any one of claims 1-25, wherein (b) and (c) provide a cell yield of at least about 1x 10 ⁵ cells per two 25 micron sections of the fixed tissue.

27. The method of any one of claims 1 to 26, wherein the fixed tissue comprises fixed human tissue.

28. The method of claim 27, wherein the fixed tissue comprises fixed connective tissue, fixed epithelial tissue, fixed organ tissue, fixed muscle tissue, fixed ligament, fixed tendon, fixed skin tissue, fixed breast tissue, fixed bladder tissue, fixed kidney tissue, fixed liver tissue, fixed colon tissue, fixed thyroid tissue, fixed cervical tissue, fixed prostate tissue, fixed lung tissue, fixed heart tissue, fixed muscle tissue, fixed pancreatic tissue, fixed anal tissue, fixed bile duct tissue, fixed bone marrow, fixed uterine tissue, fixed ovarian tissue, fixed endometrial tissue, fixed vaginal tissue, fixed vulval tissue, fixed stomach tissue, fixed ocular tissue, fixed nasal tissue, fixed sinus tissue, fixed penile tissue, fixed salivary gland tissue, fixed intestinal tissue, fixed gall bladder tissue, fixed gastrointestinal tissue, fixed bladder tissue, fixed brain tissue, fixed spinal column tissue, fixed neurons, fixed brain blood representative of one or more of fixed blood, fixed blood representative of fixed hair, fixed horn, fixed collagen, or a plurality of fixed blood, or a plurality of fixed collagen.

29. The method of any one of claims 1 to 28, wherein (d) comprises hybridizing the barcode molecule to the nucleic acid molecule.

30. The method of claim 29, further comprising extending the barcode molecule to form the barcoded nucleic acid molecule.

31. The method of claim 30, wherein the extending comprises using an enzyme.

32. The method of claim 30, wherein the extending comprises ligating the barcode sequence to a probe.

33. The method of claim 32, wherein the probe comprises a moiety at least partially complementary to the nucleic acid molecule.

34. The method of any one of claims 1 to 33, further comprising partitioning the nucleic acid molecule into partitions prior to (d).

35. The method of claim 34, wherein the partition is one of a plurality of holes.

36. The method of claim 34, wherein the partition is one of a plurality of droplets.

37. The method of claim 36, wherein the plurality of droplets comprise oil droplets.

38. The method of claim 34, wherein the partition comprises a support.

39. The method of claim 38, wherein the support comprises beads.

40. The method of claim 38, wherein the barcode sequence is attached to the support.

41. The method of any one of claims 1 to 40, wherein the solid medium comprises paraffin.

42. The method of claim 41, wherein the fixed tissue comprises formalin-fixed tissue embedded in said paraffin.

43. The method of any one of claims 1 to 42, wherein the nucleic acid molecule comprises a ribonucleic acid (RNA) molecule.

44. The method of claim 43, wherein the nucleic acid molecule comprises a messenger RNA molecule.