Sample-multiplexing approaches for single-cell sequencing

Zhang, Yulong; Xu, Siwen; Wen, Zebin; Gao, Jinyu; Li, Shuang; Weissman, Sherman M.; Pan, Xinghua

doi:10.1007/s00018-022-04482-0

Sample-multiplexing approaches for single-cell sequencing

Review
Published: 05 August 2022

Volume 79, article number 466, (2022)
Cite this article

Download PDF

Access provided by Autonomous University of Puebla

Cellular and Molecular Life Sciences Aims and scope Submit manuscript

Sample-multiplexing approaches for single-cell sequencing

Download PDF

Yulong Zhang^1,2,3,
Siwen Xu^1,2,4,
Zebin Wen^1,2,
Jinyu Gao^1,2,
Shuang Li³,
Sherman M. Weissman⁵ &
…
Xinghua Pan ORCID: orcid.org/0000-0002-7421-8155^1,2,3

3812 Accesses
13 Citations
16 Altmetric
1 Mention
Explore all metrics

Abstract

Single-cell sequencing is widely used in biological and medical studies. However, its application with multiple samples is hindered by inefficient sample processing, high experimental costs, ambiguous identification of true single cells, and technical batch effects. Here, we introduce sample-multiplexing approaches for single-cell sequencing in transcriptomics, epigenomics, genomics, and multiomics. In single-cell transcriptomics, sample multiplexing uses variants of native or artificial features as sample markers, enabling sample pooling and decoding. Such features include: (1) natural genetic variation, (2) nucleotide-barcode anchoring on cellular or nuclear membranes, (3) nucleotide-barcode internalization to the cytoplasm or nucleus, (4) vector-based barcode expression in cells, and (5) nucleotide-barcode incorporation during library construction. Other single-cell omics methods are based on similar concepts, particularly single-cell combinatorial indexing. These methods overcome current challenges, while enabling super-loading of single cells. Finally, selection guidelines are presented that can accelerate technological application.

Applications of Single-Cell Sequencing for Multiomics

Single-cell transcriptomics: background, technologies, applications, and challenges

Article 30 April 2024

Single-cell sequencing techniques from individual to multiomics analyses

Article Open access 15 September 2020

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

Cells are the basic structural and functional units of life, and multicellular organisms are composed of individual cells with varied gene expression profiles and functions. Single-cell RNA sequencing (scRNA-seq) has been developed rapidly since its inception [1]. In contrast to traditional bulk transcriptional profiling, scRNA-seq enables transcriptomic analysis of numerous individual cells in parallel [2,3,4], thereby helping to elucidate cellular heterogeneity and characterize rare cells in research related to tumors [5, 6], stem cells [7], the immune microenvironment [8], neurobiology [9,10,11], reproduction [12], and embryonic development [13,14,15,16,17]. Commercial scRNA-seq platforms such as 10 × Genomics Chromium [18] and BD Rhapsody [19, 20] routinely provide high throughput (10³–10⁵ cells) [21], which has catalyzed the development of numerous cell atlas projects, such as The Human Cell Atlas [22], The Fly Cell Atlas [23], and Plant Cell Atlas [24].

While the throughput has increased and the per-cell cost has fallen with rapid advancements in scRNA-seq, several major challenges still remain when processing multiple samples, which impede further adoption and evolution. In general, performing scRNA-seq for multiple individual samples is labor-intensive and requires expensive reagents and consumables. In the early stage, scientists eliminated false singlets (negatives and multiplets) subjectively using the unique molecular identifier (UMI) “cut-off,” resulting in the retention of false single cells and the loss of true cells with RNA contents that are too high or low. Afterward, more and more R packages have been developed to remove negatives or multiplets, such as scrublet [25], doubletFinder [26], and scuttle. However, these statistical methods can only partially solve the problem. Additionally, technical batch effects can mask biological signals when scRNA-seq profiles are integrated [27]. Fortunately, effective solutions to mitigate these challenges have been successfully applied.

Here, we principally discuss sample-multiplexing approaches for scRNA-seq benefiting from bioinformatic and biochemical techniques. These methods overcome common challenges facing scRNA-seq and enable the super-loading of single cells in a single run. More single cells can be obtained by super-loading, which is especially important for the detection and characterization of some rare cell subsets. However, uncontrolled overloading when too many samples are multiplexed will lead to an ultra-high rate of multiplets, which although can be removed when the data is obtained but will consume sequencing power. Although scRNA-seq is the most sophisticated solution, other methods of single-cell omics are gradually being developed with comparable strengths in terms of their applications. Therefore, we also discuss other sample-multiplexing methods, such as single-cell assay for transposase-accessible chromatin with high-throughput sequencing (scATAC-seq), single-cell whole-genome sequencing (scWGS), single-cell DNA-methylation analysis, single-cell high-throughput chromosome conformation capture (scHi-C), and single-cell multi-omics (scMulti-omics) (Fig. 1, Table 1). Finally, we provide selection guidelines and application models for these multiplexing methods.

Table 1 Characteristics of sample-multiplexing approaches used for single-cell sequencing

Full size table

Sample multiplexing for single-cell transcriptome sequencing

According to the central dogma of molecular biology, RNA bridges the gap between DNA and proteins, thereby reflecting the nature and genetic profiles of cells. Proteins are difficult to amplify and sequence, while high-throughput scWGS and scWES, requiring a large amount of sequencing, are too expensive to apply. Therefore, scRNA-seq is currently the most popular single-cell omics tool. Here, we present five categories of scRNA-seq sample-multiplexing approaches (Fig. 2).

Natural genetic variation

Demultiplexing scRNA-seq data by genetic variations with reference genotypes

Genetic variation, particularly single-nucleotide polymorphisms (SNPs), distinguishes different individuals, qualifying SNPs as natural barcodes for assigning cells to donors and identifying doublets from different donors (Fig. 2A). Multiple methods for demultiplexing cells from different individuals based on SNPs have been developed in recent years. Kang et al. [28] introduced demuxlet, the first method for harnessing genetic barcodes to identify cells. The demuxlet method assigns each cell to its individual source by calculating the maximum likelihood of obtaining RNA-seq reads with overlapping SNP sets, the genotypes of which are given in parallel as a reference for each individual. Using simulated data, demuxlet showed that 50 genetic variations are sufficient to assign cells for up to 64 individuals. In the same study, the performance of demuxlet was evaluated by analyzing pooled peripheral blood mononuclear cells (PBMCs) from eight patients with lupus.

Demultiplexing scRNA-seq data by genetic variations without reference genotypes

While the above-mentioned methods enable demultiplexing of mixed samples, they require additional genotyping information to assign individual cells back to the donors. These factors limit the utility of genotype-specific methods, as genotype data might be unavailable, or insufficient tissue may be available for DNA extraction. Recently, approaches were developed that do not require a genotype reference. Xu et al. [29] introduced scSplit, a hidden-state model approach for demultiplexing individual samples from mixed scRNA-seq data without extra genotype information. ScSplit identifies putative variant sites from scRNA-seq data and models the allelic counts into clusters using an expectation–maximization framework. This tool requires only FASTQ files and a cell barcode list as inputs. Huang et al. [30] presented Variational Inference for Reconstructing Ensemble Origins (Vireo), a principled Bayesian method for demultiplexing pooled samples independently of the genomic information. Vireo can also leverage genotype information when available (termed Vireo-GT). They evaluated the performance of Vireo-GT for singlet assignment. As expected, Vireo-GT performed slightly better than Vireo when using 16 genetically distinct scRNA-seq samples (AUC_Vireo = 0.999, AUC_Vireo-GT = 1.000). More recently, the computational tool, Souporcell, was designed for clustering scRNA-seq data by genotype without reference genotypes [31]. Souporcell fits a mixture model using a deterministic annealing variant of the expectation–maximization algorithm to cluster cells. The advantage of mixture model clustering over hard clustering is that cells can be assigned to different clusters, benefiting both doublet calling and ambient RNA estimations. Souporcell achieved high accuracy in terms of genotype clustering, doublet detection, and ambient RNA estimation. This method even surpassed demuxlet (the gold-standard method requiring genotype information a priori) in cell assignments and doublet accuracies. When more challenging scenarios involving multiple cell types are involved, 21 cells that demuxlet had labeled as maternal or fetal appeared in the cell clusters of the counterpart individual by Souporcell. Based on the reference genotypes, they are of an error by demuxlet assignments rather than that by Souporcell. Notably, Souporcell includes a low signal-to-noise ratio due to decreased numbers of UMIs per cell and high numbers of donors, resulting in increased local maxima.

Nucleotide-barcode anchoring on cellular or nuclear membranes

Oligo-tagged antibodies

Cells with higher expression levels of specific surface proteins can be labeled with more corresponding antibodies. Therefore, the cellular indexing of transcriptomes and epitopes by sequencing (CITE-seq) [32] and RNA-expression and protein-sequencing assay (REAP-seq) [33] approaches can simultaneously measure gene- and protein-expression levels in single cells with oligo-tagged antibodies. These antibodies link ssDNA instead of the fluorophores applied to Fluorescence Activating Cell Sorter (FACS), and barcoded ssDNA integrated with antibodies will be captured along with mRNAs. Stoeckius et al. used oligo-tagged antibodies (hashtags) that bound to ubiquitous cell-surface membrane proteins to mark and pool experimental samples, demultiplexed the data according to the barcoded antibody signals, and assigned cells to the original samples (Fig. 2B). This method was named “Cell Hashing,” as it is based on the concept of hash functions in computer science to index datasets with specific features. Eight human PBMC samples were labeled, pooled, and analyzed [34]. Class-I major histocompatibility complex, β-2-microglobulin, and the sodium–potassium ATPase-subunit CD298 are broadly expressed on the surface of human cells [34], and combining these two corresponding antibodies fulfills the purpose of double assurance for labeling all cells successfully.

The performance of single-nucleus RNA sequencing (snRNA-seq) has been compared with that of scRNA-seq [35,36,37]. Although snRNA-seq usually captures fewer genes, it has significant advantages in addressing complex tissues that are challenging to dissociate or are frozen, or comprise large or irregularly shaped cells, while avoiding bias caused by the loss of certain cells during single-cell suspension preparation [38,39,40]. Following Cell Hashing, Nucleus Hashing was developed for multiplex snRNA-seq and to profile human brain cortex samples [41]. The Nucleus Hashing approach utilizes DNA-barcoded antibodies to target the nuclear pore complex, which has a relatively conserved sequence between species and expands the application range.

Commercial oligo-tagged antibodies produced by BioLegend and BD are currently available and should streamline such research. However, some cells may not express the nearly ubiquitous targeted surface proteins, resulting in labeling and decoding failures.

Lipid-tagged indices

Lipid and cholesterol are basic cell-membrane components that are required for all life. Lipid- and cholesterol-modified oligonucleotide (LMOs and CMOs, respectively) scaffolds, used as “anchors,” can be rapidly and stably integrated into live cell membranes [42]. McGinnis et al. adapted LMOs and CMOs into MULTI-seq, a multiplexing method for scRNA-seq and snRNA-seq using lipid-tagged indices. Briefly, conjugation with a 5′ lignoceric acid amide and a 3′ palmitic acid amide increased the hydrophobicity of LMOs and enabled them to stably associate with membranes. MULTI-seq sample barcodes are constituted of a 3′ poly-A capture sequence, an 8-bp sample barcode, and a 5′ polymerase chain reaction (PCR) handle. In their study, McGinnis et al. used MULTI-seq to reveal the dynamics of T-cell activation with Jurkat cells, to perform a 96-plex perturbation assay with primary human mammary epithelial cells, and to perform multiplex analysis of cryopreserved primary tumors and metastatic lungs dissected from the patient-derived xenograft (PDX) mice of triple-negative breast cancer [43].

For practical applications, one type of LMO scaffold is sufficient to assemble with different DNA barcodes, increasing the method’s capability for multiplexing numerous samples in parallel. Furthermore, this method is not limited by species or genetic background differences and has little impact on cell viability and endogenous gene expression patterns because the labeling is extremely mild and requires only 5 min.

Concanavalin A (ConA)-based sample barcoding

Glycoproteins are ubiquitous on the plasma membrane. Based on the glycoprotein-binding ability of Concanavalin A, a ConA-based sample-barcoding strategy (CASB) was developed by Fang et al. Three components, i.e., biotinylated single-stranded DNA (ssDNA; as barcodes), streptavidin, and biotinylated Concanavalin A, self-assemble into the CASB complex. CASB has been leveraged to dissect the transcriptomic dynamics of MDA-MB-231 cells perturbed with five compounds and to demonstrate the IFN-γ-mediated epigenomic and transcriptomic changes in HAP1 cells by combining scRNA-seq and scATAC-seq [44].

CASB enables cell and nucleus labeling independently of the genetic background. In addition, the CASB complex binds to the target quickly and stably, even at low temperatures, without degrading the sample integrity. ssDNA barcode synthesis and complex assembly are extraordinarily convenient, flexible, and easy to be adapted to different single-cell sequencing workflows. Furthermore, CASB combines simultaneously combinatorial barcoding and sequential split-pool barcoding, making it capable of multiplexing more samples and further eliminating multiplets than previous methods.

ClickTags

Methyltetrazine-modified oligonucleotides can be coupled with surface proteins on methanol-fixed cells via a two-step chemical reaction using inverse electron-demand Diels–Alder chemistry and the heterobifunctional amine-reactive cross-linker NHS-trans-cyclooctene. Using these modified oligonucleotides as “ClickTags,” Gehring et al. achieved highly multiplexed scRNA-seq with fixed cells. They applied this strategy in a 96-plex perturbation experiment using neural stem cells treated with various concentrations of decitabine and Scriptaid, epidermal growth factor and basic fibroblast growth factor, retinoic acid, and bone morphogenic protein 4 [45].

However, the method is currently only suitable for methanol-fixed cells (rather than live cells) because competitive NHS-ester hydrolysis in aqueous buffer tends to result in a poor signal-to-noise ratio with scRNA-seq libraries.

Nucleotide-barcode internalization to the cytoplasm or nucleus

Liposomal transfection with short barcoding oligos

Similar to bio-membranes, liposomes are widely used as vectors to transport drugs and exogenous nucleic acids into cells in research and clinical applications. Shin et al. established a universal sample-barcoding method using liposomal transfection with short barcode oligonucleotides (SBOs; Fig. 2C). SBOs consisting of a barcode and a poly-A sequence were implemented to label various samples. They applied this method in a 5-plex time-course experiment using K562 cells treated with a BCR-ABL-targeting drug (imatinib) and in a 48-plex assay performed to screen 45 selected drugs [46].

This method is characterized by a simple experimental process and low-cost sample labeling. SBO synthesis is flexible and convenient, and the number of labeled samples can be largely expanded; therefore, this strategy is very suitable for conducting numerous parallel experiments, such as broad-spectrum drug-screening assays. As this method is based on liposomal transfection, it can be universally adapted to various species and potentially applied to the nucleus. However, additional perturbations may occur when SBOs are transfected using Lipofectamine 3000 at 4 h before library construction.

Staining nuclei with polyadenylated ssDNA oligonucleotides

Evidence suggests that ssDNA selectively diffuses into the nuclei of permeabilized cells instead of intact cells (Fig. 2C). Therefore, Srivatsan et al. used a straightforward approach with unmodified polyadenylated ssDNA oligonucleotides (hash oligos) to mark nuclei followed by single-cell combinatorial indexing RNA-seq (sci-RNA-seq). With this nuclear hashing strategy (known as the “sci-Plex” strategy), the combination of two oligos (one for plates and another for wells) contributed to the exponential scalability of sample indexing. Using sci-Plex, they acquired data from approximately 650,000 single cells across nearly 5000 independent samples in one experiment by screening three cancer cell lines exposed to 188 compounds at different doses [47].

The labeling capacity of sci-Plex is extremely high. Nonetheless, it functions only in the nuclei and remains limited by significantly low mRNA-capture efficiency in sci-RNA-seq.

Vector-based barcode expression in cells

Lentiviral barcodes facilitate cell tagging, pooling, and tracking

In previous studies, virus transduction along with high-throughput scRNA-seq was successfully used for lineage tracing. Subsequently, Guo et al. adapted this method to develop a novel lentiviral barcode-based multiplexing approach, called CellTag Indexing (Fig. 2D). With the design, 8-nt CellTags representing distinct sample indices are located in the 3′ untranslated region of the green fluorescent protein gene (which enables measurement of the transduction efficiency), followed by an SV40 polyadenylation signal sequence. Barcoded viruses are used to transduce the cells to be tagged. Guo et al. used this method to profile cultured cells in vitro and track cell engraftment and differentiation in vivo [48].

CellTag Indexing has been used to successfully achieve long-term stable tracing and multiplexing of live cells. However, it is incompatible with clinical samples and frozen tissues because CellTag barcodes need to be expressed in immortalized cells. Additionally, compared to other labeling technologies, the tagging method is time-consuming due to the need for transducing cells with viruses and expressing viral RNA.

Clustered regularly interspaced short palindromic repeats (CRISPR) and short-hairpin RNA (shRNA) enable multiplexed single-cell screening

Genetic screens enable gene-function analysis and accelerate biological discoveries, while confronting trade-offs between the number of perturbations probed and the complexity of phenotypes evaluated. To bridge this gap, Dixit et al. developed Perturb-seq by combining scRNA-seq and CRISPR-based perturbations. In their experiments, cells were infected with a pool of lentiviruses that expressed single-guide RNAs and guide barcodes to be captured together with mRNA during reverse transcription. They applied Perturb-seq to profile 200,000 cells, focusing on transcription factors that regulate the responses of dendritic cells to lipopolysaccharide [49]. In a companion study, Perturb-seq was used with CRISPR interference to investigate the mammalian unfolded-protein response [50]. Similarly, Jaitin et al. developed CRISP-seq to dissect immune circuits [51], and Datlinger et al. developed CROP-seq (also known as CRISPR droplet sequencing) [52]. Consequently, Daniel and Allon co-authored a paper entitled, “Genetic screening enters the single-cell era” [53].

shRNA is more stable and lasts longer in cells than small interfering RNA and can be expressed from lentiviruses. Thus, Aarts et al. integrated scRNA-seq with a shRNA screen to investigate senescence-based restraint of cell-fate conversion when expression of the transcription factors OCT4, SOX2, KLF4, and c-MYC reprograms somatic cells into induced pluripotent stem cells. Their findings explained why mTOR plays a dual role and how it imposes opposing effects on reprogramming by regulating senescence [54].

A large number of exploratory conditions are analyzed in parallel with CRISPR or shRNA screens and high-throughput scRNA-seq. Therefore, CRISPR and shRNA screens are essentially a replacement of, or kind of alternative of sample-multiplexing strategies. In addition, different combinations of gene knockout can be used as different sample barcodes thus adapted to conventional sample multiplexing for scRNA-seq.

Incorporating nucleotide barcodes during library construction

Barcode assembly for targeted sequencing (BART-Seq)

Uzbas et al. developed BART-Seq as a sample-multiplexing technique for targeted transcriptome or genome sequencing at single-cell level or in bulk cells. Based on their design, in 96-well plates, forward and reverse primers for targeted genes were hybridized with partially complementary ssDNA barcodes, followed by fill-in reactions and strand removal, resulting in the formation of dual barcoded primers. The target regions of genes captured by these barcoded primers were subjected to PCR and next-generation sequencing. They used BART-Seq to profile thousands of single human pluripotent stem cells exposed to different maintenance media and activation of the Wnt/β-catenin pathway [55].

In terms of targeted sequencing, BART-Seq is cost-effective, highly suitable for transcriptome and genome analysis of genes of interest, and can be generalized for other omics methods in the future. Moreover, BART-seq can be used to analyze a broader range of RNA species (more than just mRNAs), including non-polyadenylated long non-coding RNAs. However, when detecting limited genes, BART-Seq fails to cover the whole transcriptome or genome. Furthermore, UMIs have not been adapted to BART-Seq, which hinders counting of the absolute number of transcripts.

Single-cell combinatorial fluidic indexing (scifi)-RNA-seq

Loading a single-cell suspension into a microfluidic device at very low concentrations reduces the probability that two cells enter the same droplet (i.e., a doublet) as much as possible, resulting in waste as most “empty emulsion droplets” enclose fully functional barcoded microbeads and reverse transcription reagents, but no cells, thereby failing to capture any valid single-cell transcriptomes. For example, ~ 90–99% of gel bead-in-emulsions contain no cells when generated using the 10 × Genomics Chromium Single Cell 3' Reagent Kit v2, with a pool of ~ 750,000 barcodes to separately index each cell’s transcriptome, and possibly more exist with the v3 and v3.1 kits, with ~ 3,500,000 10 × barcodes. Inspired by combined indexing protocols [10,11,12], Paul et al. established the ultra-high-throughput scifi-RNA-seq system. Whole-transcriptome preindexing (round 1) was conducted by capturing barcoded primers and conducting reverse transcription inside permeabilized cells or nuclei, in “split pools,” within 96-well or 384-well plates (Fig. 2E). Following preindexing, cells or nuclei were pooled, and standard droplet-based scRNA-seq was performed with overloading and another round of indexing (round 2); thus, most droplets encapsulated several cells or nuclei (Fig. 3). Subsequently, many effective droplets, including those containing multiplets or singlets, were obtained because the combination of the two barcodes (round 1 and round 2) reached a huge quantity that almost all single cells received a unique identity tag. Paul et al. applied scifi-RNA-seq in various human and mouse cell lines, primary human T cells, and a highly multiplexed CRISPR screen for T-cell receptor activation [56].

Using scifi-RNA-seq, 151,788 single cells were identified in four pooled human cell lines, a 15-fold increase over the recommended output on the 10 × Genomics Chromium system, further demonstrating that 1.53 million nuclei could be loaded into a single channel if necessary [54]. Undoubtedly, scifi-RNA-seq has built-in support for sample multiplexing because of the different reverse transcription primers used in the round-1 barcoding process. Additionally, scifi-RNA-seq technology can potentially function as an alternative system for scRNA-seq, including subnanoliter well plates, and also enhance the throughput of other single-cell omics sequencing approaches. Notably, permeabilized cells or nuclei (but not live cells) are required for scifi-RNA-seq.

Single-cell combinatorial indexing (sci)-RNA-seq and split-pool ligation-based transcriptome sequencing (SPLiT-seq)

In sci-RNA-seq, fixed and permeabilized cells (or nuclei) are implemented in two or three rounds of split-pool barcoding in 96- or 384-well plates (Fig. 3); therefore, large-scale combinatorial barcodes are sufficient to label a considerable number of single cells. Using this method, approximately 50,000 Caenorhabditis elegans cells (covering their somatic cell compositions) were investigated [57]. This technology has since been upgraded to sci-RNA-seq3, where a “mouse organogenesis cell atlas” was constructed within the critical window profiling of approximately 2 million cells [58]. Similar to sci-RNA-seq, SPLiT-seq involves four rounds of combinatorial barcoding (Fig. 3), and 156,049 single nuclei from mouse brains and spinal cords were analyzed [59].

When performing sci-RNA-seq or SPLiT-seq, each round of indexing gives sample multiplexing a chance, especially in round 1. In addition, neither method requires expensive platforms or regents. However, sci-RNA-seq and SPLiT-seq capture fewer genes and currently require more experimental time relative to popular commercial scRNA-seq technologies, which makes them less preferable options.

Sample multiplexing for single-cell epigenome and genome sequencing

Below, we present sample-multiplexing approaches for single-cell epigenome and genome sequencing together, which share considerable similarities.

Sample multiplexing for scATAC-seq

Chromatin accessibility is vital for regulating gene expression. ATAC-seq provides information regarding cis-acting DNA elements, including promoters, enhancers, and silencers, and the binding of trans-acting factors to DNA in open chromatin regions [60, 61]. Here, we introduce three strategies for sample multiplexing in scATAC-seq.

sci-ATAC-seq

Most single-cell technologies are based on the principle that single cells are individually compartmentalized, and the early library construction proceeds in independent biochemical reaction systems that each cell owns, making it difficult to simultaneously handle massive numbers of single cells per assay. Fortunately, single-cell combinatorial indexing as adapted to scATAC-seq (termed sci-ATAC-seq) overcomes this limitation. In this method, nuclei are molecularly tagged with barcoded Tn5 transposases in each of the wells, pooled, and then randomly redistributed into a second set of wells so that a second barcode is introduced during PCR (Fig. 3). Over 15,000 GM12878 and HL60 cells have been profiled using sci-ATAC-seq [62].

This method enables sample multiplexing but captures a relatively limited number of fragments from nucleosome-free regions per nucleus compared to common droplet-based scATAC-seq technology.

Droplet-based scATAC-seq (dscATAC-seq) combined with combinatorial indexing (dsciATAC-seq)

To boost the breadth and depth of chromatin accessibility profiling, Lareau et al. developed dscATAC-seq technology, where a microfluidic device was utilized to simultaneously encapsulate PCR reagents and barcoded beads into a single droplet with nuclei that were transposed using the Tn5 transposon to link sequencing adaptors into open chromatin regions. Based on this technology, they further designed dsciATAC-seq by combining dscATAC-seq with combinatorial indexing through barcoded transpositions, which is conceptually parallel to scifi-RNA-seq, thereby massively scaling up throughput (up to 105 single cells/experiment; Fig. 3). Subsequently, they applied dscATAC-seq or dsciATAC-seq to assay 46,653 cells from an adult mouse brain, or 136,463 resting and stimulated human bone marrow-derived cells, respectively [63].

dscATAC-seq produced high-quality data with 105 nuclear fragments/cell, while dsciATAC-seq increased the cell throughput tenfold using 24 transposon barcodes, with accessibility to larger output with 48 or 96 barcodes. Additionally, barcoded Tn5 transposon provides an opportunity to multiplex samples for scATAC-seq [63].

CASB for scATAC-seq

CASB has also been designed to suit sample multiplexing for both scATAC-seq and scRNA-seq. Indeed, the CASB barcodes remained abundant and showed minimal cross-contamination after a reaction of 1 h transposition at 37 °C, without influencing the epigenomic profiles. In the workflow, a 222-nt ssDNA barcode, with S5-ME and S7-ME adapter sequences attached by primers during scATAC-seq library amplification, flanked a sequence containing sample barcodes [44].

Wang et al. developed SNuBar to multiplex scATAC-seq and scATAC and RNA co-assay with unmodified oligonucleotides [64]. Furthermore, in Wang’s study, SNPs-based multiplexing was successfully used to verify the reliability of SNuBar, and a high correlation was acquired between two methods, which inspires us that sample multiplexing depending on natural genetic variation is also suitable for single-cell epigenome and genome sequencing.

Sample multiplexing for scWGS

Whole-genome sequencing facilitates genome assembly from various species and provides information pertaining to genetic variations, including single-nucleotide variants (SNVs), small indels, and copy-number variants (CNVs) [65]. scWGS is advancing in two directions: deep sequencing for detecting SNVs and shallow sequencing to identify CNVs and aneuploidy [66].

sci-ATAC-seq only targets open chromatin regions (1–4% of the genome), which is desirable for epigenetic profiling. However, this property or restriction is confusing when detecting single-cell CNVs because of biological bias and severely limits read counts (~ 3000/per cell). Therefore, Vitak et al. designed two strategies to unbind nucleosomes from genomic DNA without disturbing the nuclear integrity. One is lithium-assisted nucleosome depletion (LAND), which employs the chaotropic agent lithium diiodosalicylate to disrupt DNA–protein interactions, thereby releasing DNA from histones. The other is cross-linking with sodium dodecyl sulfate (xSDS), utilizing the detergent to denature histones and dissociate them from DNA (cross-linking is necessary before denaturation because SDS disrupts the nuclear integrity). They further developed a scWGS method for CNV detection incorporating combinatorial indexed sequencing, termed SCI-seq, which is highly consistent with sci-ATAC-seq during all experimental steps, except for the nucleosome removal step (Fig. 3). Subsequent assessment revealed substantially better coverage uniformity using xSDS than LAND. SCI-seq was used to analyze 16,698 single cells from cultured cell lines, primate frontal cortex tissue, and two human adenocarcinomas [66].

Mission Bio Tapestri platform enables simultaneous single-cell DNA and protein analysis using oligo-tagged antibodies provided by BioLegend. Thus, we suggest that this strategy may be applicable to multiplexed scDNA-seq according to the concept of Cell Hashing.

Sample multiplexing for single-cell DNA-methylation analysis

DNA methylation regulates gene expression by recruiting relevant proteins or by suppressing transcription factor binding to DNA, and 5-methylcytosine is the most common type of DNA methylation in plants and animals [67, 68].

Based on the principle that sodium bisulfite (BS) converts cytosine (but not methylcytosine) to uracil in genomic DNA [69], DNA methylation can be detected at single-base resolution from a limited fraction of the genome to the whole-genome scale using bisulfite sequencing (WGBS) [70], in bulk or at single-cell level (scWGBS) [71,72,73,74,75,76]. To increase the cell-count throughput of scWGBS, Mulqueen et al. established sci-MET, which is highly similar to SCI-seq, with the two largest differences occurring during library construction (i.e., transposition with adaptors depleted of cytosines and BS conversion; Fig. 3). They applied sci-MET to discriminate three human cell lines (after mixing them) and to analyze mouse cortical tissue, which mainly comprised excitatory and inhibitory neuronal cells [77].

Sample multiplexing for scHi-C

Hi-C helps shed light on three-dimensional genome architectures and DNA interactions in eukaryotes. For example, topologically associating domains are visible by Hi-C analysis and frequently involve dynamic promoter–enhancer interactions that influence gene expression [78,79,80,81].

As a type of single-cell combinatorial-indexing technology, sciHi-C has successfully bridged the gap between high-throughput and single-cell chromosome conformation analyses. These experimental procedures were derived mostly from traditional Hi-C assays (such as fixation, digestion, proximity ligation, affinity purification, and library amplification), while utilizing biotinylated bridge adaptors and custom-barcoded Illumina Y-adaptors to link the first and second round of barcodes in two 96-well plates, respectively, mirroring a combinatorial indexing design (Fig. 3). In the proof of concept, sciHi-C was leveraged to generate six libraries including 10,696 single cells from mixed mouse cells (primary mouse embryonic fibroblasts and the “Patski” embryonic fibroblast line) and human cells (HeLa S3, HAP1, K562, and GM12878) [82].

Sample multiplexing for scMulti-omics

Multiomics considers multiple types of molecules simultaneously and generates a comprehensive understanding of biological processes [83, 84]. Here, we introduce some sample-multiplexing ideas for scMulti-omics.

First, Cell Hashing plus CITE-seq (or REAP-seq) enables researchers to pool samples and to simultaneously quantitate mRNA transcripts and proteins in the same cells [32,33,34]. Furthermore, REAP-seq can be merged with expanded CRISPR-compatible cellular indexing of transcriptomes and epitopes by sequencing (ECCITE-seq), which simultaneously profiles proteins, mRNAs, T cell receptors, and CRISPR perturbations [85]. Secondly, sci-CAR jointly profiles single-cell chromatin accessibility and mRNA (CAR) with an inherent ability of combinatorial indexing for sample multiplexing [86] (Fig. 3). Additionally, using the 10 × Genomics Chromium microfluidics platform, Mimitou et al. developed ATAC with select antigen profiling by sequencing (ASAP-seq) to detect proteins, chromatin accessibility, and mitochondrial DNA mutations in thousands of single cells, and gene-expression levels (beyond those three levels) can be measured via DOGMA-seq (named for DNA, RNA, and protein spanning the central dogma of gene regulation) [87]. Similarly, Swanson et al. developed integrated cellular indexing of chromatin landscape and epitopes (ICICLE-seq) and TEA-seq (named for transcriptomics, epitopes, and chromatin accessibility) [88]. Both teams’ methods are amenable to sample multiplexing with barcoded antibodies. Finally, theoretically natural genetic variations can be used to demultiplex scMulti-omics profiles derived from mixed non-isogenic samples.

Most of the multiplexing strategies reviewed above can potentially be integrated with various scMulti-omics research techniques to meet experimental needs.

Selection guidelines and applications of sample-multiplexing approaches for single-cell sequencing

The suitability of a given sample-multiplexing method depends on several factors, namely:

(1)
whether the focus of the study is on transcriptomics, epigenomics, genomics, or multiomics;
(2)
experimental materials, e.g., Cell Hashing is designed for live cells, ClickTags for fixed cells, Nucleus Hashing and sci-Plex for nuclei, and MULTI-seq and CASB for both cells and nuclei;
(3)
the number of samples pooled together, e.g., oligo-tagged antibodies is sufficient for labeling a low number of samples, whereas some sci technologies can label thousands of samples;
(4)
the cell count per sample and the median number of genes captured per cell. A limited number of cells per sample with a low median number of captured genes/cell may be acquired using sci-Plex, sci-RNA-seq, or SPLiT-seq, despite their capacity for analyzing a massive number of samples;
(5)
research needs. The CRISPR system and shRNAs are appropriate tools for screening gene functions broadly, whereas CellTag is a favorable choice for studying stem cell differentiation; and
(6)
extra disturbances. Various labeling strategies and experimental times result in different levels of extra disturbances. For instance, it takes 4 h to conduct liposomal transfection with short barcoding oligos (probably followed by severe impacts), whereas using natural genetic variations minimizes extra disturbances to almost zero without additional labeling procedures. In addition, the factors such as the cost, experimental platform, processing time, and labor are also well worth considering.

Some sample-multiplexing technologies among the mentioned have greatly facilitated single-cell studies. For the purpose of appropriate use, we still need to be aware of their respective technical highlights and potential caveats (Table 2). For example, demuxlet has the prominent advantage that avoids additional process of labeling, and additional damage of the cells, but is inapplicable when samples share the same genetic background or reference genotypes cannot be obtained. ScSplit, Vireo, and Souporcell have further reduced sequencing costs depending on machine learning rather than reference genotypes, yet researchers still need additional information when they assign cell clusters to the specific samples.

Table 2 Comparison of the highlights and caveats for frequently-used sample-multiplexing methods

Full size table

In general, Cell Hashing can label almost all cell types [89]. However, it has been found that combining the two corresponding antibodies (against CD298 and 2M2) is ineffective for labeling a few human tumor cell lines that have been cultured for multiple generations in vitro. It should be noted that the expression of membrane proteins used for sample barcoding in some stem cells is unclear. BioLegend’s antibody combination for mouse Cell Hashing performs poor at marking egg cells, some stem cells, C3 and B16-BL6 melanoma cells. In addition, attention should also be paid to the presence of membrane protein damage in samples that are sensitive to digestion with trypsin. Therefore, it may be a good choice to use FACS to verify whether cells can be labeled before Cell Hashing is applied, and different antibodies may be chosen to label a minority of special cell types.

Compared to Cell Hashing, Nucleus Hashing is more friendly to complex tissues that are challenging to dissociate or are frozen, and the labeling target, nuclear pore complex has a relatively conserved sequence between species thus expanding the application. Nevertheless, Nucleus Hashing generally shows a less effectiveness of nucleus labeling than lipid-tagged methods, due to the destruction of nuclear pore complex during cell lysis and nucleus isolation.

MULTI-seq and 10 × Genomics CellPlex break the limits of species and cell types, but requires high quality of single cell (or nucleus) suspension. 10 × Genomics has reminded on its website that FACS or Fluorescence Activating Nucleus Sorter (FANS) to purify substandard samples before or after labeling is needed to improve the ratio of signal to noise. Recently, Viacheslav et al. compared antibody- and lipid-based multiplexing methods for single-cell RNA-seq, and drew a conclusion that antibody-based hashing is a more efficient protocol on human cell lines and PBMCs, while lipid hashing delivers a better result on nuclei [90]. In addition, our laboratory finds that antibody-based hashing is more stable and effective than lipid hashing when cell viability is low or apparent debris exists in single cell suspension, while lipid hashing suffers from the tendency of cross-contamination which may be attributed to the fluidity of lipids.

Sample-multiplexing approaches for single-cell sequencing have numerous applications, including: (1) massive screening of drugs or genes, (2) studying stem cell differentiation and lineage tracing, (3) constructing spatiotemporal cell atlases for various species, (4) discovering and identifying rare cell subpopulations, and (5) exploring tumor heterogeneity and the immune microenvironment (Fig. 4). These applications are becoming flexible and convenient owing to the native characteristics of sample multiplexing, such as the ability to overload single cells and to reduce experimental costs.

Concluding remarks

Here, we describe sample-multiplexing approaches for single-cell sequencing in transcriptomics, epigenomics, genomics, and multiomics, illustrate their respective strengths and disadvantages, and briefly provide selection guidelines and applications. Researchers are paying increasing attention to these methods, and innovations such as single-nucleus barcoding (SNuBar) [64] are being developed continuously. Sample multiplexing makes library construction more economically efficient while the cost of sequencing is still decreasing, which is very significant when a large number of samples or/and cells are desired in analysis, thereby sample multiplexing increases its impact on the broad fields of life science research.

Obviously, single-cell transcriptome sequencing provides the maturest solution to profile cell states and molecular characterizations by measuring mRNA information, and consequently relevant studies have been blossoming. Recently, Cheng et al. summarized sample-multiplexing approaches for scRNA-seq [91], and witnessed the thriving development in this field. Compared to single-cell transcriptome sequencing, there is a significant lag in experimental techniques and analytical pipelines for single-cell epigenome and genome sequencing. And in the latter case, the research costs are generally more expensive; the technological obstacles are even harder, so sample multiplexing is more desired. The good news is that we can see these challenges being overcome, and multiplexed single-cell epigenome and genome sequencing are driving to maturity out of their infancy. Especially, scATAC-seq matching scRNA-seq well has been widely applied and sample-multiplexing methods for it have emerged increasingly. In addition, Mission Bio Tapestri platform provides single-cell DNA-seq and DNA and protein co-assay, probably accessible to sample multiplexing based on the concept of Cell Hashing. Great importance should be attached to multiplexed scMulti-omics, which provides multidimensional and more comprehensive data, deepening the understanding of the central dogma associating with different physiological and pathological status of cells.

Notably, a dozen of popular technologies are available to explore spatial information of tissues, such as Slide-Seq [92], HDST [93], DBiT-seq [94], 10 × Genomics VISIUM, and BGI Stereomics. At present, there is no relevant literature of technology about sample multiplexing for spatial transcriptomics, which is of the future direction. In our laboratory, we have successfully implemented sample multiplexing on 10 × Genomics VISIUM by embedding multiple tissue samples together in OCT, or placing several tissue sections in one capture area. Further progress may include the adaptation of sample multiplexing to many emerging sequencing technologies, such as Cleavage Under Targets and Tagmentation (CUT&Tag) [95]. These elegant and powerful methods will fuel insights into complex biological processes at single-cell level.

Availability of data and material

Not applicable.

References

Tang F et al (2009) mRNA-Seq whole-transcriptome analysis of a single cell. Nat Methods 6:377–382. https://doi.org/10.1038/nmeth.1315
Article CAS PubMed Google Scholar
Shalek AK et al (2013) Single-cell transcriptomics reveals bimodality in expression and splicing in immune cells. Nature 498:236–240. https://doi.org/10.1038/nature12172
Article CAS PubMed PubMed Central Google Scholar
Tang F et al (2011) Development and applications of single-cell transcriptome analysis. Nat Methods 8:S6-11. https://doi.org/10.1038/nmeth.1557
Article CAS PubMed PubMed Central Google Scholar
Deng Q et al (2014) Single-cell RNA-seq reveals dynamic, random monoallelic gene expression in mammalian cells. Science (New York, N.Y.) 343:193–196. https://doi.org/10.1126/science.1245316
Article CAS Google Scholar
Zhang X et al (2016) Single-cell sequencing for precise cancer research: progress and prospects. Can Res 76:1305–1312. https://doi.org/10.1158/0008-5472.Can-15-1907
Article CAS Google Scholar
Suvà ML, Tirosh I (2019) Single-cell RNA sequencing in cancer: lessons learned and emerging challenges. Mol Cell 75:7–12. https://doi.org/10.1016/j.molcel.2019.05.003
Article CAS PubMed Google Scholar
Wen L, Tang F (2016) Single-cell sequencing in stem cell biology. Genome Biol 17:71. https://doi.org/10.1186/s13059-016-0941-0
Article CAS PubMed PubMed Central Google Scholar
Chen H et al (2019) Revolutionizing immunology with single-cell RNA sequencing. Cell Mol Immunol 16:242–249. https://doi.org/10.1038/s41423-019-0214-4
Article CAS PubMed PubMed Central Google Scholar
Ofengeim D et al (2017) Single-cell RNA sequencing: unraveling the brain one cell at a time. Trends Mol Med 23:563–576. https://doi.org/10.1016/j.molmed.2017.04.006
Article CAS PubMed PubMed Central Google Scholar
Cuevas-Diaz Duran R et al (2017) Single-cell RNA-sequencing of the brain. Clin Transl Med 6:20. https://doi.org/10.1186/s40169-017-0150-9
Article PubMed PubMed Central Google Scholar
Mu Q et al (2019) Deciphering brain complexity using single-cell sequencing. Genomics Proteomics Bioinformatics 17:344–366. https://doi.org/10.1016/j.gpb.2018.07.007
Article PubMed PubMed Central Google Scholar
Tšuiko O et al (2020) Preimplantation Genetic Testing: single-cell technologies at the forefront of PGT and embryo research. Reproduction (Cambridge, England) 160:A19-a31. https://doi.org/10.1530/rep-20-0102
Article CAS Google Scholar
Shangguan Y et al (2020) Application of single-cell RNA sequencing in embryonic development. Genomics 112:4547–4551. https://doi.org/10.1016/j.ygeno.2020.08.007
Article CAS PubMed Google Scholar
Peng G et al (2020) Using single-cell and spatial transcriptomes to understand stem cell lineage specification during early embryo development. Annu Rev Genomics Hum Genet 21:163–181. https://doi.org/10.1146/annurev-genom-120219-083220
Article CAS PubMed Google Scholar
Hedlund E, Deng Q (2018) Single-cell RNA sequencing: technical advancements and biological applications. Mol Aspects Med 59:36–46. https://doi.org/10.1016/j.mam.2017.07.003
Article CAS PubMed Google Scholar
Huang X et al (2018) High throughput single cell RNA sequencing, bioinformatics analysis and applications. Adv Exp Med Biol 1068:33–43. https://doi.org/10.1007/978-981-13-0502-3_4
Article CAS PubMed Google Scholar
Yasen A et al (2020) Progress and applications of single-cell sequencing techniques. Infect Genet Evol 80:104198. https://doi.org/10.1016/j.meegid.2020.104198
Article CAS PubMed Google Scholar
Zheng GX et al (2017) Massively parallel digital transcriptional profiling of single cells. Nat Commun 8:14049. https://doi.org/10.1038/ncomms14049
Article CAS PubMed PubMed Central Google Scholar
Shum EY et al (2019) Quantitation of mRNA transcripts and proteins using the BD Rhapsody™ single-cell analysis system. Adv Exp Med Biol 1129:63–79. https://doi.org/10.1007/978-981-13-6037-4_5
Article CAS PubMed Google Scholar
Svensson V et al (2018) Exponential scaling of single-cell RNA-seq in the past decade. Nat Protoc 13(4):599–604. https://doi.org/10.1038/nprot.2017.149
Article CAS PubMed Google Scholar
Valihrach L et al (2018) Platforms for single-cell collection and analysis. Int J Mol Sci. https://doi.org/10.3390/ijms19030807
Article PubMed PubMed Central Google Scholar
Regev A et al (2017) The human cell Atlas. Elife. https://doi.org/10.7554/eLife.27041
Article PubMed PubMed Central Google Scholar
Li H et al (2021) Fly Cell Atlas: a single-cell transcriptomic atlas of the adult fruit fly. J bioRxiv. https://doi.org/10.1101/2021.07.04.451050
Article PubMed Google Scholar
Rhee SY et al (2019) Towards building a plant cell Atlas. Trends Plant Sci 24:303–310. https://doi.org/10.1016/j.tplants.2019.01.006
Article CAS PubMed PubMed Central Google Scholar
Wolock SL et al (2019) Scrublet: computational identification of cell doublets in single-cell transcriptomic data. Cell Syst 8(4):281–291. https://doi.org/10.1016/j.cels.2018.11.005 (e9)
Article CAS PubMed PubMed Central Google Scholar
McGinnis CS et al (2019) DoubletFinder: doublet detection in single-cell RNA sequencing data using artificial nearest neighbors. Cell Syst 8(4):329-337.e4. https://doi.org/10.1016/j.cels.2019.03.003
Article CAS PubMed PubMed Central Google Scholar
Hicks SC et al (2018) Missing data and technical variability in single-cell RNA-sequencing experiments. Biostatistics (Oxford, England) 19:562–578. https://doi.org/10.1093/biostatistics/kxx053
Article Google Scholar
Kang HM et al (2018) Multiplexed droplet single-cell RNA-sequencing using natural genetic variation. Nat Biotechnol 36:89–94. https://doi.org/10.1038/nbt.4042
Article CAS PubMed Google Scholar
Xu J et al (2019) Genotype-free demultiplexing of pooled single-cell RNA-seq. Genome Biol 20:290. https://doi.org/10.1186/s13059-019-1852-7
Article CAS PubMed PubMed Central Google Scholar
Huang Y et al (2019) Vireo: Bayesian demultiplexing of pooled single-cell RNA-seq data without genotype reference. Genome Biol 20:273. https://doi.org/10.1186/s13059-019-1865-2
Article PubMed PubMed Central Google Scholar
Heaton H et al (2020) Souporcell: robust clustering of single-cell RNA-seq data by genotype without reference genotypes. Nat Methods 17:615–620. https://doi.org/10.1038/s41592-020-0820-1
Article CAS PubMed Google Scholar
Stoeckius M et al (2017) Simultaneous epitope and transcriptome measurement in single cells. Nat Methods 14:865–868. https://doi.org/10.1038/nmeth.4380
Article CAS PubMed PubMed Central Google Scholar
Peterson VM et al (2017) Multiplexed quantification of proteins and transcripts in single cells. Nat Biotechnol 35:936–939. https://doi.org/10.1038/nbt.3973
Article CAS PubMed Google Scholar
Stoeckius M et al (2018) Cell Hashing with barcoded antibodies enables multiplexing and doublet detection for single cell genomics. Genome Biol 19:224. https://doi.org/10.1186/s13059-018-1603-1
Article CAS PubMed PubMed Central Google Scholar
Wu H et al (2019) Advantages of single-nucleus over single-cell RNA sequencing of adult kidney: rare cell types and novel cell states revealed in fibrosis. J Am Soc Nephrol 30:23–32. https://doi.org/10.1681/asn.2018090912
Article CAS PubMed Google Scholar
Slyper M et al (2020) A single-cell and single-nucleus RNA-Seq toolbox for fresh and frozen human tumors. Nat Med 26:792–802. https://doi.org/10.1038/s41591-020-0844-1
Article CAS PubMed PubMed Central Google Scholar
Ding J et al (2020) Systematic comparison of single-cell and single-nucleus RNA-sequencing methods. Nat Biotechnol 38:737–746. https://doi.org/10.1038/s41587-020-0465-8
Article CAS PubMed PubMed Central Google Scholar
Habib N et al (2017) Massively parallel single-nucleus RNA-seq with DroNc-seq. Nat Methods 14:955–958. https://doi.org/10.1038/nmeth.4407
Article CAS PubMed PubMed Central Google Scholar
Nagy C et al (2018) Single-nucleus RNA sequencing shows convergent evidence from different cell types for altered synaptic plasticity in major depressive disorder
Lake BB et al (2018) Integrative single-cell analysis of transcriptional and epigenetic states in the human adult brain. Nat Biotechnol 36:70–80. https://doi.org/10.1038/nbt.4038
Article CAS PubMed Google Scholar
Gaublomme JT et al (2019) Nuclei multiplexing with barcoded antibodies for single-nucleus genomics. Nat Commun 10:2907. https://doi.org/10.1038/s41467-019-10756-2
Article CAS PubMed PubMed Central Google Scholar
Weber RJ et al (2014) Efficient targeting of fatty-acid modified oligonucleotides to live cell membranes through stepwise assembly. Biomacromol 15:4621–4626. https://doi.org/10.1021/bm501467h
Article CAS Google Scholar
McGinnis CS et al (2019) MULTI-seq: sample multiplexing for single-cell RNA sequencing using lipid-tagged indices. Nat Methods 16:619–626. https://doi.org/10.1038/s41592-019-0433-8
Article CAS PubMed PubMed Central Google Scholar
Fang L et al (2021) CASB: a concanavalin A-based sample barcoding strategy for single-cell sequencing. Mol Syst Biol 17:e10060. https://doi.org/10.15252/msb.202010060
Article CAS PubMed PubMed Central Google Scholar
Gehring J et al (2020) Highly multiplexed single-cell RNA-seq by DNA oligonucleotide tagging of cellular proteins. Nat Biotechnol 38:35–38. https://doi.org/10.1038/s41587-019-0372-z
Article CAS PubMed Google Scholar
Shin D et al (2019) Multiplexed single-cell RNA-seq via transient barcoding for simultaneous expression profiling of various drug perturbations. Sci Adv 5:eaav2249. https://doi.org/10.1126/sciadv.aav2249
Article CAS PubMed PubMed Central Google Scholar
Srivatsan SR et al (2020) Massively multiplex chemical transcriptomics at single-cell resolution. Science (New York, N.Y.) 367:45–51. https://doi.org/10.1126/science.aax6234
Article CAS Google Scholar
Guo C et al (2019) Cell Tag Indexing: genetic barcode-based sample multiplexing for single-cell genomics. Genome Biol 20:90. https://doi.org/10.1186/s13059-019-1699-y
Article CAS PubMed PubMed Central Google Scholar
Dixit A et al (2016) Perturb-Seq: dissecting molecular circuits with scalable single-cell RNA profiling of pooled genetic screens. Cell 167:1853-1866.e1817. https://doi.org/10.1016/j.cell.2016.11.038
Article CAS PubMed PubMed Central Google Scholar
Adamson B et al (2016) A multiplexed single-cell CRISPR screening platform enables systematic dissection of the unfolded protein response. Cell 167:1867–1882. https://doi.org/10.1016/j.cell.2016.11.048 (e1821)
Article CAS PubMed PubMed Central Google Scholar
Jaitin DA et al (2016) Dissecting immune circuits by linking CRISPR-pooled screens with single-cell RNA-Seq. Cell 167:1883–1896. https://doi.org/10.1016/j.cell.2016.11.039 (e1815)
Article CAS PubMed Google Scholar
Datlinger P et al (2017) Pooled CRISPR screening with single-cell transcriptome readout. Nat Methods 14:297–301. https://doi.org/10.1038/nmeth.4177
Article CAS PubMed PubMed Central Google Scholar
Wagner DE, Klein AM (2017) Genetic screening enters the single-cell era. Nat Methods 14:237–238. https://doi.org/10.1038/nmeth.4196
Article CAS PubMed Google Scholar
Aarts M et al (2017) Coupling shRNA screens with single-cell RNA-seq identifies a dual role for mTOR in reprogramming-induced senescence. Genes Dev 31:2085–2098. https://doi.org/10.1101/gad.297796.117
Article CAS PubMed PubMed Central Google Scholar
Uzbas F et al (2019) BART-Seq: cost-effective massively parallelized targeted sequencing for genomics, transcriptomics, and single-cell analysis. Genome Biol 20:155. https://doi.org/10.1186/s13059-019-1748-6
Article CAS PubMed PubMed Central Google Scholar
Datlinger P et al (2021) Ultra-high-throughput single-cell RNA sequencing and perturbation screening with combinatorial fluidic indexing. Nat Methods 18:635–642. https://doi.org/10.1038/s41592-021-01153-z
Article CAS PubMed PubMed Central Google Scholar
Cao J et al (2017) Comprehensive single-cell transcriptional profiling of a multicellular organism. Science (New York, N.Y.) 357:661–667. https://doi.org/10.1126/science.aam8940
Article CAS Google Scholar
Cao J et al (2019) The single-cell transcriptional landscape of mammalian organogenesis. Nature 566:496–502. https://doi.org/10.1038/s41586-019-0969-x
Article CAS PubMed PubMed Central Google Scholar
Rosenberg AB et al (2018) Single-cell profiling of the developing mouse brain and spinal cord with split-pool barcoding. Science (New York, N.Y.) 360:176–182. https://doi.org/10.1126/science.aam8999
Article CAS Google Scholar
Shashikant T, Ettensohn CA (2019) Genome-wide analysis of chromatin accessibility using ATAC-seq. Methods Cell Biol 151:219–235. https://doi.org/10.1016/bs.mcb.2018.11.002
Article CAS PubMed Google Scholar
Yan F et al (2020) From reads to insight: a hitchhiker’s guide to ATAC-seq data analysis. Genome Biol 21:22. https://doi.org/10.1186/s13059-020-1929-3
Article PubMed PubMed Central Google Scholar
Cusanovich DA et al (2015) Multiplex single cell profiling of chromatin accessibility by combinatorial cellular indexing. Science (New York, N.Y.) 348:910–914. https://doi.org/10.1126/science.aab1601
Article CAS Google Scholar
Lareau CA et al (2019) Droplet-based combinatorial indexing for massive-scale single-cell chromatin accessibility. Nat Biotechnol 37:916–924. https://doi.org/10.1038/s41587-019-0147-6
Article CAS PubMed Google Scholar
Wang K et al (2021) Simple oligonucleotide-based multiplexing of single-cell chromatin accessibility. Mol Cell 81:4319-4332.e4310. https://doi.org/10.1016/j.molcel.2021.09.026
Article CAS PubMed Google Scholar
Ng PC, Kirkness EF (2010) Whole genome sequencing. Methods Mol Biol (Clifton, N.J.) 628:215–226. https://doi.org/10.1007/978-1-60327-367-1_12
Article CAS Google Scholar
Vitak SA et al (2017) Sequencing thousands of single-cell genomes with combinatorial indexing. Nat Methods 14:302–308. https://doi.org/10.1038/nmeth.4154
Article CAS PubMed PubMed Central Google Scholar
Moore LD et al (2013) DNA methylation and its basic function. Neuropsychopharmacology 38:23–38. https://doi.org/10.1038/npp.2012.112
Article CAS PubMed Google Scholar
Gouil Q, Keniry A (2019) Latest techniques to study DNA methylation. Essays Biochem 63:639–648. https://doi.org/10.1042/ebc20190027
Article CAS PubMed PubMed Central Google Scholar
Frommer M et al (1992) A genomic sequencing protocol that yields a positive display of 5-methylcytosine residues in individual DNA strands. Proc Natl Acad Sci USA 89:1827–1831. https://doi.org/10.1073/pnas.89.5.1827
Article CAS PubMed PubMed Central Google Scholar
Lister R et al (2009) Human DNA methylomes at base resolution show widespread epigenomic differences. Nature 462:315–322. https://doi.org/10.1038/nature08514
Article CAS PubMed PubMed Central Google Scholar
Smallwood SA et al (2014) Single-cell genome-wide bisulfite sequencing for assessing epigenetic heterogeneity. Nat Methods 11:817–820. https://doi.org/10.1038/nmeth.3035
Article CAS PubMed PubMed Central Google Scholar
Farlik M et al (2015) Single-cell DNA methylome sequencing and bioinformatic inference of epigenomic cell-state dynamics. Cell Rep 10:1386–1397. https://doi.org/10.1016/j.celrep.2015.02.001
Article CAS PubMed PubMed Central Google Scholar
Farlik M et al (2016) DNA methylation dynamics of human hematopoietic stem cell differentiation. Cell Stem Cell 19:808–822. https://doi.org/10.1016/j.stem.2016.10.019
Article CAS PubMed PubMed Central Google Scholar
Angermueller C et al (2016) Parallel single-cell sequencing links transcriptional and epigenetic heterogeneity. Nat Methods 13:229–232. https://doi.org/10.1038/nmeth.3728
Article CAS PubMed PubMed Central Google Scholar
Clark SJ et al (2017) Genome-wide base-resolution mapping of DNA methylation in single cells using single-cell bisulfite sequencing (scBS-seq). Nat Protoc 12:534–547. https://doi.org/10.1038/nprot.2016.187
Article CAS PubMed Google Scholar
Luo C et al (2017) Single-cell methylomes identify neuronal subtypes and regulatory elements in mammalian cortex. Science (New York, N.Y.) 357:600–604. https://doi.org/10.1126/science.aan3351
Article CAS Google Scholar
Mulqueen RM et al (2018) Highly scalable generation of DNA methylation profiles in single cells. Nat Biotechnol 36:428–431. https://doi.org/10.1038/nbt.4112
Article CAS PubMed PubMed Central Google Scholar
Lieberman-Aiden E et al (2009) Comprehensive mapping of long-range interactions reveals folding principles of the human genome. Science (New York, N.Y.) 326:289–293. https://doi.org/10.1126/science.1181369
Article CAS Google Scholar
Ramani V et al (2016) Understanding spatial genome organization: methods and insights. Genomics Proteomics Bioinformatics 14:7–20. https://doi.org/10.1016/j.gpb.2016.01.002
Article PubMed PubMed Central Google Scholar
Eagen KP (2018) Principles of chromosome architecture revealed by Hi-C. Trends Biochem Sci 43:469–478. https://doi.org/10.1016/j.tibs.2018.03.006
Article CAS PubMed PubMed Central Google Scholar
Kong S, Zhang Y (2019) Deciphering Hi-C: from 3D genome to function. Cell Biol Toxicol 35:15–32. https://doi.org/10.1007/s10565-018-09456-2
Article CAS PubMed Google Scholar
Ramani V et al (2017) Massively multiplex single-cell Hi-C. Nat Methods 14:263–266. https://doi.org/10.1038/nmeth.4155
Article CAS PubMed PubMed Central Google Scholar
Hasin Y et al (2017) Multi-omics approaches to disease. Genome Biol 18:83. https://doi.org/10.1186/s13059-017-1215-1
Article CAS PubMed PubMed Central Google Scholar
Chappell L et al (2018) Single-sell (Multi)omics technologies. Annu Rev Genomics Hum Genet 19:15–41. https://doi.org/10.1146/annurev-genom-091416-035324
Article CAS PubMed Google Scholar
Mimitou EP et al (2019) Multiplexed detection of proteins, transcriptomes, clonotypes and CRISPR perturbations in single cells. Nat Methods 16:409–412. https://doi.org/10.1038/s41592-019-0392-0
Article CAS PubMed PubMed Central Google Scholar
Cao J et al (2018) Joint profiling of chromatin accessibility and gene expression in thousands of single cells. Science (New York, N.Y.) 361:1380–1385. https://doi.org/10.1126/science.aau0730
Article CAS Google Scholar
Mimitou EP et al (2021) Scalable, multimodal profiling of chromatin accessibility, gene expression and protein levels in single cells. Nat Biotechnol 39:1246–1258. https://doi.org/10.1038/s41587-021-00927-2
Article CAS PubMed PubMed Central Google Scholar
Swanson E et al (2021) Simultaneous trimodal single-cell measurement of transcripts, epitopes, and chromatin accessibility using TEA-seq. Elife. https://doi.org/10.7554/eLife.63632
Article PubMed PubMed Central Google Scholar
Hartmann FJ et al (2018) A universal live cell barcoding-platform for multiplexed human single cell analysis. Sci Rep 8(1):10770. https://doi.org/10.1038/s41598-018-28791-2
Article CAS PubMed PubMed Central Google Scholar
Mylka V et al (2022) Comparative analysis of antibody- and lipid-based multiplexing methods for single-cell RNA-seq. Genome Biol 23(1):55. https://doi.org/10.1186/s13059-022-02628-8
Article CAS PubMed PubMed Central Google Scholar
Cheng J et al (2021) Multiplexing methods for simultaneous large-scale transcriptomic profiling of samples at single-cell resolution. Adv Sci (Weinheim, Baden-Wurttemberg, Germany) 8:e2101229. https://doi.org/10.1002/advs.202101229
Article CAS Google Scholar
Rodriques SG et al (2019) Slide-seq: A scalable technology for measuring genome-wide expression at high spatial resolution. Science 363(6434):1463–1467. https://doi.org/10.1126/science.aaw1219
Article CAS PubMed PubMed Central Google Scholar
Vickovic S et al (2019) High-definition spatial transcriptomics for in situ tissue profiling. Nat Methods 16(10):987–990. https://doi.org/10.1038/s41592-019-0548-y
Article CAS PubMed PubMed Central Google Scholar
Liu Y et al (2020) High-spatial-resolution multi-omics sequencing via deterministic barcoding in tissue. Cell 183(6):1665-1681.e18. https://doi.org/10.1016/j.cell.2020.10.026
Article CAS PubMed PubMed Central Google Scholar
Kaya-Okur HS et al (2019) CUT&Tag for efficient epigenomic profiling of small samples and single cells. Nat Commun 10:1930. https://doi.org/10.1038/s41467-019-09982-5
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

We thank Kai He from Southern Medical University, and Yanfang Lu from Henan Provincial People’s Hospital for suggestion in the preparation of this manuscript.

Funding

This work was supported by the grants from the Natural Science Foundation of Guangdong Province (Major Basic Cultivation Project 2018B030308004, and Major Projects of Basic and Applied Basic Research 2019B1515120033), the National Nature Science Foundation of China (32071452), the Open Fund Programs of Shenzhen Bay Laboratory (SZBL2020090501003), and the Pearl River Talents Program Local Innovative and Research Teams (2017BT01S131).

Author information

Authors and Affiliations

Department of Biochemistry and Molecular Biology, School of Basic Medical Sciences, Southern Medical University, Guangzhou, Guangdong, 510515, China
Yulong Zhang, Siwen Xu, Zebin Wen, Jinyu Gao & Xinghua Pan
Guangdong Provincial Key Laboratory of Single Cell Technology and Application, Southern Medical University, Guangzhou, Guangdong, 510515, China
Yulong Zhang, Siwen Xu, Zebin Wen, Jinyu Gao & Xinghua Pan
Shenzhen Bay Laboratory, Shenzhen, Guangdong, China
Yulong Zhang, Shuang Li & Xinghua Pan
SequMed BioTechnology, Inc., Guangzhou, Guangdong, China
Siwen Xu
Department of Genetics, Yale University School of Medicine, New Haven, CT, 06520-8005, USA
Sherman M. Weissman

Authors

Yulong Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Siwen Xu
View author publications
You can also search for this author in PubMed Google Scholar
Zebin Wen
View author publications
You can also search for this author in PubMed Google Scholar
Jinyu Gao
View author publications
You can also search for this author in PubMed Google Scholar
Shuang Li
View author publications
You can also search for this author in PubMed Google Scholar
Sherman M. Weissman
View author publications
You can also search for this author in PubMed Google Scholar
Xinghua Pan
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

All authors contributed to the review conception and design. Material preparation, data collection, and analysis were performed by YZ, SX, ZW, and JG. The first draft of the manuscript was written by YZ and SX, and all authors commented on previous versions of the manuscript. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Xinghua Pan.

Ethics declarations

Conflict of interest

There are no conflicts of interest to declare.

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Zhang, Y., Xu, S., Wen, Z. et al. Sample-multiplexing approaches for single-cell sequencing. Cell. Mol. Life Sci. 79, 466 (2022). https://doi.org/10.1007/s00018-022-04482-0

Download citation

Received: 18 February 2022
Revised: 25 June 2022
Accepted: 11 July 2022
Published: 05 August 2022
DOI: https://doi.org/10.1007/s00018-022-04482-0

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Sample-multiplexing approaches for single-cell sequencing

Abstract

Similar content being viewed by others

Applications of Single-Cell Sequencing for Multiomics

Single-cell transcriptomics: background, technologies, applications, and challenges

Single-cell sequencing techniques from individual to multiomics analyses

Introduction

Sample multiplexing for single-cell transcriptome sequencing

Natural genetic variation

Demultiplexing scRNA-seq data by genetic variations with reference genotypes

Demultiplexing scRNA-seq data by genetic variations without reference genotypes

Nucleotide-barcode anchoring on cellular or nuclear membranes

Oligo-tagged antibodies

Lipid-tagged indices

Concanavalin A (ConA)-based sample barcoding

ClickTags

Nucleotide-barcode internalization to the cytoplasm or nucleus

Liposomal transfection with short barcoding oligos

Staining nuclei with polyadenylated ssDNA oligonucleotides

Vector-based barcode expression in cells

Lentiviral barcodes facilitate cell tagging, pooling, and tracking

Clustered regularly interspaced short palindromic repeats (CRISPR) and short-hairpin RNA (shRNA) enable multiplexed single-cell screening

Incorporating nucleotide barcodes during library construction

Barcode assembly for targeted sequencing (BART-Seq)

Single-cell combinatorial fluidic indexing (scifi)-RNA-seq

Single-cell combinatorial indexing (sci)-RNA-seq and split-pool ligation-based transcriptome sequencing (SPLiT-seq)

Sample multiplexing for single-cell epigenome and genome sequencing

Sample multiplexing for scATAC-seq

sci-ATAC-seq

Droplet-based scATAC-seq (dscATAC-seq) combined with combinatorial indexing (dsciATAC-seq)

CASB for scATAC-seq

Sample multiplexing for scWGS

Sample multiplexing for single-cell DNA-methylation analysis

Sample multiplexing for scHi-C

Sample multiplexing for scMulti-omics

Selection guidelines and applications of sample-multiplexing approaches for single-cell sequencing

Concluding remarks

Availability of data and material

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Conflict of interest

Ethics approval and consent to participate

Consent for publication

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation