Single-cell RNA sequencing (scRNA-Seq) is a powerful tool that is growing in popularity thanks to its utility.
We have built a standardized pipeline that speeds up QC and processing of single-cell data and enables complete transparency and reproducibility. Our platform consists of two applications, QB_QuickPassQC, and QB_StanAnnDEX.
QB_QuickPassQC allows us to rapidly assess new and un-vetted single-cell data sets for overall quality, doublet presence1, ambient RNA contamination2, and confounding technical covariates.
QB_QuickPassQC also enables us to swiftly identify biologically and statistically sound filtering thresholds and confounding variables which we utilize in our QB_StanAnnDEX platform.
In Brief:
QB_StanAnnDEXis a containerized R application that performs (1) filtering and standardization, (2) annotation, and (3) differential expression on a single-cell RNA-Seq data.
We utilize this pipeline on pre-filtered data sets or data sets that have been vetted internally via QB_QuickPassQC.
A brief explanation of each step within the pipeline is provided below:
(1) Quality control and standardization:
We perform quality control at the individual cell and sample level. All cells and samples that are low quality are removed. Expression matrices are then normalized and if necessary, confounding variables are regressed out.
(2) Annotating and clustering cells:
Cells are annotated with either standard or custom references using the R package, SingleR3. After annotating cells, QB performs dimension reduction via PCA followed by graph-based clustering, and then visualizations via UMAP.
(3) Differential Expression:
During QC, we have automated the assessment of client-provided covariates, which may be of technical or biological nature. This allows us to conduct differential expression (DE) analyses with the proper controls, and regress technical covariates as needed. After completion of annotation and clustering, differential expression is performed utilizing Wilcoxon’s rank sum test within Seurat4,5. Final DE lists are determined and outputted.
References