Transcriptomics

Dataset Information

0

Effective methods for bulk RNA-seq deconvolution using scnRNA-seq transcriptomes [cell mixtures bulk]


ABSTRACT: RNA profiling technologies at single-cell resolutions, including single-cell and single-nuclei RNA sequencing (scRNA-Seq and snRNA-Seq, scnRNA-Seq for short), can help characterize the composition of tissues and reveal cells that influence key healthy and disease functions. However, the use of these technologies is challenging because of their relatively high costs and exacting sample collection requirements. Computational deconvolution methods that infer the composition of RNA-Seq-profiled samples using scnRNA-Seq-characterized cell types can expand the benefit of these technologies, but their effectiveness remains controversial. We produced the first systematic evaluation of deconvolution methods on datasets with either known compositions or based on concurrent RNA-Seq and scnRNA-Seq profiles. Our analyses revealed biases that are common to scnRNA-Seq 10X Genomics assays and illustrated the importance of accurate and properly controlled data preprocessing and method selection and optimization. Moreover, our results suggested that concurrent RNA-Seq and scnRNA-Seq profiles can help improve the accuracy of both scnRNA-Seq preprocessing and the deconvolution methods that employ them. Indeed, our proposed method, Single-cell RNA Quantity Informed Deconvolution (SQUID), combined RNA-Seq transformation and a dampened weighted least squares deconvolution approach to consistently outperform other methods in predicting the composition of cell mixtures and tissue samples. Moreover, our analysis suggested that only SQUID could identify outcomes-predictive cancer cell subtypes in pediatric acute myeloid leukemia and neuroblastoma datasets.

ORGANISM(S): Homo sapiens

PROVIDER: GSE220605 | GEO | 2023/04/28

REPOSITORIES: GEO

Similar Datasets

2023-04-28 | GSE220607 | GEO
2023-04-28 | GSE220606 | GEO
2023-04-28 | GSE220651 | GEO
2021-03-31 | GSE163252 | GEO
2019-02-06 | GSE107011 | GEO
2019-02-06 | GSE106898 | GEO
2021-06-04 | GSE161470 | GEO
2016-03-18 | GSE79385 | GEO
2018-12-12 | GSE123604 | GEO
2012-09-13 | E-GEOD-40829 | biostudies-arrayexpress