Unknown

Dataset Information

0

SCDC: bulk gene expression deconvolution by multiple single-cell RNA sequencing references.


ABSTRACT: Recent advances in single-cell RNA sequencing (scRNA-seq) enable characterization of transcriptomic profiles with single-cell resolution and circumvent averaging artifacts associated with traditional bulk RNA sequencing (RNA-seq) data. Here, we propose SCDC, a deconvolution method for bulk RNA-seq that leverages cell-type specific gene expression profiles from multiple scRNA-seq reference datasets. SCDC adopts an ENSEMBLE method to integrate deconvolution results from different scRNA-seq datasets that are produced in different laboratories and at different times, implicitly addressing the problem of batch-effect confounding. SCDC is benchmarked against existing methods using both in silico generated pseudo-bulk samples and experimentally mixed cell lines, whose known cell-type compositions serve as ground truths. We show that SCDC outperforms existing methods with improved accuracy of cell-type decomposition under both settings. To illustrate how the ENSEMBLE framework performs in complex tissues under different scenarios, we further apply our method to a human pancreatic islet dataset and a mouse mammary gland dataset. SCDC returns results that are more consistent with experimental designs and that reproduce more significant associations between cell-type proportions and measured phenotypes.

SUBMITTER: Dong M 

PROVIDER: S-EPMC7820884 | biostudies-literature | 2021 Jan

REPOSITORIES: biostudies-literature

altmetric image

Publications

SCDC: bulk gene expression deconvolution by multiple single-cell RNA sequencing references.

Dong Meichen M   Thennavan Aatish A   Urrutia Eugene E   Li Yun Y   Perou Charles M CM   Zou Fei F   Jiang Yuchao Y  

Briefings in bioinformatics 20210101 1


Recent advances in single-cell RNA sequencing (scRNA-seq) enable characterization of transcriptomic profiles with single-cell resolution and circumvent averaging artifacts associated with traditional bulk RNA sequencing (RNA-seq) data. Here, we propose SCDC, a deconvolution method for bulk RNA-seq that leverages cell-type specific gene expression profiles from multiple scRNA-seq reference datasets. SCDC adopts an ENSEMBLE method to integrate deconvolution results from different scRNA-seq dataset  ...[more]

Similar Datasets

| S-EPMC6048536 | biostudies-literature
| S-EPMC6342984 | biostudies-literature
| S-EPMC7650528 | biostudies-literature
2019-08-30 | GSE136148 | GEO
| S-EPMC7803005 | biostudies-literature
| S-EPMC8373058 | biostudies-literature
| S-EPMC6030502 | biostudies-literature
| S-EPMC6114100 | biostudies-other