Dataset Information

Interpretable and context-free deconvolution of multi-scale whole transcriptomic data with UniCell deconvolve.

ABSTRACT: We introduce UniCell: Deconvolve Base (UCDBase), a pre-trained, interpretable, deep learning model to deconvolve cell type fractions and predict cell identity across Spatial, bulk-RNA-Seq, and scRNA-Seq datasets without contextualized reference data. UCD is trained on 10 million pseudo-mixtures from a fully-integrated scRNA-Seq training database comprising over 28 million annotated single cells spanning 840 unique cell types from 898 studies. We show that our UCDBase and transfer-learning models achieve comparable or superior performance on in-silico mixture deconvolution to existing, reference-based, state-of-the-art methods. Feature attribute analysis uncovers gene signatures associated with cell-type specific inflammatory-fibrotic responses in ischemic kidney injury, discerns cancer subtypes, and accurately deconvolves tumor microenvironments. UCD identifies pathologic changes in cell fractions among bulk-RNA-Seq data for several disease states. Applied to lung cancer scRNA-Seq data, UCD annotates and distinguishes normal from cancerous cells. Overall, UCD enhances transcriptomic data analysis, aiding in assessment of cellular and spatial context.

SUBMITTER: Charytonowicz D

PROVIDER: S-EPMC10008582 | biostudies-literature | 2023 Mar

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

Interpretable and context-free deconvolution of multi-scale whole transcriptomic data with UniCell deconvolve.

Charytonowicz Daniel D Brody Rachel R Sebra Robert R

Nature communications 20230311 1

We introduce UniCell: Deconvolve Base (UCDBase), a pre-trained, interpretable, deep learning model to deconvolve cell type fractions and predict cell identity across Spatial, bulk-RNA-Seq, and scRNA-Seq datasets without contextualized reference data. UCD is trained on 10 million pseudo-mixtures from a fully-integrated scRNA-Seq training database comprising over 28 million annotated single cells spanning 840 unique cell types from 898 studies. We show that our UCDBase and transfer-learning models ...[more]

PMID: 36906603

Dataset Information

Interpretable and context-free deconvolution of multi-scale whole transcriptomic data with UniCell deconvolve.

Publications

Interpretable and context-free deconvolution of multi-scale whole transcriptomic data with UniCell deconvolve.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets

Similar Datasets

consICA: an R package for robust reference-free deconvolution of multi-omics data.
| S-EPMC11257712 | biostudies-literature

SpatialDWLS: accurate deconvolution of spatial transcriptomic data.
| S-EPMC8108367 | biostudies-literature

Reference-free cell type deconvolution of multi-cellular pixel-resolution spatially resolved transcriptomics data.
| S-EPMC9055051 | biostudies-literature

To deconvolve, or not to deconvolve: Inferences of neuronal activities using calcium imaging data.
| S-EPMC9482453 | biostudies-literature

A deconvolution algorithm for multi-echo functional MRI: Multi-echo Sparse Paradigm Free Mapping.
| S-EPMC6819276 | biostudies-literature

Model-free quantification of dynamic PET data using nonparametric deconvolution.
| S-EPMC4528013 | biostudies-literature

STsisal: a reference-free deconvolution pipeline for spatial transcriptomics data.
| S-EPMC11911522 | biostudies-literature

Phenotype prediction using biologically interpretable neural networks on multi-cohort multi-omics data.
| S-EPMC11297229 | biostudies-literature

Interpretable Multi-Scale Deep Learning for RNA Methylation Analysis across Multiple Species
| S-EPMC10932270 | biostudies-literature

Learning interpretable cellular and gene signature embeddings from single-cell transcriptomic data.
| S-EPMC8421403 | biostudies-literature