Unknown

Dataset Information

0

ScDisInFact: disentangled learning for integration and prediction of multi-batch multi-condition single-cell RNA-sequencing data.


ABSTRACT: Single-cell RNA-sequencing (scRNA-seq) has been widely used for disease studies, where sample batches are collected from donors under different conditions including demographical groups, disease stages, and drug treatments. It is worth noting that the differences among sample batches in such a study are a mixture of technical confounders caused by batch effect and the biological variations caused by condition effect. However, current batch effect removal methods often eliminate both technical batch effects and meaningful condition effects, while perturbation prediction methods solely focus on condition effects, resulting in inaccurate gene expression predictions due to unaccounted batch effects. Here we introduce scDisInFact, a deep learning framework that models both batch effect and condition effect in scRNA-seq data. scDisInFact learns latent factors that disentangle condition effects from batch effects, enabling it to simultaneously perform three tasks: batch effect removal, condition-associated key gene detection, and perturbation prediction. We evaluated scDisInFact on both simulated and real datasets, and compared its performance to baseline methods for each task. Our results demonstrate that scDisInFact outperforms existing methods that focus on individual tasks, providing a more comprehensive and accurate approach for integrating and predicting multi-batch multi-condition single-cell RNA-sequencing data.

SUBMITTER: Zhang Z 

PROVIDER: S-EPMC10187255 | biostudies-literature | 2023 May

REPOSITORIES: biostudies-literature

altmetric image

Publications

scDisInFact: disentangled learning for integration and prediction of multi-batch multi-condition single-cell RNA-sequencing data.

Zhang Ziqi Z   Zhao Xinye X   Qiu Peng P   Zhang Xiuwei X  

bioRxiv : the preprint server for biology 20230502


Single-cell RNA-sequencing (scRNA-seq) has been widely used for disease studies, where sample batches are collected from donors under different conditions including demographical groups, disease stages, and drug treatments. It is worth noting that the differences among sample batches in such a study are a mixture of technical confounders caused by batch effect and the biological variations caused by condition effect. However, current batch effect removal methods often eliminate both technical ba  ...[more]

Similar Datasets

| S-EPMC10827746 | biostudies-literature
| S-EPMC8788944 | biostudies-literature
| S-EPMC10944609 | biostudies-literature
| S-EPMC11304254 | biostudies-literature
| S-EPMC10336139 | biostudies-literature
| S-EPMC9979929 | biostudies-literature
| S-EPMC10788980 | biostudies-literature
| S-EPMC10197537 | biostudies-literature
| S-EPMC5374972 | biostudies-other
| S-EPMC8355612 | biostudies-literature