Unknown

Dataset Information

0

A benchmark study of simulation methods for single-cell RNA sequencing data.


ABSTRACT: Single-cell RNA-seq (scRNA-seq) data simulation is critical for evaluating computational methods for analysing scRNA-seq data especially when ground truth is experimentally unattainable. The reliability of evaluation depends on the ability of simulation methods to capture properties of experimental data. However, while many scRNA-seq data simulation methods have been proposed, a systematic evaluation of these methods is lacking. We develop a comprehensive evaluation framework, SimBench, including a kernel density estimation measure to benchmark 12 simulation methods through 35 scRNA-seq experimental datasets. We evaluate the simulation methods on a panel of data properties, ability to maintain biological signals, scalability and applicability. Our benchmark uncovers performance differences among the methods and highlights the varying difficulties in simulating data characteristics. Furthermore, we identify several limitations including maintaining heterogeneity of distribution. These results, together with the framework and datasets made publicly available as R packages, will guide simulation methods selection and their future development.

SUBMITTER: Cao Y 

PROVIDER: S-EPMC8617278 | biostudies-literature | 2021 Nov

REPOSITORIES: biostudies-literature

altmetric image

Publications

A benchmark study of simulation methods for single-cell RNA sequencing data.

Cao Yue Y   Yang Pengyi P   Yang Jean Yee Hwa JYH  

Nature communications 20211125 1


Single-cell RNA-seq (scRNA-seq) data simulation is critical for evaluating computational methods for analysing scRNA-seq data especially when ground truth is experimentally unattainable. The reliability of evaluation depends on the ability of simulation methods to capture properties of experimental data. However, while many scRNA-seq data simulation methods have been proposed, a systematic evaluation of these methods is lacking. We develop a comprehensive evaluation framework, SimBench, includin  ...[more]

Similar Datasets

| S-EPMC6964114 | biostudies-literature
| S-EPMC6918801 | biostudies-literature
| S-EPMC5596896 | biostudies-literature
| S-EPMC7444317 | biostudies-literature
| S-EPMC8720898 | biostudies-literature
| S-EPMC7214028 | biostudies-literature
| S-EPMC7897250 | biostudies-literature
| S-EPMC9071466 | biostudies-literature
| S-EPMC6734286 | biostudies-literature
| S-EPMC8921632 | biostudies-literature