Unknown

Dataset Information

0

A multi-center cross-platform single-cell RNA sequencing reference dataset.


ABSTRACT: Single-cell RNA sequencing (scRNA-seq) is developing rapidly, and investigators seeking to use this technology are left with a variety of options for both experimental platform and bioinformatics methods. There is an urgent need for scRNA-seq reference datasets for benchmarking of different scRNA-seq platforms and bioinformatics methods. To be broadly applicable, these should be generated from renewable, well characterized reference samples and processed in multiple centers across different platforms. Here we present a benchmark scRNA-seq dataset that includes 20 scRNA-seq datasets acquired either as mixtures or as individual samples from two biologically distinct cell lines for which a large amount of multi-platform whole genome sequencing data are also available. These scRNA-seq datasets were generated from multiple popular platforms across four sequencing centers. We believe the datasets we describe here will provide a resource that meets this need by allowing evaluation of various bioinformatics methods for scRNA-seq analyses, including but not limited to data preprocessing, imputation, normalization, clustering, batch correction, and differential analysis.

SUBMITTER: Chen X 

PROVIDER: S-EPMC7854649 | biostudies-literature | 2021 Feb

REPOSITORIES: biostudies-literature

altmetric image

Publications

A multi-center cross-platform single-cell RNA sequencing reference dataset.

Chen Xin X   Yang Zhaowei Z   Chen Wanqiu W   Zhao Yongmei Y   Farmer Andrew A   Tran Bao B   Furtak Vyacheslav V   Moos Malcolm M   Xiao Wenming W   Wang Charles C  

Scientific data 20210202 1


Single-cell RNA sequencing (scRNA-seq) is developing rapidly, and investigators seeking to use this technology are left with a variety of options for both experimental platform and bioinformatics methods. There is an urgent need for scRNA-seq reference datasets for benchmarking of different scRNA-seq platforms and bioinformatics methods. To be broadly applicable, these should be generated from renewable, well characterized reference samples and processed in multiple centers across different plat  ...[more]

Similar Datasets

| S-EPMC8578599 | biostudies-literature
| S-EPMC9283889 | biostudies-literature
2025-09-01 | GSE303762 | GEO
| S-EPMC9879959 | biostudies-literature
| S-EPMC7560448 | biostudies-literature
| S-EPMC8456427 | biostudies-literature
| S-EPMC11245320 | biostudies-literature
| S-EPMC11725501 | biostudies-literature
| S-EPMC4322577 | biostudies-literature
| S-EPMC10018366 | biostudies-literature