Transcriptomics,Genomics

Dataset Information

167

BigSCale: An Analytical Framework for Big-Scale Single Cell Data


ABSTRACT: Single-cell RNA sequencing significantly deepened our insights into complex tissues and latest techniques are capable to analyze ten-thousands of cells simultaneously. With bigSCale, we provide an analytical framework being scalable to analyze millions of cells, addressing challenges of future large data sets. Unlike other methods, bigSCale does not constrain data to fit an a priori-defined distribution and instead uses an accurate numerical model of noise. We evaluate the performance of bigSCale using a biological model of aberrant gene expression and simulated data sets, which underlined its speed and accuracy. We further apply bigSCale to analyze 1.3 million cells from the mouse developing forebrain. The directed down-sampling strategy identified rare populations, such as Reelin positive Cajal-Retzius neurons, for which we determined a previously not recognized heterogeneity associated to distinct differentiation stages, spatial organization and cellular function. Together, bigSCale presents a perfect solution to address future challenges of large single-cell data sets. Overall design: Single cell transcriptomes from 1847 neuronal progenitors differentiated from iPSC of one healthy donor, two patients with Williams-Beuren syndrome and two patients with 7q11.23 microduplication syndrome.

INSTRUMENT(S): Illumina HiSeq 2500 (Homo sapiens)

SUBMITTER: giovanni iacono 

PROVIDER: GSE102934 | GEO | 2018-04-15

REPOSITORIES: GEO

Similar Datasets

| GSE29767 | GEO
| GSE80311 | GEO
2013-01-17 | E-GEOD-29767 | ArrayExpress
2016-03-08 | E-GEOD-74734 | ArrayExpress
| GSE74734 | GEO
2014-12-15 | E-GEOD-63057 | ArrayExpress
2014-12-15 | E-GEOD-63028 | ArrayExpress
2014-12-15 | E-GEOD-63055 | ArrayExpress
2011-09-21 | GSE28040 | GEO
2013-11-12 | E-GEOD-35571 | ArrayExpress