Genomics

Dataset Information

0

A Bayesian mixture model for clustering droplet-based single cell transcriptomic data from population studies


ABSTRACT: Abstract: The recently developed droplet-based single cell transcriptome sequencing (scRNA-seq) technology makes it feasible to perform a population-scale scRNA-seq study, in which the transcriptome is measured for tens of thousands of single cells from multiple individuals. Despite the advances of many clustering methods, there are few tailored methods for population-scale scRNA-seq studies. Here, we develop a BAyesian Mixture Model for Single Cell sequencing (BAMM-SC) method to cluster scRNA-seq data from multiple individuals simultaneously. BAMM-SC takes raw count data as input and accounts for data heterogeneity and batch effect among multiple individuals in a unified Bayesian hierarchical model framework. Results from applications of BAMM-SC to in-house experimental scRNA-seq datasets using blood and lung cells from humans or mice demonstrate that BAMM-SC outperformed existing clustering methods with considerable improved clustering accuracy, particularly in the presence of heterogeneity among individuals. Data purpose: To evaluate the performance of BAMM-SC for clustering droplet-based scRNA-seq data in population-based study, we performed single cell RNA-seq on peripheral blood mononuclear cells (PBMC) isolated from whole blood obtained from 4 healthy donors, and on lung cells isolated from streptococcus pneumonia (SP) infected and naïve mice.

ORGANISM(S): Mus musculus Homo sapiens

PROVIDER: GSE128066 | GEO | 2019/03/09

REPOSITORIES: GEO

Similar Datasets

2020-05-18 | GSE148665 | GEO
| PRJNA526250 | ENA
2021-11-30 | E-MTAB-10800 | biostudies-arrayexpress
2023-03-11 | PXD035765 | Pride
2019-12-31 | GSE134134 | GEO
2020-12-20 | GSE111893 | GEO
2021-06-25 | GSE165905 | GEO
2020-11-24 | E-MTAB-9522 | biostudies-arrayexpress
2021-06-29 | GSE159417 | GEO
2022-11-15 | E-MTAB-11188 | biostudies-arrayexpress