Dataset Information

Characterizing the replicability of cell types defined by single cell RNA-sequencing data using MetaNeighbor.


ABSTRACT: Single-cell RNA-sequencing (scRNA-seq) technology provides a new avenue to discover and characterize cell types; however, the experiment-specific technical biases and analytic variability inherent to current pipelines may undermine its replicability. Meta-analysis is further hampered by the use of ad hoc naming conventions. Here we demonstrate our replication framework, MetaNeighbor, that quantifies the degree to which cell types replicate across datasets, and enables rapid identification of clusters with high similarity. We first measure the replicability of neuronal identity, comparing results across eight technically and biologically diverse datasets to define best practices for more complex assessments. We then apply this to novel interneuron subtypes, finding that 24/45 subtypes have evidence of replication, which enables the identification of robust candidate marker genes. Across tasks we find that large sets of variably expressed genes can identify replicable cell types with high accuracy, suggesting a general route forward for large-scale evaluation of scRNA-seq data.

SUBMITTER: Crow M 

PROVIDER: S-EPMC5830442 | biostudies-literature | 2018 Feb

REPOSITORIES: biostudies-literature

Publications

Characterizing the replicability of cell types defined by single cell RNA-sequencing data using MetaNeighbor.

Megan Crow, Anirban Paul, Sara Ballouz, Z Josh Huang, Jesse Gillis

Nature Communications, 2018 Feb 28, issue 1


Similar Datasets

S-EPMC8762856 | biostudies-literature
S-EPMC7031312 | biostudies-literature
S-EPMC7505465 | biostudies-literature
S-EPMC8742092 | biostudies-literature
S-EPMC8766768 | biostudies-literature
S-EPMC7321924 | biostudies-literature
S-EPMC8168892 | biostudies-literature