Unknown

Dataset Information

0

DropClust: efficient clustering of ultra-large scRNA-seq data.


ABSTRACT: Droplet based single cell transcriptomics has recently enabled parallel screening of tens of thousands of single cells. Clustering methods that scale for such high dimensional data without compromising accuracy are scarce. We exploit Locality Sensitive Hashing, an approximate nearest neighbour search technique to develop a de novo clustering algorithm for large-scale single cell data. On a number of real datasets, dropClust outperformed the existing best practice methods in terms of execution time, clustering accuracy and detectability of minor cell sub-types.

SUBMITTER: Sinha D 

PROVIDER: S-EPMC5888655 | biostudies-literature | 2018 Apr

REPOSITORIES: biostudies-literature

altmetric image

Publications

dropClust: efficient clustering of ultra-large scRNA-seq data.

Sinha Debajyoti D   Kumar Akhilesh A   Kumar Himanshu H   Bandyopadhyay Sanghamitra S   Sengupta Debarka D  

Nucleic acids research 20180401 6


Droplet based single cell transcriptomics has recently enabled parallel screening of tens of thousands of single cells. Clustering methods that scale for such high dimensional data without compromising accuracy are scarce. We exploit Locality Sensitive Hashing, an approximate nearest neighbour search technique to develop a de novo clustering algorithm for large-scale single cell data. On a number of real datasets, dropClust outperformed the existing best practice methods in terms of execution ti  ...[more]

Similar Datasets

| S-EPMC8157426 | biostudies-literature
| S-EPMC8682753 | biostudies-literature
| S-EPMC8275983 | biostudies-literature
| S-EPMC7897472 | biostudies-literature
| S-EPMC8344557 | biostudies-literature
| S-EPMC7141853 | biostudies-literature
| S-EPMC8439043 | biostudies-literature
| S-EPMC6022691 | biostudies-other
| S-EPMC7647120 | biostudies-literature