Unknown

Dataset Information

0

SC3s: efficient scaling of single cell consensus clustering to millions of cells.


ABSTRACT:

Background

Today it is possible to profile the transcriptome of individual cells, and a key step in the analysis of these datasets is unsupervised clustering. For very large datasets, efficient algorithms are required to ensure that analyses can be conducted with reasonable time and memory requirements.

Results

Here, we present a highly efficient k-means based approach, and we demonstrate that it scales favorably with the number of cells with regards to time and memory.

Conclusions

We have demonstrated that our streaming k-means clustering algorithm gives state-of-the-art performance while resource requirements scale favorably for up to 2 million cells.

SUBMITTER: Quah FX 

PROVIDER: S-EPMC9743492 | biostudies-literature | 2022 Dec

REPOSITORIES: biostudies-literature

altmetric image

Publications

SC3s: efficient scaling of single cell consensus clustering to millions of cells.

Quah Fu Xiang FX   Hemberg Martin M  

BMC bioinformatics 20221212 1


<h4>Background</h4>Today it is possible to profile the transcriptome of individual cells, and a key step in the analysis of these datasets is unsupervised clustering. For very large datasets, efficient algorithms are required to ensure that analyses can be conducted with reasonable time and memory requirements.<h4>Results</h4>Here, we present a highly efficient k-means based approach, and we demonstrate that it scales favorably with the number of cells with regards to time and memory.<h4>Conclus  ...[more]

Similar Datasets

| S-EPMC7203756 | biostudies-literature
2017-02-16 | GSE79102 | GEO
| S-EPMC5410170 | biostudies-literature
| S-EPMC2533155 | biostudies-literature
| S-EPMC8574980 | biostudies-literature
| S-EPMC10764072 | biostudies-literature
| S-EPMC6823959 | biostudies-literature
| S-EPMC5127768 | biostudies-literature
| S-EPMC8905497 | biostudies-literature
| S-EPMC6311928 | biostudies-literature