Dataset Information


Machine learning predicts putative hematopoietic stem cells within large single-cell transcriptomics data sets.

ABSTRACT: Hematopoietic stem cells (HSCs) are an essential source and reservoir for normal hematopoiesis, and their function is compromised in many blood disorders. HSC research has benefitted from the recent development of single-cell molecular profiling technologies, where single-cell RNA sequencing (scRNA-seq) in particular has rapidly become an established method to profile HSCs and related hematopoietic populations. The classic definition of HSCs relies on transplantation assays, which have been used to validate HSC function for cell populations defined by flow cytometry. Flow cytometry information for single cells, however, is not available for many new high-throughput scRNA-seq methods, thus highlighting an urgent need for the establishment of alternative ways to pinpoint the likely HSCs within large scRNA-seq data sets. To address this, we tested a range of machine learning approaches and developed a tool, hscScore, to score single-cell transcriptomes from murine bone marrow based on their similarity to gene expression profiles of validated HSCs. We evaluated hscScore across scRNA-seq data from different laboratories, which allowed us to establish a robust method that functions across different technologies. To facilitate broad adoption of hscScore by the wider hematopoiesis community, we have made the trained model and example code freely available online. In summary, our method hscScore provides fast identification of mouse bone marrow HSCs from scRNA-seq measurements and represents a broadly useful tool for analysis of single-cell gene expression data.
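The idea of scoring each single-cell transcriptome by its similarity to a reference HSC expression profile can be illustrated with a minimal sketch. This is not the hscScore model or its API: the abstract does not specify the trained model, so Pearson correlation is swapped in here as a simple stand-in similarity measure. The names `pearson`, `score_cells`, `reference_profile`, and the toy expression values are all hypothetical.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation between two equal-length expression vectors."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("inputs must have equal length >= 2")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        # A constant vector carries no pattern to compare against.
        return 0.0
    return cov / sqrt(var_x * var_y)


def score_cells(cells, reference_profile):
    """Map each cell id to its similarity to the reference profile.

    Assumption: every cell vector and the reference use the same gene order.
    """
    return {
        cell_id: pearson(expression, reference_profile)
        for cell_id, expression in cells.items()
    }


# Toy, made-up values over four hypothetical genes (not real data).
reference_profile = [5.0, 4.0, 0.5, 0.1]
cells = {
    "cell_A": [4.8, 3.9, 0.7, 0.0],  # resembles the reference
    "cell_B": [0.2, 0.4, 4.5, 5.1],  # roughly the opposite pattern
}

scores = score_cells(cells, reference_profile)
# Rank cells from most to least similar to the reference profile.
ranked = sorted(scores, key=scores.get, reverse=True)
```

In this sketch, cells with higher scores are the more likely candidates. A trained model such as the one described in the abstract would replace the correlation step with a learned scoring function.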

PROVIDER: S-EPMC6900257 | BioStudies

REPOSITORIES: biostudies

Similar Datasets

S-EPMC8700284 | BioStudies
S-EPMC6424521 | BioStudies
S-EPMC6298771 | BioStudies
S-EPMC7653854 | BioStudies
S-EPMC6818972 | BioStudies
S-EPMC8428393 | BioStudies
S-EPMC8581166 | BioStudies
S-EPMC8072066 | BioStudies
S-EPMC7876897 | BioStudies
S-EPMC8428103 | BioStudies