Dataset Information

Synthetic DNA barcodes identify singlets in scRNA-seq datasets and evaluate doublet algorithms.


ABSTRACT: Single-cell RNA sequencing (scRNA-seq) datasets contain true single cells, or singlets, in addition to cells that coalesce during the protocol, or doublets. Identifying singlets with high fidelity in scRNA-seq is necessary to avoid false negative and false positive discoveries. Although several methodologies have been proposed, they are typically tested on highly heterogeneous datasets and lack a priori knowledge of true singlets. Here, we leveraged datasets with synthetically introduced DNA barcodes for a hitherto unexplored application: to extract ground-truth singlets. We demonstrated the feasibility of our framework, "singletCode," to evaluate existing doublet detection methods across a range of contexts. We also leveraged our ground-truth singlets to train a proof-of-concept machine learning classifier, which outperformed other doublet detection algorithms. Our integrative framework can identify ground-truth singlets and enable robust doublet detection in non-barcoded datasets.
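EXAMPLE: The abstract describes using ground-truth singlets to evaluate doublet detection methods. The sketch below shows one way such a comparison could be scored, assuming each cell already carries a barcode-derived ground-truth label and a label predicted by some detection method. The function name, the label strings, and the toy cell IDs are hypothetical and are not taken from singletCode; the choice of sensitivity and specificity as metrics is also an assumption, not the authors' stated benchmark.

```python
from collections import Counter


def evaluate_doublet_calls(truth, predicted):
    """Score predicted labels against ground-truth labels.

    Both arguments map a cell ID to "singlet" or "doublet".
    "doublet" is treated as the positive class (an assumption).
    """
    counts = Counter()
    for cell_id, true_label in truth.items():
        pred_label = predicted[cell_id]
        if true_label == "doublet":
            counts["tp" if pred_label == "doublet" else "fn"] += 1
        else:
            counts["fp" if pred_label == "doublet" else "tn"] += 1

    tp, fn, fp, tn = (counts[k] for k in ("tp", "fn", "fp", "tn"))
    # Guard against empty classes so the ratios never divide by zero.
    sensitivity = tp / (tp + fn) if (tp + fn) else float("nan")
    specificity = tn / (tn + fp) if (tn + fp) else float("nan")
    return {
        "tp": tp, "fn": fn, "fp": fp, "tn": tn,
        "sensitivity": sensitivity,
        "specificity": specificity,
    }


# Toy, made-up labels for five cells (not real data).
truth = {"c1": "singlet", "c2": "singlet", "c3": "doublet",
         "c4": "doublet", "c5": "singlet"}
predicted = {"c1": "singlet", "c2": "doublet", "c3": "doublet",
             "c4": "singlet", "c5": "singlet"}

result = evaluate_doublet_calls(truth, predicted)
print(result)
```

With these toy labels, one doublet is caught and one is missed, and one singlet is wrongly flagged, so sensitivity is 0.5 and specificity is 2/3.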

SUBMITTER: Zhang Z 

PROVIDER: S-EPMC11293576 | biostudies-literature | 2024 Jul

REPOSITORIES: biostudies-literature

Publications

Synthetic DNA barcodes identify singlets in scRNA-seq datasets and evaluate doublet algorithms.

Ziyang Zhang, Madeline E. Melzer, Keerthana M. Arun, Hanxiao Sun, Carl-Johan Eriksson, Itai Fabian, Sagi Shaashua, Karun Kiani, Yaara Oren, Yogesh Goyal

Cell Genomics, 2024 Jun 25, issue 7


Similar Datasets

| S-EPMC8262260 | biostudies-literature
| S-EPMC7739457 | biostudies-literature
| S-EPMC7549635 | biostudies-literature
| S-EPMC10980719 | biostudies-literature
| S-EPMC8439043 | biostudies-literature
| S-EPMC11471582 | biostudies-literature
| S-EPMC6594080 | biostudies-literature
| S-EPMC7791999 | biostudies-literature
| S-EPMC10638921 | biostudies-literature
| S-EPMC6868346 | biostudies-literature