Unknown

Dataset Information

0

Machine-learning-optimized Cas12a barcoding enables the recovery of single-cell lineages and transcriptional profiles.


ABSTRACT: The development of CRISPR-based barcoding methods creates an exciting opportunity to understand cellular phylogenies. We present a compact, tunable, high-capacity Cas12a barcoding system called dual acting inverted site array (DAISY). We combined high-throughput screening and machine learning to predict and optimize the 60-bp DAISY barcode sequences. After optimization, top-performing barcodes had ∼10-fold increased capacity relative to the best random-screened designs and performed reliably across diverse cell types. DAISY barcode arrays generated ∼12 bits of entropy and ∼66,000 unique barcodes. Thus, DAISY barcodes-at a fraction of the size of Cas9 barcodes-achieved high-capacity barcoding. We coupled DAISY barcoding with single-cell RNA-seq to recover lineages and gene expression profiles from ∼47,000 human melanoma cells. A single DAISY barcode recovered up to ∼700 lineages from one parental cell. This analysis revealed heritable single-cell gene expression and potential epigenetic modulation of memory gene transcription. Overall, Cas12a DAISY barcoding is an efficient tool for investigating cell-state dynamics.

SUBMITTER: Hughes NW 

PROVIDER: S-EPMC10599400 | biostudies-literature | 2022 Aug

REPOSITORIES: biostudies-literature

altmetric image

Publications

Machine-learning-optimized Cas12a barcoding enables the recovery of single-cell lineages and transcriptional profiles.

Hughes Nicholas W NW   Qu Yuanhao Y   Zhang Jiaqi J   Tang Weijing W   Pierce Justin J   Wang Chengkun C   Agrawal Aditi A   Morri Maurizio M   Neff Norma N   Winslow Monte M MM   Wang Mengdi M   Cong Le L  

Molecular cell 20220624 16


The development of CRISPR-based barcoding methods creates an exciting opportunity to understand cellular phylogenies. We present a compact, tunable, high-capacity Cas12a barcoding system called dual acting inverted site array (DAISY). We combined high-throughput screening and machine learning to predict and optimize the 60-bp DAISY barcode sequences. After optimization, top-performing barcodes had ∼10-fold increased capacity relative to the best random-screened designs and performed reliably acr  ...[more]

Similar Datasets

2024-12-27 | GSE246294 | GEO
| S-EPMC5417821 | biostudies-literature
| PRJNA1032463 | ENA
| S-EPMC10102853 | biostudies-literature
| S-EPMC10581463 | biostudies-literature
| S-EPMC11463589 | biostudies-literature
| S-EPMC11797022 | biostudies-literature
| S-EPMC10403974 | biostudies-literature
| S-EPMC6858556 | biostudies-literature
| S-EPMC6849489 | biostudies-literature