Genomics

Dataset Information

Prediction of on-target and off-target activity of CRISPR-Cas13d guide RNAs using deep learning


ABSTRACT: Transcriptome engineering applications in living cells with RNA-targeting CRISPR effectors depend on accurate prediction of on-target activity and off-target avoidance. Here, we design and test ~200,000 RfxCas13d guide RNAs targeting essential genes in human cells with systematically-designed mismatches, insertions and deletions (indels). We find that mismatches and indels have a position- and context-dependent impact on Cas13d activity, and mismatches that result in G:U wobble pairings are better tolerated than other single-base mismatches. Using this large-scale dataset, we train a convolutional neural network that we term TIGER (Targeted Inhibition of Gene Expression via gRNA design) to predict efficacy from guide sequence and context. TIGER outperforms existing models at predicting on- and off-target activity on our dataset and published datasets. We show that TIGER scoring combined with specific mismatches yields the first general framework to modulate transcript expression, enabling use of RNA-targeting CRISPRs to precisely control gene dosage.

ORGANISM(S): synthetic construct, Homo sapiens

PROVIDER: GSE232228 | GEO | 2023/05/12

REPOSITORIES: GEO

Similar Datasets

2022-02-10 | GSE193667 | GEO
2022-07-01 | GSE199542 | GEO
2020-01-30 | GSE142675 | GEO
2023-01-14 | GSE222451 | GEO
2015-03-04 | E-GEOD-61099 | biostudies-arrayexpress
2012-06-29 | E-GEOD-33021 | biostudies-arrayexpress
2024-03-21 | E-MTAB-12748 | biostudies-arrayexpress
2022-03-03 | E-MTAB-11497 | biostudies-arrayexpress
2022-02-10 | GSE193666 | GEO
2014-05-18 | GSE55887 | GEO