Dataset Information

A synergistic DNA logic predicts genome-wide chromatin accessibility.


ABSTRACT: Enhancers and promoters commonly occur in accessible chromatin characterized by depleted nucleosome contact; however, it is unclear how chromatin accessibility is governed. We show that log-additive cis-acting DNA sequence features can predict chromatin accessibility at high spatial resolution. We develop a new type of high-dimensional machine learning model, the Synergistic Chromatin Model (SCM), which when trained with DNase-seq data for a cell type is capable of predicting expected read counts of genome-wide chromatin accessibility at every base from DNA sequence alone, with the highest accuracy at hypersensitive sites shared across cell types. We confirm that an SCM accurately predicts chromatin accessibility for thousands of synthetic DNA sequences using a novel CRISPR-based method of highly efficient site-specific DNA library integration. SCMs are directly interpretable and reveal that a logic based on local, nonspecific synergistic effects, largely among pioneer TFs, is sufficient to predict a large fraction of cellular chromatin accessibility in a wide variety of cell types.

SUBMITTER: Hashimoto T 

PROVIDER: S-EPMC5052050 | biostudies-literature | 2016 Oct

REPOSITORIES: biostudies-literature


Similar Datasets

| S-EPMC7546623 | biostudies-literature
| S-EPMC7874078 | biostudies-literature
| S-EPMC8386078 | biostudies-literature
2016-07-20 | GSE80105 | GEO
2021-07-04 | PXD026484 | Pride
| S-EPMC6582963 | biostudies-literature