Genomics

Dataset Information

0

Positional Interpretation of Cis-Regulatory Code and Nucleosome Organization with Deep Learning Models


ABSTRACT: Sequence-to-function neural networks learn cis-regulatory sequence rules driving many types of genomic data. Interpreting these models to relate the sequence rules to underlying biological processes remains challenging, especially for complex genomic readouts such as MNase-seq, which maps nucleosome occupancy but is confounded by experimental bias. We introduce pairwise influence by sequence attribution (PISA), an interpretation tool that combinatorially decodes which bases contributed to the readout at a specific genomic coordinate. PISA visualizes the effects of transcription factor motifs, detects undiscovered motifs with complex contribution patterns, and reveals experimental biases. By learning the bias for MNase-seq, PISA enables unprecedented nucleosome prediction models, allowing the \emph{de novo} discovery of nucleosome-positioning motifs and their long-range chromatin effects, as well as the design of sequences with altered nucleosome configurations. These results show that PISA is a versatile tool that expands our ability to train and interpret sequence-to-function neural networks on genomics data and understand the underlying cis-regulatory code.

ORGANISM(S): Saccharomyces cerevisiae

PROVIDER: GSE313524 | GEO | 2025/12/12

REPOSITORIES: GEO

Dataset's files

Source:
Action DRS
Other
Items per page:
1 - 1 of 1

Similar Datasets

2017-02-26 | GSE70122 | GEO
| PRJEB2854 | ENA
2017-11-28 | GSE94313 | GEO
2015-11-05 | E-GEOD-73337 | biostudies-arrayexpress
2016-04-08 | GSE57558 | GEO
2016-04-08 | GSE57556 | GEO
2016-04-08 | GSE57557 | GEO
2010-08-20 | E-GEOD-23712 | biostudies-arrayexpress
2019-02-12 | GSE125053 | GEO
2019-08-27 | GSE128689 | GEO