Proteomics

Dataset Information

0

Dynamic regulatory grammar of human promoters uncovered by MPRA-trained deep learning


ABSTRACT: One of the major challenges in genomics is to build computational models that accurately predict genome-wide gene expression from the sequences of regulatory elements. Promoters play a key role in gene regulation, yet their regulatory logic remains incompletely understood. Here, we present PARM, a cell-type specific deep learning model trained on specially designed massively parallel reporter assays that query human promoter sequences. PARM is computationally light-weight and reliably predicts autonomous promoter activity across the genome from DNA sequence alone, in multiple cell types. PARM can also design purely synthetic strong promoters. We leveraged PARM to systematically identify transcription factor (TF) binding sites that likely to contribute to the activity of each natural human promoter, and to detect the rewiring of these regulatory interactions upon various stimuli to the cells. We also uncovered and experimentally confirmed striking positional preferences of TFs that differ between activating and repressive regulatory functions, as well as a complex grammar of motif-motif interactions. Our approach provides a foundation towards a deeper understanding of the dynamic regulation of human promoters by TFs.

INSTRUMENT(S):

ORGANISM(S): Homo Sapiens (human)

TISSUE(S): Blood Cell, Cell Suspension Culture

DISEASE(S): Acute Leukemia

SUBMITTER: Miguel Hernandez Quiles  

LAB HEAD: Prof. Dr. Michiel Vermeulen

PROVIDER: PXD072017 | Pride | 2026-01-14

REPOSITORIES: Pride

Similar Datasets

2025-07-01 | GSE301246 | GEO
2012-12-31 | E-GEOD-40215 | biostudies-arrayexpress
2019-01-08 | GSE117594 | GEO
2021-06-16 | PXD024794 | Pride
2021-07-27 | GSE179255 | GEO
2021-07-27 | GSE179254 | GEO
2022-02-07 | GSE168328 | GEO
2022-04-29 | GSE196451 | GEO
2017-01-01 | GSE68677 | GEO
2022-10-13 | PXD030973 | Pride