Unknown

Dataset Information

0

On the identification of differentially-active transcription factors from ATAC-seq data.


ABSTRACT: ATAC-seq has emerged as a rich epigenome profiling technique, and is commonly used to identify Transcription Factors (TFs) underlying given phenomena. A number of methods can be used to identify differentially-active TFs through the accessibility of their DNA-binding motif, however little is known on the best approaches for doing so. Here we benchmark several such methods using a combination of curated datasets with various forms of short-term perturbations on known TFs, as well as semi-simulations. We include both methods specifically designed for this type of data as well as some that can be repurposed for it. We also investigate variations to these methods, and identify three particularly promising approaches (chromVAR-limma with critical adjustments, monaLisa and a combination of GC smooth quantile normalization and multivariate modeling). We further investigate the specific use of nucleosome-free fragments, the combination of top methods, and the impact of technical variation. Finally, we illustrate the use of the top methods on a novel dataset to characterize the impact on DNA accessibility of TRAnscription Factor TArgeting Chimeras (TRAFTAC), which can deplete TFs - in our case NFkB - at the protein level.

SUBMITTER: Gerbaldo F 

PROVIDER: S-EPMC10942475 | biostudies-literature | 2024 Mar

REPOSITORIES: biostudies-literature

altmetric image

Publications

On the identification of differentially-active transcription factors from ATAC-seq data.

Gerbaldo Felix Ezequiel FE   Sonder Emanuel E   Fischer Vincent V   Frei Selina S   Wang Jiayi J   Gapp Katharina K   Robinson Mark D MD   Germain Pierre-Luc PL  

bioRxiv : the preprint server for biology 20240820


ATAC-seq has emerged as a rich epigenome profiling technique, and is commonly used to identify Transcription Factors (TFs) underlying given phenomena. A number of methods can be used to identify differentially-active TFs through the accessibility of their DNA-binding motif, however little is known on the best approaches for doing so. Here we benchmark several such methods using a combination of curated datasets with various forms of short-term perturbations on known TFs, as well as semi-simulati  ...[more]

Similar Datasets

| S-EPMC11534267 | biostudies-literature
| S-EPMC11039736 | biostudies-literature
| S-EPMC6391789 | biostudies-literature
| S-EPMC6099720 | biostudies-literature
| S-EPMC10950023 | biostudies-literature
| S-EPMC10070275 | biostudies-literature
| S-EPMC5562042 | biostudies-literature
| S-EPMC7405637 | biostudies-literature
| S-EPMC4334524 | biostudies-literature
| S-EPMC8883642 | biostudies-literature