Dataset Information


TACO: a general-purpose tool for predicting cell-type-specific transcription factor dimers.

ABSTRACT: BACKGROUND: Cooperative binding of transcription factor (TF) dimers to DNA is increasingly recognized as a major contributor to binding specificity. However, it is likely that the set of known TF dimers is highly incomplete, given that they were discovered using ad hoc approaches, or through computational analyses of limited datasets. RESULTS: Here, we present TACO (Transcription factor Association from Complex Overrepresentation), a general-purpose standalone software tool that takes as input any genome-wide set of regulatory elements and predicts cell-type-specific TF dimers based on enrichment of motif complexes. TACO is the first tool that can accommodate motif complexes composed of overlapping motifs, a characteristic feature of many known TF dimers. Our method comprehensively outperforms existing tools when benchmarked on a reference set of 29 known dimers. We demonstrate the utility and consistency of TACO by applying it to 152 DNase-seq datasets and 94 ChIP-seq datasets. CONCLUSIONS: Based on these results, we uncover a general principle governing the structure of TF-TF-DNA ternary complexes, namely that the flexibility of the complex is correlated with, and most likely a consequence of, inter-motif spacing.

SUBMITTER: Jankowski A 

PROVIDER: S-EPMC4004051 | BioStudies | 2014-01-01T00:00:00Z

REPOSITORIES: biostudies

Similar Datasets

2018-01-01 | S-EPMC6364043 | BioStudies
2013-01-01 | S-EPMC3826502 | BioStudies
1000-01-01 | S-EPMC3106185 | BioStudies
2013-01-01 | S-EPMC3730104 | BioStudies
2014-01-01 | S-EPMC4267662 | BioStudies
2014-01-01 | S-EPMC3957073 | BioStudies
2014-01-01 | S-EPMC4082612 | BioStudies
2016-01-01 | S-EPMC4743414 | BioStudies
2017-01-01 | S-EPMC5389469 | BioStudies
1000-01-01 | S-EPMC4234207 | BioStudies