Genomics

Dataset Information

0

Quantitative modeling of transcription factor binding specificities using DNA shape


ABSTRACT: Accurate predictions of the DNA binding specificities of transcription factors (TFs) are necessary for understanding gene regulatory mechanisms. Traditionally, predictive models are built based on nucleotide sequence features. Here, we employed three- dimensional DNA shape information obtained on a high-throughput basis to integrate intuitive DNA structural features into the modeling of TF binding specificities using support vector regression. We performed quantitative predictions of DNA binding specificities, using the DREAM5 dataset for 65 mouse TFs and genomic-context protein binding microarray data for three human basic helix-loop-helix TFs. DNA shape-augmented models compared favorably with sequence-based models for these predictions. Although both k-mer and DNA shape features encoded the interdependencies between nucleotide positions of the binding site, using DNA shape features reduced the dimensionality of the feature space compared to k-mer use. Finally, analyzing the weights of DNA shape-augmented models uncovered TF family- specific structural readout mechanisms that were not obvious from the nucleotide sequence.

ORGANISM(S): Homo sapiens

PROVIDER: GSE59845 | GEO | 2014/11/04

SECONDARY ACCESSION(S): PRJNA256346

REPOSITORIES: GEO

Similar Datasets

2014-11-04 | E-GEOD-59845 | biostudies-arrayexpress
2023-12-31 | GSE227873 | GEO
2013-03-29 | E-GEOD-44437 | biostudies-arrayexpress
2014-05-08 | GSE50762 | GEO
2014-01-19 | E-GEOD-53776 | biostudies-arrayexpress
2015-01-21 | GSE65073 | GEO
2015-01-21 | E-GEOD-65073 | biostudies-arrayexpress
2013-03-29 | GSE44437 | GEO
2013-03-29 | GSE44436 | GEO
2013-05-17 | E-GEOD-47026 | biostudies-arrayexpress