Genomics

Dataset Information

0

Determination and Inference of Eukaryotic Transcription Factor Sequence Specificity


ABSTRACT: The DNA sequence preferences of the vast majority of eukaryotic transcription factors (TFs) are unknown. Using an approach designed to broadly sample both DNA-binding domain types and eukaryotic clades, we have determined DNA-binding motifs for 1,033 TFs from 131 diverse eukaryotes, encompassing 54 domain types. Closely related orthologs and paralogs typically have very similar sequence preferences; this property allows inference of motifs for roughly one third of the 166,851 known or predicted eukaryotic TFs. While the origins of most motifs can be dated to hundreds of millions of years ago, we also characterize more recent TF expansions. Sequences matching the motifs are enriched upstream of TSS in most eukaryotic lineages, and at informative eQTL SNPs in Arabidopsis promoters, demonstrating their utility in mapping transcriptional networks. The motifs are housed at http://cisbp.ccbr.utoronto.ca

ORGANISM(S): synthetic construct

PROVIDER: GSE53348 | GEO | 2014/08/01

SECONDARY ACCESSION(S): PRJNA232033

REPOSITORIES: GEO

Similar Datasets

2014-08-01 | E-GEOD-53348 | biostudies-arrayexpress
2012-12-13 | E-GEOD-42864 | biostudies-arrayexpress
2012-12-13 | GSE42864 | GEO
2023-10-04 | GSE244409 | GEO
2023-10-04 | GSE244408 | GEO
2023-10-04 | GSE244410 | GEO
2015-04-23 | E-GEOD-65719 | biostudies-arrayexpress
2021-02-09 | GSE157085 | GEO
2013-03-29 | GSE44437 | GEO
2013-03-29 | GSE44436 | GEO