Genomics

Dataset Information


Nucleotide composition-linked divergence of vertebrate core promoter architecture


ABSTRACT: Transcription initiation involves the recruitment of basal transcription factors to the core promoter. A variety of core promoter elements exist; however, for most of these motifs the distribution across species is unknown. Here we report a comparison of human and amphibian promoter sequences. We used oligo-capping in combination with deep sequencing to determine transcription start sites in Xenopus tropicalis. To systematically predict regulatory elements, we developed a de novo motif-finding pipeline using an ensemble of computational tools. A comprehensive comparison of human and amphibian promoter sequences revealed both similarities and differences in core promoter architecture. Some of the differences stem from a highly divergent nucleotide composition of Xenopus and human promoters. Whereas the distribution of some core promoter motifs is conserved independently of species-specific nucleotide bias, the frequency of another class of motifs correlates with the single-nucleotide frequencies. This class includes the well-known TATA box and SP1 motifs, which are more abundant in Xenopus and human promoters, respectively. While these motifs are enriched above the local nucleotide background in both organisms, their frequency varies in step with this background. These differences are likely adaptive, as these motifs can recruit TFIID to either CpG island or sharply initiating promoters. Our results highlight both conserved and diverged aspects of vertebrate transcription, most notably showing co-opted motif usage to recruit the transcriptional machinery to promoters with diverging nucleotide composition. This shows how sweeping changes in nucleotide composition are compatible with highly conserved mechanisms of transcription initiation.
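The notion of a motif being "enriched above the local nucleotide background" can be illustrated with a small calculation. The sketch below is not the pipeline used in the study. It assumes an independent single-nucleotide background model, scans one strand only, and uses exact string matches. The function names and the consensus strings "TATAAA" (TATA-box-like) and "GGGCGG" (GC-box-like, as bound by SP1) are illustrative assumptions, not values taken from the dataset.

```python
from collections import Counter


def base_frequencies(seqs):
    """Single-nucleotide frequencies over all sequences (A, C, G, T only)."""
    counts = Counter()
    for s in seqs:
        counts.update(b for b in s.upper() if b in "ACGT")
    total = sum(counts.values())
    return {b: counts[b] / total for b in "ACGT"}


def expected_motif_rate(motif, freqs):
    """Per-position probability of the motif if bases were independent."""
    p = 1.0
    for b in motif.upper():
        p *= freqs[b]
    return p


def observed_motif_rate(motif, seqs):
    """Fraction of windows (overlapping, forward strand) matching the motif."""
    motif = motif.upper()
    k = len(motif)
    hits = 0
    windows = 0
    for s in seqs:
        s = s.upper()
        n = max(len(s) - k + 1, 0)
        windows += n
        hits += sum(1 for i in range(n) if s[i:i + k] == motif)
    return hits / windows if windows else 0.0


def enrichment(motif, seqs):
    """Observed / expected motif rate; NaN when the expectation is zero."""
    exp = expected_motif_rate(motif, base_frequencies(seqs))
    if exp == 0:
        return float("nan")
    return observed_motif_rate(motif, seqs) / exp


if __name__ == "__main__":
    # Hypothetical toy sequences, not real promoters.
    at_rich = ["TTATAAAAGTCATTATAAATTA", "AATTTATAAAGCATTTAAAT"]
    gc_rich = ["GCGGGCGGGGCCGCATGGGCGG", "CCGCGGGCGGAGCGCCGTAT"]
    for name, seqs in (("AT-rich", at_rich), ("GC-rich", gc_rich)):
        for motif in ("TATAAA", "GGGCGG"):
            print(name, motif, enrichment(motif, seqs))
```

Under this model the expected rate of an AT-rich motif rises with the A and T content of the sequence set, so a motif can be more frequent in one species in absolute terms while its enrichment ratio over background remains above 1 in both.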

ORGANISM(S): Xenopus tropicalis

PROVIDER: GSE21482 | GEO | 2010/12/30

SECONDARY ACCESSION(S): PRJNA126041

REPOSITORIES: GEO

Similar Datasets

Date | Accession | Repository
| E-GEOD-21482 | biostudies-arrayexpress
2017-01-01 | GSE68677 | GEO
| PRJNA126041 | ENA
| E-GEOD-19562 | biostudies-arrayexpress
| E-GEOD-36898 | biostudies-arrayexpress
2009-12-19 | GSE19562 | GEO
2015-04-23 | GSE68168 | GEO
2012-03-29 | GSE36898 | GEO
2018-08-08 | GSE118242 | GEO
2021-02-01 | GSE130798 | GEO