Unknown

Dataset Information

0

The mutation spectrum in genomic late replication domains shapes mammalian GC content.


ABSTRACT: Genome sequence compositions and epigenetic organizations are correlated extensively across multiple length scales. Replication dynamics, in particular, is highly correlated with GC content. We combine genome-wide time of replication (ToR) data, topological domains maps and detailed functional epigenetic annotations to study the correlations between replication timing and GC content at multiple scales. We find that the decrease in genomic GC content at large scale late replicating regions can be explained by mutation bias favoring A/T nucleotide, without selection or biased gene conversion. Quantification of the free dNTP pool during the cell cycle is consistent with a mechanism involving replication-coupled mutation spectrum that favors AT nucleotides at late S-phase. We suggest that mammalian GC content composition is shaped by independent forces, globally modulating mutation bias and locally selecting on functional element. Deconvoluting these forces and analyzing them on their native scales is important for proper characterization of complex genomic correlations.

SUBMITTER: Kenigsberg E 

PROVIDER: S-EPMC4872117 | biostudies-literature | 2016 May

REPOSITORIES: biostudies-literature

altmetric image

Publications

The mutation spectrum in genomic late replication domains shapes mammalian GC content.

Kenigsberg Ephraim E   Yehuda Yishai Y   Marjavaara Lisette L   Keszthelyi Andrea A   Chabes Andrei A   Tanay Amos A   Simon Itamar I  

Nucleic acids research 20160416 9


Genome sequence compositions and epigenetic organizations are correlated extensively across multiple length scales. Replication dynamics, in particular, is highly correlated with GC content. We combine genome-wide time of replication (ToR) data, topological domains maps and detailed functional epigenetic annotations to study the correlations between replication timing and GC content at multiple scales. We find that the decrease in genomic GC content at large scale late replicating regions can be  ...[more]

Similar Datasets

| S-EPMC6944446 | biostudies-literature
| S-EPMC6629674 | biostudies-literature
| S-EPMC6144785 | biostudies-literature
| S-EPMC2936529 | biostudies-literature
| S-EPMC8153448 | biostudies-literature
| S-EPMC4191780 | biostudies-literature
2012-06-12 | GSE38562 | GEO
| S-EPMC9272728 | biostudies-literature
2018-06-26 | GSE109804 | GEO
2012-06-11 | E-GEOD-38562 | biostudies-arrayexpress