Project description:Genomic mapping of DNA replication origins (ORIs) in mammals provides a powerful means for understanding the regulatory complexity of our genome. Here we combine a genome-wide approach to identify preferential sites of DNA replication initiation at 0.4% of the mouse genome with detailed molecular analysis at distinct classes of ORIs according to their location relative to the genes. Our study reveals that 85% of the replication initiation sites in mouse embryonic stem (ES) cells are associated with transcriptional units. Nearly half of the identified ORIs map at promoter regions and, interestingly, ORI density strongly correlates with promoter density, reflecting the coordinated organisation of replication and transcription in the mouse genome. Detailed analysis of ORI activity showed that CpG island promoter-ORIs are the most efficient ORIs in ES cells and both ORI specification and firing efficiency are maintained across cell types. Remarkably, the distribution of replication initiation sites at promoter-ORIs exactly parallels that of transcription start sites (TSS) suggesting a co-evolution of the regulatory regions driving replication and transcription. Moreover, we found that promoter-ORIs are significantly enriched in CAGE tags derived from early embryos relative to all promoters. This association implies that transcription initiation early in development sets the probability of ORI activation unveiling a new hallmark in ORI efficiency regulation in mammalian cells. Two biological replicates of lambda-exonuclease treated short nascent strands (100-600 or 300-800 nt in length) were co-hybridised with genomic DNA from the same cells to tiled genomic array covering 10.1 Mb of the mouse genome (Agilent Technologies)
Project description:Genomic mapping of DNA replication origins (ORIs) in mammals provides a powerful means for understanding the regulatory complexity of our genome. Here we combine a genome-wide approach to identify preferential sites of DNA replication initiation at 0.4% of the mouse genome with detailed molecular analysis at distinct classes of ORIs according to their location relative to the genes. Our study reveals that 85% of the replication initiation sites in mouse embryonic stem (ES) cells are associated with transcriptional units. Nearly half of the identified ORIs map at promoter regions and, interestingly, ORI density strongly correlates with promoter density, reflecting the coordinated organisation of replication and transcription in the mouse genome. Detailed analysis of ORI activity showed that CpG island promoter-ORIs are the most efficient ORIs in ES cells and both ORI specification and firing efficiency are maintained across cell types. Remarkably, the distribution of replication initiation sites at promoter-ORIs exactly parallels that of transcription start sites (TSS) suggesting a co-evolution of the regulatory regions driving replication and transcription. Moreover, we found that promoter-ORIs are significantly enriched in CAGE tags derived from early embryos relative to all promoters. This association implies that transcription initiation early in development sets the probability of ORI activation unveiling a new hallmark in ORI efficiency regulation in mammalian cells.
Project description:We have devised a method for isolating virtually pure and comprehensive libraries of restriction fragments that contained replication initiation sites (bubbles) in vivo (Mesner et al. 2009). We have now sequenced and mapped the bubble-containing fragments from GM06990, a near-normal EBV-transformed lymphoblastoid cell line, and have compared origin distributions to a comprehensive replication timing study recently published for this cell line. We find that early-firing origins overwhelmingly represent zones, associate only marginally with active transcription units, are localized within large domains of open chromatin, and are significantly associated with DNAseI hypersensitivity. Origin density falls from early- to mid-S-phase, but, surprisingly, rises again in late-S-phase to density levels only 17% lower than in early S-phase. Strikingly, late origin density calculated on the 1Mb scale is robustly correlated with measures of increasing chromatin compaction. While origin efficiency decreases with replication time, the median efficiency in late-replicating, heterochromatic domains is only 25% lower than in early-replicating, euchromatic loci. Thus, the activation of early- and late-firing origins must be regulated by quintessentially different mechanisms. The aggregate data can be unified into a model in which initiation site selection is driven almost entirely by epigenetic factors that fashion both the long-range and local chromatin environments, with underlying DNA sequence and local transcriptional activity playing only minor roles. Importantly, the comprehensive origin map we have prepared for GM06990 overlaps only modestly with origin maps recently reported for the genomes of four different human cell lines based on distributions of small nascent strands. 3 samples of DNA replication (2 technical replicates, 1 biological replicate), 2 samples of poly-A RNA-Seq
Project description:Introgressed variants from other species can be an important source of genetic variation because they may arise rapidly, can include multiple mutations on a single haplotype, and have often been pretested by selection in the species of origin. Although introgressed alleles are generally deleterious, several studies have reported introgression as the source of adaptive alleles-including the rodenticide-resistant variant of Vkorc1 that introgressed from Mus spretus into European populations of Mus musculus domesticus. Here, we conducted bidirectional genome scans to characterize introgressed regions into one wild population of M. spretus from Spain and three wild populations of M. m. domesticus from France, Germany, and Iran. Despite the fact that these species show considerable intrinsic postzygotic reproductive isolation, introgression was observed in all individuals, including in the M. musculus reference genome (GRCm38). Mus spretus individuals had a greater proportion of introgression compared with M. m. domesticus, and within M. m. domesticus, the proportion of introgression decreased with geographic distance from the area of sympatry. Introgression was observed on all autosomes for both species, but not on the X-chromosome in M. m. domesticus, consistent with known X-linked hybrid sterility and inviability genes that have been mapped to the M. spretus X-chromosome. Tract lengths were generally short with a few outliers of up to 2.7 Mb. Interestingly, the longest introgressed tracts were in olfactory receptor regions, and introgressed tracts were significantly enriched for olfactory receptor genes in both species, suggesting that introgression may be a source of functional novelty even between species with high barriers to gene flow.
Project description:Translational research is commonly performed in the C57B6/J mouse strain, chosen for its genetic homogeneity and phenotypic uniformity. Here, we evaluate the suitability of the white-footed deer mouse (Peromyscus leucopus) as a model organism for aging research, offering a comparative analysis against C57B6/J and diversity outbred (DO) Mus musculus strains. Our study includes comparisons of body composition, skeletal muscle function, and cardiovascular parameters, shedding light on potential applications and limitations of P. leucopus in aging studies. Notably, P. leucopus exhibits distinct body composition characteristics, emphasizing reduced muscle force exertion and a unique metabolism, particularly in fat mass. Cardiovascular assessments showed changes in arterial stiffness, challenging conventional assumptions and highlighting the need for a nuanced interpretation of aging-related phenotypes. Our study also highlights inherent challenges associated with maintaining and phenotyping P. leucopus cohorts. Behavioral considerations, including anxiety-induced responses during handling and phenotyping assessment, pose obstacles in acquiring meaningful data. Moreover, the unique anatomy of P. leucopus necessitates careful adaptation of protocols designed for Mus musculus. While showcasing potential benefits, further extensive analyses across broader age ranges and larger cohorts are necessary to establish the reliability of P. leucopus as a robust and translatable model for aging studies.
Project description:Mammalian chromosome replication starts from distinct sites, but the principles governing initiation site selection are unclear because proteins essential for DNA replication do not exhibit sequence-specific DNA binding. We identified a replication initiation determinant (RepID) protein that binds a subset of replication initiation sites. A large fraction of RepID binding sites share a common G-rich motif and exhibit elevated replication initiation. RepID is required for initiation of DNA replication from Rep-ID bound replication origins, including the origin at the human beta-globin (HBB) locus. At HBB, RepID is involved in an interaction between the replication origin (Rep-P) and the locus control region. RepID depleted murine embryonic fibroblasts exhibit abnormal replication fork progression and fewer replication initiation events. These observations are consistent with a model suggesting that RepID facilitates replication initiation at a distinct group of human replication origins. Nascent strands were purified with the lambda exonuclease methods from HCT116 cells and sequenced. Chromatin from unsyncrhonized untreated cultures of U2OS cells was subjected to ChIP-Seq with antibody directed against RepID/PHIP
Project description:Mammalian chromosome replication starts from distinct sites, but the principles governing initiation site selection are unclear because proteins essential for DNA replication do not exhibit sequence-specific DNA binding. We identified a replication initiation determinant (RepID) protein that binds a subset of replication initiation sites. A large fraction of RepID binding sites share a common G-rich motif and exhibit elevated replication initiation. RepID is required for initiation of DNA replication from Rep-ID bound replication origins, including the origin at the human beta-globin (HBB) locus. At HBB, RepID is involved in an interaction between the replication origin (Rep-P) and the locus control region. RepID depleted murine embryonic fibroblasts exhibit abnormal replication fork progression and fewer replication initiation events. These observations are consistent with a model suggesting that RepID facilitates replication initiation at a distinct group of human replication origins.
Project description:BackgroundCopy number variation is an important dimension of genetic diversity and has implications in development and disease. As an important model organism, the mouse is a prime candidate for copy number variant (CNV) characterization, but this has yet to be completed for a large sample size. Here we report CNV analysis of publicly available, high-density microarray data files for 351 mouse tail samples, including 290 mice that had not been characterized for CNVs previously.ResultsWe found 9634 putative autosomal CNVs across the samples affecting 6.87% of the mouse reference genome. We find significant differences in the degree of CNV uniqueness (single sample occurrence) and the nature of CNV-gene overlap between wild-caught mice and classical laboratory strains. CNV-gene overlap was associated with lipid metabolism, pheromone response and olfaction compared to immunity, carbohydrate metabolism and amino-acid metabolism for wild-caught mice and classical laboratory strains, respectively. Using two subspecies of wild-caught Mus musculus, we identified putative CNVs unique to those subspecies and show this diversity is better captured by wild-derived laboratory strains than by the classical laboratory strains. A total of 9 genic copy number variable regions (CNVRs) were selected for experimental confirmation by droplet digital PCR (ddPCR).ConclusionThe analysis we present is a comprehensive, genome-wide analysis of CNVs in Mus musculus, which increases the number of known variants in the species and will accelerate the identification of novel variants in future studies.
Project description:BackgroundLong terminal repeat (LTR) retrotransposons make up a large fraction of the typical mammalian genome. They comprise about 8% of the human genome and approximately 10% of the mouse genome. On account of their abundance, LTR retrotransposons are believed to hold major significance for genome structure and function. Recent advances in genome sequencing of a variety of model organisms has provided an unprecedented opportunity to evaluate better the diversity of LTR retrotransposons resident in eukaryotic genomes.ResultsUsing a new data-mining program, LTR_STRUC, in conjunction with conventional techniques, we have mined the GenBank mouse (Mus musculus) database and the more complete Ensembl mouse dataset for LTR retrotransposons. We report here that the M. musculus genome contains at least 21 separate families of LTR retrotransposons; 13 of these families are described here for the first time.ConclusionsAll families of mouse LTR retrotransposons are members of the gypsy-like superfamily of retroviral-like elements. Several different families of unrelated non-autonomous elements were identified, suggesting that the evolution of non-autonomy may be a common event. High sequence similarity between several LTR retrotransposons identified in this study and those found in distantly-related species suggests that horizontal transfer has been a significant factor in the evolution of mouse LTR retrotransposons.
Project description:Here we report the expansion of the genetic code of Mus musculus with various unnatural amino acids including Nɛ-acetyl-lysine. Stable integration of transgenes encoding an engineered Nɛ-acetyl-lysyl-tRNA synthetase (AcKRS)/tRNAPyl pair into the mouse genome enables site-specific incorporation of unnatural amino acids into a target protein in response to the amber codon. We demonstrate temporal and spatial control of protein acetylation in various organs of the transgenic mouse using a recombinant green fluorescent protein (GFPuv) as a model protein. This strategy will provide a powerful tool for systematic in vivo study of cellular proteins in the most commonly used mammalian model organism for human physiology and disease.