Project description:BACKGROUND:Identification of protein-protein interactions is an important first step to understand living systems. High-throughput experimental approaches have accumulated large amount of information on protein-protein interactions in human and other model organisms. Such interaction information has been successfully transferred to other species, in which the experimental data are limited. However, the annotation transfer method could yield false positive interologs due to the lack of conservation of interactions when applied to phylogenetically distant organisms. RESULTS:To address this issue, we used phylogenetic profile method to filter false positives in interologs based on the notion that evolutionary conserved interactions show similar patterns of occurrence along the genomes. The approach was applied to Mus musculus, in which the experimentally identified interactions are limited. We first inferred the protein-protein interactions in Mus musculus by using two approaches: i) identifying mouse orthologs of interacting proteins (interologs) based on the experimental protein-protein interaction data from other organisms; and ii) analyzing frequency of mouse ortholog co-occurrence in predicted operons of bacteria. We then filtered possible false-positives in the predicted interactions using the phylogenetic profiles. We found that this filtering method significantly increased the frequency of interacting protein-pairs coexpressed in the same cells/tissues in gene expression omnibus (GEO) database as well as the frequency of interacting protein-pairs shared the similar Gene Ontology (GO) terms for biological processes and cellular localizations. The data supports the notion that phylogenetic profile helps to reduce the number of false positives in interologs. CONCLUSION:We have developed protein-protein interaction database in mouse, which contains 41109 interologs. We have also developed a web interface to facilitate the use of database http://lgsun.grc.nia.nih.gov/mppi/.
Project description:BACKGROUND: Long terminal repeat (LTR) retrotransposons make up a large fraction of the typical mammalian genome. They comprise about 8% of the human genome and approximately 10% of the mouse genome. On account of their abundance, LTR retrotransposons are believed to hold major significance for genome structure and function. Recent advances in genome sequencing of a variety of model organisms has provided an unprecedented opportunity to evaluate better the diversity of LTR retrotransposons resident in eukaryotic genomes. RESULTS: Using a new data-mining program, LTR_STRUC, in conjunction with conventional techniques, we have mined the GenBank mouse (Mus musculus) database and the more complete Ensembl mouse dataset for LTR retrotransposons. We report here that the M. musculus genome contains at least 21 separate families of LTR retrotransposons; 13 of these families are described here for the first time. CONCLUSIONS: All families of mouse LTR retrotransposons are members of the gypsy-like superfamily of retroviral-like elements. Several different families of unrelated non-autonomous elements were identified, suggesting that the evolution of non-autonomy may be a common event. High sequence similarity between several LTR retrotransposons identified in this study and those found in distantly-related species suggests that horizontal transfer has been a significant factor in the evolution of mouse LTR retrotransposons.
Project description:Copy number variation is an important dimension of genetic diversity and has implications in development and disease. As an important model organism, the mouse is a prime candidate for copy number variant (CNV) characterization, but this has yet to be completed for a large sample size. Here we report CNV analysis of publicly available, high-density microarray data files for 351 mouse tail samples, including 290 mice that had not been characterized for CNVs previously.We found 9634 putative autosomal CNVs across the samples affecting 6.87% of the mouse reference genome. We find significant differences in the degree of CNV uniqueness (single sample occurrence) and the nature of CNV-gene overlap between wild-caught mice and classical laboratory strains. CNV-gene overlap was associated with lipid metabolism, pheromone response and olfaction compared to immunity, carbohydrate metabolism and amino-acid metabolism for wild-caught mice and classical laboratory strains, respectively. Using two subspecies of wild-caught Mus musculus, we identified putative CNVs unique to those subspecies and show this diversity is better captured by wild-derived laboratory strains than by the classical laboratory strains. A total of 9 genic copy number variable regions (CNVRs) were selected for experimental confirmation by droplet digital PCR (ddPCR).The analysis we present is a comprehensive, genome-wide analysis of CNVs in Mus musculus, which increases the number of known variants in the species and will accelerate the identification of novel variants in future studies.
Project description:Here we report the expansion of the genetic code of Mus musculus with various unnatural amino acids including N?-acetyl-lysine. Stable integration of transgenes encoding an engineered N?-acetyl-lysyl-tRNA synthetase (AcKRS)/tRNAPyl pair into the mouse genome enables site-specific incorporation of unnatural amino acids into a target protein in response to the amber codon. We demonstrate temporal and spatial control of protein acetylation in various organs of the transgenic mouse using a recombinant green fluorescent protein (GFPuv) as a model protein. This strategy will provide a powerful tool for systematic in vivo study of cellular proteins in the most commonly used mammalian model organism for human physiology and disease.
Project description:The mammalian vomeronasal organ (VNO) expresses two G-protein coupled receptor gene families that mediate pheromone responses, the V1R and V2R receptor genes. In rodents, there are ~150 V1R genes comprising 12 subfamilies organized in gene clusters at multiple chromosomal locations. Previously, we showed that several of these subfamilies had been extensively modulated by gene duplications, deletions, and gene conversions around the time of the evolutionary split of the mouse and rat lineages, consistent with the hypothesis that V1R repertoires might be involved in reinforcing speciation events. Here, we generated genome sequence for one large cluster containing two V1R subfamilies in Mus spretus, a closely related and sympatric species to Mus musculus, and investigated evolutionary change in these repertoires along the two mouse lineages.We describe a comparison of spretus and musculus with respect to genome organization and synteny, as well as V1R gene content and phylogeny, with reference to previous observations made between mouse and rat. Unlike the mouse-rat comparisons, synteny seems to be largely conserved between the two mouse species. Disruption of local synteny is generally associated with differences in repeat content, although these differences appear to arise more from deletion than new integrations. Even though unambiguous V1R orthology is evident, we observe dynamic modulation of the functional repertoires, with two of seven V1Rb and one of eleven V1Ra genes lost in spretus, two V1Ra genes becoming pseudogenes in musculus, two additional orthologous pairs apparently subject to strong adaptive selection, and another divergent orthologous pair that apparently was subjected to gene conversion.Therefore, eight of the 18 (~44%) presumptive V1Ra/V1Rb genes in the musculus-spretus ancestor appear to have undergone functional modulation since these two species diverged. As compared to the rat-mouse split, where modulation is evident by independent expansions of these two V1R subfamilies, divergence between musculus and spretus has arisen more by mutations within coding sequences. These results support the hypothesis that adaptive changes in functional V1R repertoires contribute to the delineation of very closely related species.
Project description:House mice (Mus musculus) emit ultrasonic vocalizations (USVs), which are surprisingly complex and have features of bird song, but their functions are not well understood. Previous studies have reported mixed evidence on whether there are sex differences in USV emission, though vocalization rate or other features may depend upon whether potential receivers are of the same or opposite sex. We recorded the USVs of wild-derived adult house mice (F1 of wild-caught Mus musculus musculus), and we compared the vocalizations of males and females in response to a stimulus mouse of the same- or opposite-sex. To detect and quantify vocalizations, we used an algorithm that automatically detects USVs (Automatic Mouse Ultrasound Detector or A-MUD). We found high individual variation in USV emission rates (4 to 2083 elements/10 min trial) and a skewed distribution, with most mice (60%) emitting few (?50) elements. We found no differences in the rates of calling between the sexes overall, but mice of both sexes emitted vocalizations at a higher rate and higher frequencies during opposite- compared to same-sex interactions. We also observed a trend toward higher amplitudes by males when presented with a male compared to a female stimulus. Our results suggest that mice modulate the rate and frequency of vocalizations depending upon the sex of potential receivers.
Project description:Several years after sequencing the human genome and the mouse genome, much remains to be discovered about the functions of most human and mouse genes. Computational prediction of gene function promises to help focus limited experimental resources on the most likely hypotheses. Several algorithms using diverse genomic data have been applied to this task in model organisms; however, the performance of such approaches in mammals has not yet been evaluated.In this study, a standardized collection of mouse functional genomic data was assembled; nine bioinformatics teams used this data set to independently train classifiers and generate predictions of function, as defined by Gene Ontology (GO) terms, for 21,603 mouse genes; and the best performing submissions were combined in a single set of predictions. We identified strengths and weaknesses of current functional genomic data sets and compared the performance of function prediction algorithms. This analysis inferred functions for 76% of mouse genes, including 5,000 currently uncharacterized genes. At a recall rate of 20%, a unified set of predictions averaged 41% precision, with 26% of GO terms achieving a precision better than 90%.We performed a systematic evaluation of diverse, independently developed computational approaches for predicting gene function from heterogeneous data sources in mammals. The results show that currently available data for mammals allows predictions with both breadth and accuracy. Importantly, many highly novel predictions emerge for the 38% of mouse genes that remain uncharacterized.
Project description:Wild populations of the house mouse (Mus musculus) represent the raw genetic material for the classical inbred strains in biomedical research and are a major model system for evolutionary biology. We provide whole genome sequencing data of individuals representing natural populations of M. m. domesticus (24 individuals from 3 populations), M. m. helgolandicus (3 individuals), M. m. musculus (22 individuals from 3 populations) and M. spretus (8 individuals from one population). We use a single pipeline to map and call variants for these individuals and also include 10 additional individuals of M. m. castaneus for which genomic data are publically available. In addition, RNAseq data were obtained from 10 tissues of up to eight adult individuals from each of the three M. m. domesticus populations for which genomic data were collected. Data and analyses are presented via tracks viewable in the UCSC or IGV genome browsers. We also provide information on available outbred stocks and instructions on how to keep them in the laboratory.
Project description:The Mesio-Temporal Lobe Epilepsy syndrome is the most common form of intractable epilepsy. It is characterized by recurrence of focal seizures and is often associated with hippocampal sclerosis and drug resistance. We aimed to characterize the molecular changes occurring during the initial stages of epileptogenesis in search of new therapeutic targets for Mesio-Temporal Lobe Epilepsy. We used a mouse model obtained by intra-hippocampal microinjection of kainate and performed hippocampal whole genome expression analysis at 6h, 12h and 24h post-injection, followed by multilevel bioinformatics analysis. We report significant changes in immune and inflammatory responses, neuronal network reorganization processes and glial functions, predominantly initiated during status epilepticus at 12h and persistent after the end of status epilepticus at 24h post-kainate. Upstream regulator analysis highlighted Cyba, Cybb and Vim as central regulators of multiple overexpressed genes implicated in glial responses at 24h. In silico microRNA analysis indicated that miR-9, miR-19b, miR-129, and miR-223 may regulate the expression of glial-associated genes at 24h. Our data support the hypothesis that glial-mediated inflammatory response holds a key role during epileptogenesis, and that microglial cells may participate in the initial process of epileptogenesis through increased ROS production via the NOX complex.
Project description:The laboratory mouse is the workhorse of immunology, used as a model of mammalian immune function, but how well immune responses of laboratory mice reflect those of free-living animals is unknown. Here we comprehensively characterize serological, cellular and functional immune parameters of wild mice and compare them with laboratory mice, finding that wild mouse cellular immune systems are, comparatively, in a highly activated (primed) state. Associations between immune parameters and infection suggest that high level pathogen exposure drives this activation. Moreover, wild mice have a population of highly activated myeloid cells not present in laboratory mice. By contrast, in vitro cytokine responses to pathogen-associated ligands are generally lower in cells from wild mice, probably reflecting the importance of maintaining immune homeostasis in the face of intense antigenic challenge in the wild. These data provide a comprehensive basis for validating (or not) laboratory mice as a useful and relevant immunological model system.