Dynamic evolution of V1R putative pheromone receptors between Mus musculus and Mus spretus.
ABSTRACT: The mammalian vomeronasal organ (VNO) expresses two G-protein coupled receptor gene families that mediate pheromone responses, the V1R and V2R receptor genes. In rodents, there are ~150 V1R genes comprising 12 subfamilies organized in gene clusters at multiple chromosomal locations. Previously, we showed that several of these subfamilies had been extensively modulated by gene duplications, deletions, and gene conversions around the time of the evolutionary split of the mouse and rat lineages, consistent with the hypothesis that V1R repertoires might be involved in reinforcing speciation events. Here, we generated genome sequence for one large cluster containing two V1R subfamilies in Mus spretus, a closely related and sympatric species to Mus musculus, and investigated evolutionary change in these repertoires along the two mouse lineages.We describe a comparison of spretus and musculus with respect to genome organization and synteny, as well as V1R gene content and phylogeny, with reference to previous observations made between mouse and rat. Unlike the mouse-rat comparisons, synteny seems to be largely conserved between the two mouse species. Disruption of local synteny is generally associated with differences in repeat content, although these differences appear to arise more from deletion than new integrations. Even though unambiguous V1R orthology is evident, we observe dynamic modulation of the functional repertoires, with two of seven V1Rb and one of eleven V1Ra genes lost in spretus, two V1Ra genes becoming pseudogenes in musculus, two additional orthologous pairs apparently subject to strong adaptive selection, and another divergent orthologous pair that apparently was subjected to gene conversion.Therefore, eight of the 18 (~44%) presumptive V1Ra/V1Rb genes in the musculus-spretus ancestor appear to have undergone functional modulation since these two species diverged. As compared to the rat-mouse split, where modulation is evident by independent expansions of these two V1R subfamilies, divergence between musculus and spretus has arisen more by mutations within coding sequences. These results support the hypothesis that adaptive changes in functional V1R repertoires contribute to the delineation of very closely related species.
Project description:BACKGROUND: Mus spretus diverged from Mus musculus over one million years ago. These mice are genetically and phenotypically divergent. Despite the value of utilizing M. musculus and M. spretus for quantitative trait locus (QTL) mapping, relatively little genomic information on M. spretus exists, and most of the available sequence and polymorphic data is for one strain of M. spretus, Spret/Ei. In previous work, we mapped fifteen loci for skin cancer susceptibility using four different M. spretus by M. musculus F1 backcrosses. One locus, skin tumor susceptibility 5 (Skts5) on chromosome 12, shows strong linkage in one cross. RESULTS: To identify potential candidate genes for Skts5, we sequenced 65 named and unnamed genes and coding elements mapping to the peak linkage area in outbred spretus, Spret/EiJ, FVB/NJ, and NIH/Ola. We identified polymorphisms in 62 of 65 genes including 122 amino acid substitutions. To look for polymorphisms consistent with the linkage data, we sequenced exons with amino acid polymorphisms in two additional M. spretus strains and one additional M. musculus strain generating 40.1 kb of sequence data. Eight candidate variants were identified that fit with the linkage data. To determine the degree of variation across M. spretus, we conducted phylogenetic analyses. The relatedness of the M. spretus strains at this locus is consistent with the proximity of region of ascertainment of the ancestral mice. CONCLUSION: Our analyses suggest that, if Skts5 on chromosome 12 is representative of other regions in the genome, then published genomic data for Spret/EiJ are likely to be of high utility for genomic studies in other M. spretus strains.
Project description:A LINE-1 element, LIC105, was found in the Mus musculus domesticus inbred strain, C57BL/6J. Upon sequencing, this element was found to belong to a M. spretus LINE-1 subfamily originating within the last 0.2 million years. This is the second spretus-specific LINE-1 subfamily found to be represented in C57BL/6J. Although it is unclear how these M. spretus LINE-1s transferred from M. spretus to M. m. domesticus, it is now clear that at least two different spretus LINE-1 sequences have recently transferred. The limited divergence between the C57BL/6J spretus-like LINE-1s and their closest spretus ancestors suggests that the transfer did not involve an exceptionally long lineage of sequential transpositions.
Project description:BACKGROUND: Due to their high level of genotypic and phenotypic variability, Mus spretus strains were introduced in laboratories to investigate the genetic determinism of complex phenotypes including quantitative trait loci. Mus spretus diverged from Mus musculus around 2.5 million years ago and exhibits on average a single nucleotide polymorphism (SNP) in every 100 base pairs when compared with any of the classical laboratory strains. A genoproteomic approach was used to assess polymorphism of the major milk proteins between SEG/Pas and C57BL/6J, two inbred strains of mice representative of Mus spretus and Mus musculus species, respectively. RESULTS: The milk protein concentration was dramatically reduced in the SEG/Pas strain by comparison with the C57BL/6J strain (34 ± 9 g/L vs. 125 ± 12 g/L, respectively). Nine major proteins were identified in both milks using RP-HPLC, bi-dimensional electrophoresis and MALDI-Tof mass spectrometry. Two caseins (? and ?s1) and the whey acidic protein (WAP), showed distinct chromatographic and electrophoresis behaviours. These differences were partly explained by the occurrence of amino acid substitutions and splicing variants revealed by cDNA sequencing. A total of 34 SNPs were identified in the coding and 3'untranslated regions of the SEG/Pas Csn1s1 (11), Csn2 (7) and Wap (8) genes. In addition, a 3 nucleotide deletion leading to the loss of a serine residue at position 93 was found in the SEG/Pas Wap gene. CONCLUSION: SNP frequencies found in three milk protein-encoding genes between Mus spretus and Mus musculus is twice the values previously reported at the whole genome level. However, the protein structure and post-translational modifications seem not to be affected by SNPs characterized in our study. Splicing mechanisms (cryptic splice site usage, exon skipping, error-prone junction sequence), already identified in casein genes from other species, likely explain the existence of multiple ?s1-casein isoforms both in SEG/Pas and C57BL/6J strains. Finally, we propose a possible mechanism by which the hallmark tandem duplication of a 18-nt exon (14 copies) may have occurred in the mouse genome.
Project description:The V1R gene family comprises one of two types of putative pheromone receptors expressed in the mammalian vomeronasal organ (VNO). We searched the most recent mouse, rat, dog, chimpanzee, and human genome sequence assemblies to compile a near-complete repertoire of V1R genes for each species. Dog, human, and chimpanzee have very few intact V1Rs (8, 2, and 0, respectively) compared to more than a hundred intact V1Rs in each of the rat (106) and mouse (165) genomes. We also provide the first description of the diversity of V1R pseudogenes in these species. We identify at least 165 pseudogenes in mouse, 110 in rat, 102 in chimpanzee, 115 in human, and 54 in dog. Primate and dog pseudogenes are distributed among almost all V1R subfamilies seen in rodents, indicating that the common ancestor of these species had a diverse V1R repertoire. We find that V1R genes were subject to strikingly different fates in different species and in different subfamilies. In rodents, some subfamilies remained relatively stable or underwent roughly equivalent expansion in mouse and rat; other subfamilies expanded in one species but not the other. The small number of intact V1Rs in the dog genome is unexpected given the presumption that dogs, like rodents, have a functional VNO, and a complex system of pheromone-based behaviors. We identify an intact transient receptor potential channel 2beta in the dog genome, consistent with a functional VNO in dogs. The diminished V1R repertoire in dogs raises questions about the relative contributions of V1Rs versus other candidate pheromone receptor genes in the establishment of complex pheromone systems in mammals.
Project description:Sensory gene families are of special interest for both what they can tell us about molecular evolution and what they imply as mediators of social communication. The vomeronasal type-1 receptors (V1Rs) have often been hypothesized as playing a fundamental role in driving or maintaining species boundaries given their likely function as mediators of intraspecific mate choice, particularly in nocturnal mammals. Here, we employ a comparative genomic approach for revealing patterns of V1R evolution within primates, with a special focus on the small-bodied nocturnal mouse and dwarf lemurs of Madagascar (genera Microcebus and Cheirogaleus, respectively). By doubling the existing genomic resources for strepsirrhine primates (i.e. the lemurs and lorises), we find that the highly speciose and morphologically cryptic mouse lemurs have experienced an elaborate proliferation of V1Rs that we argue is functionally related to their capacity for rapid lineage diversification. Contrary to a previous study that found equivalent degrees of V1R diversity in diurnal and nocturnal lemurs, our study finds a strong correlation between nocturnality and V1R elaboration, with nocturnal lemurs showing elaborate V1R repertoires and diurnal lemurs showing less diverse repertoires. Recognized subfamilies among V1Rs show unique signatures of diversifying positive selection, as might be expected if they have each evolved to respond to specific stimuli. Furthermore, a detailed syntenic comparison of mouse lemurs with mouse (genus Mus) and other mammalian outgroups shows that orthologous mammalian subfamilies, predicted to be of ancient origin, tend to cluster in a densely populated region across syntenic chromosomes that we refer to as a V1R "hotspot."
Project description:In Mus spretus, the chloride channel 4 gene Clcn4-2 is X-linked and dosage compensated by X up-regulation and X inactivation, while in the closely related mouse species Mus musculus, Clcn4-2 has been translocated to chromosome 7. We sequenced Clcn4-2 in M. spretus and identified the breakpoints of the evolutionary translocation in the Mus lineage. Genetic and epigenetic differences were observed between the 5'ends of the autosomal and X-linked loci. Remarkably, Clcn4-2 introns have been truncated on chromosome 7 in M. musculus as compared with the X-linked loci from seven other eutherian mammals. Intron sequences specifically preserved in the X-linked loci were significantly enriched in AT-rich oligomers. Genome-wide analyses showed an overall enrichment in AT motifs unique to the eutherian X (except for genes that escape X inactivation), suggesting a role for these motifs in regulation of the X chromosome.
Project description:We applied a comprehensive data-mining strategy to examine the repertoires of rat and mouse odorant receptors (ORs) and type 1 pheromone receptors (V1Rs) using the mm5 (mouse) and rn3 (rat) genomes. We identified 1576 rat OR genes, including 292 pseudogenes. The rat V1R repertoire is composed of 115 intact genes and 72 pseudogenes. The mouse OR and V1R databases were updated using the new assembly mm5, from which 1375 mouse ORs and 308 V1Rs were identified, with more than 100 putative pseudogenes from mm2 now identified as intact because of the higher sequence quality. With these new data we have conducted a series of genomic analyses of the OR and V1R genes from mouse and rat. Orthologous OR clusters were identified in mouse and rat and comparison analysis was performed at three incremental levels: families, coding sequences, and motifs. At the family level, we found that V1R genes have more species-specific families than OR genes. About 20% of intact V1R genes have no orthologous counterpart in the same family, whereas less than 1% of intact ORs are similarly isolated. At the coding sequence level, OR genes are more conserved between mouse and rat than V1R genes. OR genes share greater similarity with their orthologous counterparts than with their closest neighbor, whereas V1R genes show the opposite tendency. Motifs were identified to obtain biological insights. Motifs specific for species or families were found in OR and V1R genes, which may result in the differential pheromone-dependent behaviors and perception of odors between mouse and rat.
Project description:Genomic data for the closest relatives of house mice (Mus musculus species complex) are surprisingly limited. Here, we present the first complete genome for a behaviorally and ecologically unique member of the sister clade to house mice, the mound-building mouse, Mus spicilegus Using read cloud sequencing and de novo assembly we produced a 2.50 Gbp genome with a scaffold N50 of 2.27 Mbp. We constructed >25 000 gene models, of which the majority had high homology to other Mus species. To evaluate the utility of the M. spicilegus genome for behavioral and ecological genomics, we extracted 196 vomeronasal receptor (VR) sequences from our genome and analyzed phylogenetic relationships between M. spicilegus VRs and orthologs from M. musculus and the Algerian mouse, M. spretus While most M. spicilegus VRs clustered with orthologs in M. musculus and M. spretus, 10 VRs with evidence of rapid divergence in M. spicilegus are strong candidate modulators of species-specific chemical communication. A high quality assembly and genome for M. spicilegus will help to resolve discordant ancestry patterns in house mouse genomes, and will provide an essential foundation for genetic dissection of phenotypes that distinguish commensal from non-commensal species, and the social and ecological characteristics that make M. spicilegus unique.
Project description:Wild populations of the house mouse (Mus musculus) represent the raw genetic material for the classical inbred strains in biomedical research and are a major model system for evolutionary biology. We provide whole genome sequencing data of individuals representing natural populations of M. m. domesticus (24 individuals from 3 populations), M. m. helgolandicus (3 individuals), M. m. musculus (22 individuals from 3 populations) and M. spretus (8 individuals from one population). We use a single pipeline to map and call variants for these individuals and also include 10 additional individuals of M. m. castaneus for which genomic data are publically available. In addition, RNAseq data were obtained from 10 tissues of up to eight adult individuals from each of the three M. m. domesticus populations for which genomic data were collected. Data and analyses are presented via tracks viewable in the UCSC or IGV genome browsers. We also provide information on available outbred stocks and instructions on how to keep them in the laboratory.
Project description:We have analyzed the organization and sequence of 73 V1R genes encoding putative pheromone receptors to identify regulatory features and characterize the evolutionary history of the V1R family. The 73 V1Rs arose from seven ancestral genes around the time of mouse-rat speciation through large local duplications, and this expansion may contribute to speciation events. Orthologous V1R genes appear to have been lost during primate evolution. Exceptional noncoding homology is observed across four V1R subfamilies at one cluster and thus may be important for locus-specific transcriptional regulation.