Project description:Retrotransposons have been shown to contribute to evolution of both structure and regulation of protein coding genes. It has been postulated that the primary mechanism by which retrotransposons contribute to structural gene evolution is through insertion into an intron or a gene flanking region, and subsequent incorporation into an exon.We found that Long Terminal Repeat (LTR) retrotransposons are associated with 1,057 human genes (5.8%). In 256 cases LTR retrotransposons were observed in protein-coding regions, while 50 distinct protein coding exons in 45 genes were comprised exclusively of LTR RetroTransposon Sequence (LRTS). We go on to reconstruct the evolutionary history of an alternatively spliced exon of the Interleukin 22 receptor, alpha 2 gene (IL22RA2) derived from a sequence of retrotransposon of the Mammalian apparent LTR retrotransposons (MaLR) family. Sequencing and analysis of the homologous regions of genomes of several primates indicate that the LTR retrotransposon was inserted into the IL22RA2 gene at least prior to the divergence of Apes and Old World monkeys from a common ancestor (approximately 25 MYA). We hypothesize that the recruitment of the part of LTR as a novel exon in great ape species occurred prior to the divergence of orangutans and humans from a common ancestor (approximately 14 MYA) as a result of a single mutation in the proto-splice site.Our analysis of LRTS exonization events has shown that the patterns of LRTS distribution in human exons support the hypothesis that LRTS played a significant role in human gene evolution by providing cis-regulatory sequences; direct incorporation of LTR sequences into protein coding regions was observed less frequently. Combination of computational and experimental approaches used for tracing the history of the LTR exonization process of IL22RA2 gene presents a promising strategy that could facilitate further studies of transposon initiated gene evolution.
Project description:BackgroundMost known eukaryotic genomes contain mobile copied elements called transposable elements. In some species, these elements account for the majority of the genome sequence. They have been subject to many mutations and other genomic events (copies, deletions, captures) during transposition. The identification of these transformations remains a difficult issue. The study of families of transposable elements is generally founded on a multiple alignment of their sequences, a critical step that is adapted to transposons containing mostly localized nucleotide mutations. Many transposons that have lost their protein-coding capacity have undergone more complex rearrangements, needing the development of more complex methods in order to characterize the architecture of sequence variations.ResultsIn this study, we introduce the concept of a transposable element module, a flexible motif present in at least two sequences of a family of transposable elements and built on a succession of maximal repeats. The paper proposes an assembly method working on a set of exact maximal repeats of a set of sequences to create such modules. It results in a graphical view of sequences segmented into modules, a representation that allows a flexible analysis of the transformations that have occurred between them. We have chosen as a demonstration data set in depth analysis of the transposable element Foldback in Drosophila melanogaster. Comparison with multiple alignment methods shows that our method is more sensitive for highly variable sequences. The study of this family and the two other families AtREP21 and SIDER2 reveals new copies of very different sizes and various combinations of modules which show the potential of our method.ConclusionsModuleOrganizer is available on the Genouest bioinformatics center at http://moduleorganizer.genouest.org.
Project description:BackgroundWhiteflies are agricultural pests that cause negative impacts globally to crop yields resulting at times in severe economic losses and food insecurity. The Bemisia tabaci whitefly species complex is the most damaging in terms of its broad crop host range and its ability to serve as vector for over 400 plant viruses. Genomes of whiteflies belonging to this species complex have provided valuable genomic data; however, transposable elements (TEs) within these genomes remain unexplored. This study provides the first accurate characterization of TE content within the B. tabaci species complex.ResultsThis study identified that an average of 40.61% of the genomes of three whitefly species (MEAM1, MEDQ, and SSA-ECA) consists of TEs. The majority of the TEs identified were DNA transposons (22.85% average) while SINEs (0.14% average) were the least represented. This study also compared the TE content of the three whitefly genomes with three other hemipteran genomes and found significantly more DNA transposons and less LINEs in the whitefly genomes. A total of 63 TE superfamilies were identified to be present across the three whitefly species (39 DNA transposons, six LTR, 16 LINE, and two SINE). The sequences of the identified TEs were clustered which generated 5766 TE clusters. A total of 2707 clusters were identified as uniquely found within the whitefly genomes while none of the generated clusters were from both whitefly and non-whitefly TE sequences. This study is the first to characterize TEs found within different B. tabaci species and has created a standardized annotation workflow that could be used to analyze future whitefly genomes.ConclusionThis study is the first to characterize the landscape of TEs within the B. tabaci whitefly species complex. The characterization of these elements within the three whitefly genomes shows that TEs occupy significant portions of B. tabaci genomes, with DNA transposons representing the vast majority. This study also identified TE superfamilies and clusters of TE sequences of potential interest, providing essential information, and a framework for future TE studies within this species complex.
Project description:The Bornean representatives of the genus Georissa (Hydrocenidae) have small, dextral, conical, calcareous shells consisting of ca. three teleoconch whorls. Our recent study on the Georissa of Malaysian Borneo has revealed high intra- and inter-specific variation in the "scaly" group (a group of species with striking scale-like surface sculpture). The present study on the "non-scaly" Georissa is the continuation of the species revision for the genus. The "non-scaly" species are also diverse in shell sculptures. This informal group comprises Georissa with subtle spiral and/or radial sculpture. The combination of detailed conchological assessment and molecular analyses provides clear distinctions for each of the species. Conchological, molecular, and biogeographic details are presented for 16 species of "non-scaly" Georissa. Three of these are new to science, namely Georissacorrugata sp. n., Georissainsulae sp. n., and Georissatrusmadi sp. n.
Project description:BackgroundWith the expansion of high throughput sequencing, we now have access to a larger number of genome-wide studies analyzing the Transposable elements (TEs) composition in a wide variety of organisms. However, genomic analyses often remain too limited in number and diversity of species investigated to study in depth the dynamics and evolutionary success of the different types of TEs among metazoans. Therefore, we chose to investigate the use of transcriptomes to describe the diversity of TEs in phylogenetically related species by conducting the first comparative analysis of TEs in two groups of polychaetes and evaluate the diversity of TEs that might impact genomic evolution as a result of their mobility.ResultsWe present a detailed analysis of TEs distribution in transcriptomes extracted from 15 polychaetes depending on the number of reads used during assembly, and also compare these results with additional TE scans on associated low-coverage genomes. We then characterized the clades defined by 1021 LTR-retrotransposon families identified in 26 species. Clade richness was highly dependent on the considered superfamily. Copia elements appear rare and are equally distributed in only three clades, GalEa, Hydra and CoMol. Among the eight BEL/Pao clades identified in annelids, two small clades within the Sailor lineage are new for science. We characterized 17 Gypsy clades of which only 4 are new; the C-clade largely dominates with a quarter of the families. Finally, all species also expressed for the majority two distinct transcripts encoding PIWI proteins, known to be involved in control of TEs mobilities.ConclusionsThis study shows that the use of transcriptomes assembled from 40 million reads was sufficient to access to the diversity and proportion of the transposable elements compared to those obtained by low coverage sequencing. Among LTR-retrotransposons Gypsy elements were unequivocally dominant but results suggest that the number of Gypsy clades, although high, may be more limited than previously thought in metazoans. For BEL/Pao elements, the organization of clades within the Sailor lineage appears more difficult to establish clearly. The Copia elements remain rare and result from the evolutionary consistent success of the same three clades.
Project description:The availability of the sequenced Drosophila melanogaster genome provides an opportunity to study sequence variation between copies within transposable element families. In this study,we analyzed the 624 copies of 22 transposable element (TE) families (14 LTR retrotransposons, five non-LTR retrotransposons, and three transposons). LTR and non-LTR retrotransposons possessed far fewer divergent elements than the transposons,suggesting that the difference depends on the transposition mechanism. However,there was not a continuous range of divergence of the copies in each class,which were either very similar to the canonical elements,or very divergent from them. This sequence homogeneity among TE family copies matches the theoretical models of the dynamics of these repeated sequences. The sequenced Drosophila genome thus appears to be composed of a mixture of TEs that are still active and of ancient relics that have degenerated and the distribution of which along the chromosomes results from natural selection. This clearly demonstrates that the TEs are highly active within the genome,suggesting that the genetic variability of the Drosophila genome is still being renewed by the action of TEs.
Project description:The Southwest Pacific represents an independent biogeographic province for deep-sea hydrothermal vent fauna. Different degrees of genetic connectivity among vent fields in Manus, North Fiji and Lau Basins have been reported for various molluscan and crustacean species, presumably reflecting their different levels of dispersal ability as swimming larvae. The present study investigates the population connectivity of the hydrothermal vent limpet Shinkailepas tollmanni (family Phenacolepadidae) in the Southwest Pacific. Our analyses using mitochondrial COI-gene sequences and shell morphometric traits suggest a panmictic population structure throughout its geographic and bathymetric ranges, spanning 4,000 km from the westernmost Manus Basin (151ºE; 1,300 m deep) to the easternmost Lau Basin (176ºE; 2,720 m). The measurements of its embryonic and larval shells demonstrate that the species hatches as a planktotrophic veliger larva with an embryonic shell diameter of 170-180 μm and settles at the vent environment with the larval shell diameter of 750-770 μm. This substantial growth as a feeding larva, ca. 80 times in volume, is comparable or even greater than those of confamilial species in the hydrothermal-vent and methane-seep environments in the Northwest Pacific and Atlantic Oceans. Large pigmented eyes in newly settled juveniles are another common feature in this and other phenacolepadids inhabiting the chemosynthetic environments. These results put together suggest that the larvae of S. tollmanni migrate vertically from deep-sea vents to surface waters to take advantages of richer food supplies and faster currents and stay pelagic for an extended period of time (> 1 year), as previously indicated for the confamilial species.
Project description:Neritidae is one of the most diverse families of Neritimorpha and possesses euryhaline properties. Members of this family usually live on tropical and subtropical coasts and are mainly gregarious. The phylogenetic relationships between several subclasses of Gastropoda have been controversial for many years. With an increase in the number of described species of Neritidae, the knowledge of the evolutionary relationships in this family has improved. In the present study, we sequenced four complete mitochondrial genomes from two genera (Clithon and Nerita) and compared them with available complete mitochondrial genomes of Neritidae. Gene order exhibited a highly conserved pattern among three genera in the Neritidae family. Our results improved the phylogenetic resolution within Neritidae, and more comprehensive taxonomic sampling of subclass Neritimorpha was proposed. Furthermore, we reconstructed the divergence among the main lineages of 19 Neritimorpha taxa under an uncorrelated relaxed molecular clock.
Project description:Eukaryotic genomes consist of several repetitive DNAs, including dispersed DNA sequences that move between chromosome sites, tandem repeats of DNA sequences, and multigene families. In this study, repeated sequences isolated from the genome of Characidium gomesi were analyzed and mapped to chromosomes in Characidium zebra and specimens from two populations of C. gomesi. The sequences were transposable elements (TEs) named retroelement of Xiphophorus (Rex); multigene families of U2 small nuclear RNA (U2 snRNA); and histones H1, H3, and H4. Sequence analyses revealed that U2 snRNA contains a major portion corresponding to the Tx1-type non-LTR retrotransposon Keno, the preferential insertion sites of which are U2 snRNA sequences. All histone sequences were found to be associated with TEs. In situ localization revealed that these DNA sequences are dispersed throughout the autosomes of the species, but they are not involved in differentiation of the specific region of the W sex chromosome in C. gomesi. We discuss mechanisms of TE invasion into multigene families that lead to microstructural variation in Characidium genomes.
Project description:BackgroundMiniature inverted-repeat transposable elements (MITEs) and long terminal repeat (LTR) retrotransposons are ubiquitous in plants genomes, and highly important in their evolution and diversity. However, their mechanisms of insertion/amplification and roles in Citrus genome's evolution/diversity are still poorly understood.ResultsTo address this knowledge gap, we developed different computational pipelines to analyze, annotate and classify MITEs and LTR retrotransposons in six different sequenced Citrus species. We identified 62,010 full-length MITEs from 110 distinguished families. We observed MITEs tend to insert in gene related regions and enriched in promoters. We found that DTM63 is possibly an active Mutator-like MITE family in the traceable past and may still be active in Citrus. The insertion of MITEs resulted in massive polymorphisms and played an important role in Citrus genome diversity and gene structure variations. In addition, 6630 complete LTR retrotransposons and 13,371 solo-LTRs were identified. Among them, 12 LTR lineages separated before the differentiation of mono- and dicotyledonous plants. We observed insertion and deletion of LTR retrotransposons was accomplished with a dynamic balance, and their half-life in Citrus was ~ 1.8 million years.ConclusionsThese findings provide insights into MITEs and LTR retrotransposons and their roles in genome diversity in different Citrus genomes.