A re-annotation of the Anopheles darlingi mobilome.
ABSTRACT: The mobilome, portion of the genome composed of transposable elements (TEs), of Anopheles darlingi was described together with the genome of this species. Here, this mobilome was revised using similarity and de novo search approaches. A total of 5.6% of the A. darlingi genome is derived of TEs. Class I gypsy and copia were the most abundant superfamilies, corresponding to 22.36% of the mobilome. Non-LTR elements of the R1 and Jockey superfamilies account for 11% of the TEs. Among Class II TEs, the mariner superfamily is the most abundant (16.01%). Approximately 87% of the A. darlingi mobilome consist of short, truncated and/or degenerated copies of TEs. Only three retrotransposons, two belonging to gypsy and one to copia superfamilies, are putatively active elements. Only one Class II element, belonging to the mariner superfamily, is putatively active, having 12 copies in the genome. The TE landscape of A. darlingi is formed primarily by degenerated elements and, therefore, somewhat stable. Future applications of TE-based vectors for genetic transformation of A. darlingi should take into consideration mariner and piggyBac transposons, because full length and putatively active copies of these elements are present in its genome.
Project description:BACKGROUND: Verticillium dahliae (Vd) and Verticillium albo-atrum (Va) are cosmopolitan soil fungi causing very disruptive vascular diseases on a wide range of crop plants. To date, no sexual stage has been identified in either microorganism suggesting that somatic mutation is a major force in generating genetic diversity. Whole genome comparative analysis of the recently sequenced strains VdLs.17 and VaMs.102 revealed that non-random insertions of transposable elements (TEs) have contributed to the generation of four lineage-specific (LS) regions in VdLs.17. RESULTS: We present here a detailed analysis of Class I retrotransposons and Class II "cut-and-paste" DNA elements detected in the sequenced Verticillium genomes. We report also of their distribution in other Vd and Va isolates from various geographic origins. In VdLs.17, we identified and characterized 56 complete retrotransposons of the Gypsy-, Copia- and LINE-like types, as well as 34 full-length elements of the "cut-and-paste" superfamilies Tc1/mariner, Activator and Mutator. While Copia and Tc1/mariner were present in multiple identical copies, Activator and Mutator sequences were highly divergent. Most elements comprised complete ORFs, had matching ESTs and showed active transcription in response to stress treatment. Noticeably, we found evidences of repeat-induced point mutation (RIP) only in some of the Gypsy retroelements. While Copia-, Gypsy- and Tc1/mariner-like transposons were prominent, a large variation in presence of the other types of mobile elements was detected in the other Verticillium spp. strains surveyed. In particular, neither complete nor defective "cut-and-paste" TEs were found in VaMs.102. CONCLUSIONS: Copia-, Gypsy- and Tc1/mariner-like transposons are the most wide-spread TEs in the phytopathogens V. dahliae and V. albo-atrum. In VdLs.17, we identified several retroelements and "cut-and-paste" transposons still potentially active. Some of these elements have undergone diversification and subsequent selective amplification after introgression into the fungal genome. Others, such as the ripped Copias, have been potentially acquired by horizontal transfer. The observed biased TE insertion in gene-rich regions within an individual genome (VdLs.17) and the "patchy" distribution among different strains point to the mobile elements as major generators of Verticillium intra- and inter-specific genomic variation.
Project description:Transposable elements (TEs) are mobile genetic elements that parasitize basically all eukaryotic species genomes. Due to their complexity, an in-depth TE characterization is only available for a handful of model organisms. In the present study, we performed a de novo and homology-based characterization of TEs in the genomes of 24 mosquito species and investigated their mode of inheritance. More than 40% of the genome of Aedes aegypti, Aedes albopictus, and Culex quinquefasciatus is composed of TEs, while it varied substantially among Anopheles species (0.13%-19.55%). Class I TEs are the most abundant among mosquitoes and at least 24 TE superfamilies were found. Interestingly, TEs have been extensively exchanged by horizontal transfer (172 TE families of 16 different superfamilies) among mosquitoes in the last 30 million years. Horizontally transferred TEs represents around 7% of the genome in Aedes species and a small fraction in Anopheles genomes. Most of these horizontally transferred TEs are from the three ubiquitous LTR superfamilies: Gypsy, Bel-Pao and Copia. Searching more than 32,000 genomes, we also uncovered transfers between mosquitoes and two different Phyla-Cnidaria and Nematoda-and two subphyla-Chelicerata and Crustacea, identifying a vector, the worm Wuchereria bancrofti, that enabled the horizontal spread of a Tc1-mariner element among various Anopheles species. These data also allowed us to reconstruct the horizontal transfer network of this TE involving more than 40 species. In summary, our results suggest that TEs are frequently exchanged by horizontal transfers among mosquitoes, influencing mosquito's genome size and variability.
Project description:Two retrotransposons from the superfamilies Copia and Gypsy named as Copia-LTR_SS and Gypsy-LTR_SS, respectively, were identified in the genomic bank of Sclerotinia sclerotiorum. These transposable elements (TEs) contained direct and preserved long terminal repeats (LTR). Domains related to codified regions for gag protein, integrase, reverse transcriptase and RNAse H were identified in Copia-LTR_SS, whereas in Gypsy-LTR_SS only domains for gag, reverse transcriptase and RNAse H were found. The abundance of identified LTR-Solo suggested possible genetic recombination events in the S. sclerotiorum genome. Furthermore, alignment of the sequences for LTR elements from each superfamily suggested the presence of a RIP (repeat-induced point mutation) silencing mechanism that may directly affect the evolution of this species.
Project description:Transposable elements with long terminal direct repeats (LTR TEs) are one of the best studied groups of mobile elements. They are ubiquitous elements present in almost all eukaryotic genomes. Their number and state of conservation can be a highlight of genome dynamics. We searched all published fungal genomes for LTR-containing retrotransposons, including both complete, functional elements and remnant copies. We identified a total of over 66,000 elements, all of which belong to the Ty1/Copia or Ty3/Gypsy superfamilies. Most of the detected Gypsy elements represent Chromoviridae, i.e. they carry a chromodomain in the pol ORF. We analyzed our data from a genome-ecology perspective, looking at the abundance of various types of LTR TEs in individual genomes and at the highest-copy element from each genome. The TE content is very variable among the analyzed genomes. Some genomes are very scarce in LTR TEs (<50 elements), others demonstrate huge expansions (>8000 elements). The data shows that transposon expansions in fungi usually involve an increase both in the copy number of individual elements and in the number of element types. The majority of the highest-copy TEs from all genomes are Ty3/Gypsy transposons. Phylogenetic analysis of these elements suggests that TE expansions have appeared independently of each other, in distant genomes and at different taxonomical levels. We also analyzed the evolutionary relationships between protein domains encoded by the transposon pol ORF and we found that the protease is the fastest evolving domain whereas reverse transcriptase and RNase H evolve much slower and in correlation with each other.
Project description:The availability of the Whole-Genome Sequence of the wheat pest Mayetiola destructor offers the opportunity to investigate the Transposable Elements (TEs) content and their relationship with the genes involved in the insect virulence. In this study, de novo annotation carried out using REPET pipeline showed that TEs occupy approximately 16% of the genome and are represented by 1038 lineages. Class II elements were the most frequent and most TEs were inactive due to the deletions they have accumulated. The analyses of TEs ages revealed a first burst at 20% of divergence from present that mobilized many TE families including mostly Tc1/mariner and Gypsy superfamilies and a second burst at 2% of divergence, which involved mainly the class II elements suggesting new TEs invasions. Additionally, 86 TEs insertions involving recently transposed elements were identified. Among them, several MITEs and Gypsy retrotransposons were inserted in the vicinity of SSGP and chemosensory genes. The findings represent a valuable resource for more in-depth investigation of the TE impact onto M. destructor genome and their possible influence on the expression of the virulence and chemosensory genes and consequently the behavior of this pest towards its host plants.
Project description:The most abundant transposable elements (TEs) in plant genomes are Class I long terminal repeat (LTR) retrotransposons represented by superfamilies gypsy and copia Amplification of these superfamilies directly impacts genome structure and contributes to differential patterns of genome size evolution among plant lineages. Utilizing short-read Illumina data and sequence information from a panel of Helianthus annuus (sunflower) full-length gypsy and copia elements, we explore the contribution of these sequences to genome size variation among eight diploid Helianthus species and an outgroup taxon, Phoebanthus tenuifolius We also explore transcriptional dynamics of these elements in both leaf and bud tissue via RT-PCR. We demonstrate that most LTR retrotransposon sublineages (i.e., families) display patterns of similar genomic abundance across species. A small number of LTR retrotransposon sublineages exhibit lineage-specific amplification, particularly in the genomes of species with larger estimated nuclear DNA content. RT-PCR assays reveal that some LTR retrotransposon sublineages are transcriptionally active across all species and tissue types, whereas others display species-specific and tissue-specific expression. The species with the largest estimated genome size, H. agrestis, has experienced amplification of LTR retrotransposon sublineages, some of which have proliferated independently in other lineages in the Helianthus phylogeny.
Project description:The evolutionary dynamics of long terminal repeat (LTR) retrotransposons in tree genomes has remained largely unknown. The availability of the complete genome sequences of the mulberry tree (Morus notabilis) has offered an unprecedented opportunity for us to characterize these retrotransposon elements. We investigated 202 and 114 families of Copia and Gypsy superfamilies, respectively, comprising 2916 intact elements in the mulberry genome. The tRNAMet was the most frequently used type of tRNA in both superfamilies. Phylogenetic analysis suggested that Copia and Gypsy from mulberry can be grouped into eight and six lineages, respectively. All previously characterized families of such elements could also be found in the mulberry genome. About 95% of the identified Copia and Gypsy full elements were estimated to have been inserted into the mulberry genome within the past 2–3 million years. Meanwhile, the estimated insertion times of members of the three most abundant families of the Copia superfamily (908 members from the three most abundant families) and Gypsy superfamily (783 members from the three most abundant families) revealed divergent life histories. Compared with the situation in Gypsy elements, three families of Copia elements are under positive selection pressure, which suggested that Copia elements may have a dominant influence in the evolution of mulberry genes. Analysis of insertion and deletion dynamics suggested that Copia and Gypsy elements exhibited a very long half-life in the mulberry genome. The present work provides new insights into the insertion and deletion dynamics of LTR retrotransposons, and it will greatly improve our understanding of the important roles transposable elements play in the architecture of the mulberry genome.
Project description:The expression of 1,613 transposable elements (TEs) represented in the Affymetrix Wheat Genome Chip was examined during cold treatment in crowns of four hexaploid wheat genotypes that vary in tolerance to cold and in flowering time. The TE expression profiles showed a constant level of expression throughout the experiment in three of the genotypes. In winter Norstar, the most cold-hardy of the four genotypes, a subset of the TEs showed a burst of expression after vernalization saturation was achieved. About 47% of the TEs were expressed, and both Class I (retrotransposons) and Class II (DNA transposons) types were well represented. Gypsy and Copia were the most represented among the retrotransposons while CACTA and Mariner were the most represented DNA transposons. The data suggests that the Vrn-A1 region plays a role in the stage-specific induction of TE expression in this genotype.
Project description:<h4>Background</h4>Flax (Linum usitatissimum L.) is an important crop for the production of bioproducts derived from its seed and stem fiber. Transposable elements (TEs) are widespread in plant genomes and are a key component of their evolution. The availability of a genome assembly of flax (Linum usitatissimum) affords new opportunities to explore the diversity of TEs and their relationship to genes and gene expression.<h4>Results</h4>Four de novo repeat identification algorithms (PILER, RepeatScout, LTR_finder and LTR_STRUC) were applied to the flax genome assembly. The resulting library of flax repeats was combined with the RepBase Viridiplantae division and used with RepeatMasker to identify TEs coverage in the genome. LTR retrotransposons were the most abundant TEs (17.2% genome coverage), followed by Long Interspersed Nuclear Element (LINE) retrotransposons (2.10%) and Mutator DNA transposons (1.99%). Comparison of putative flax TEs to flax transcript databases indicated that TEs are not highly expressed in flax. However, the presence of recent insertions, defined by 100% intra-element LTR similarity, provided evidence for recent TE activity. Spatial analysis showed TE-rich regions, gene-rich regions as well as regions with similar genes and TE density. Monte Carlo simulations for the 71 largest scaffolds (? 1 Mb each) did not show any regional differences in the frequency of TE overlap with gene coding sequences. However, differences between TE superfamilies were found in their proximity to genes. Genes within TE-rich regions also appeared to have lower transcript expression, based on EST abundance. When LTR elements were compared, Copia showed more diversity, recent insertions and conserved domains than the Gypsy, demonstrating their importance in genome evolution.<h4>Conclusions</h4>The calculated 23.06% TE coverage of the flax WGS assembly is at the low end of the range of TE coverages reported in other eudicots, although this estimate does not include TEs likely found in unassembled repetitive regions of the genome. Since enrichment for TEs in genomic regions was associated with reduced expression of neighbouring genes, and many members of the Copia LTR superfamily are inserted close to coding regions, we suggest Copia elements have a greater influence on recent flax genome evolution while Gypsy elements have become residual and highly mutated.
Project description:In Eucalyptus genus, studies on genome composition and transposable elements (TEs) are particularly scarce. Nearly half of the recently released Eucalyptus grandis genome is composed by retrotransposons and this data provides an important opportunity to understand TE dynamics in Eucalyptus genome and transcriptome.We characterized nine families of transcriptionally active LTR retrotransposons from Copia and Gypsy superfamilies in Eucalyptus grandis genome and we depicted genomic distribution and copy number in two Eucalyptus species. We also evaluated genomic polymorphism and transcriptional profile in three organs of five Eucalyptus species. We observed contrasting genomic and transcriptional behavior in the same family among different species. RLC_egMax_1 was the most prevalent family and RLC_egAngela_1 was the family with the lowest copy number. Most families of both superfamilies have their insertions occurring <3 million years, except one Copia family, RLC_egBianca_1. Protein theoretical models suggest different properties between Copia and Gypsy domains. IRAP and REMAP markers suggested genomic polymorphisms among Eucalyptus species. Using EST analysis and qRT-PCRs, we observed transcriptional activity in several tissues and in all evaluated species. In some families, osmotic stress increases transcript values.Our strategy was successful in isolating transcriptionally active retrotransposons in Eucalyptus, and each family has a particular genomic and transcriptional pattern. Overall, our results show that retrotransposon activity have differentially affected genome and transcriptome among Eucalyptus species.