The impact of Ty3-gypsy group LTR retrotransposons Fatima on B-genome specificity of polyploid wheats.
ABSTRACT: BACKGROUND: Transposable elements (TEs) are a rapidly evolving fraction of the eukaryotic genomes and the main contributors to genome plasticity and divergence. Recently, occupation of the A- and D-genomes of allopolyploid wheat by specific TE families was demonstrated. Here, we investigated the impact of the well-represented family of gypsy LTR-retrotransposons, Fatima, on B-genome divergence of allopolyploid wheat using the fluorescent in situ hybridisation (FISH) method and phylogenetic analysis. RESULTS: FISH analysis of a BAC clone (BAC_2383A24) initially screened with Spelt1 repeats demonstrated its predominant localisation to chromosomes of the B-genome and its putative diploid progenitor Aegilops speltoides in hexaploid (genomic formula, BBAADD) and tetraploid (genomic formula, BBAA) wheats as well as their diploid progenitors. Analysis of the complete BAC_2383A24 nucleotide sequence (113,605 bp) demonstrated that it contains 55.6% TEs, 0.9% subtelomeric tandem repeats (Spelt1), and five genes. LTR retrotransposons are predominant, representing 50.7% of the total nucleotide sequence. Three elements of the gypsy LTR retrotransposon family Fatima make up 47.2% of all the LTR retrotransposons in this BAC. In situ hybridisation of the Fatima_2383A24-3 subclone suggests that individual representatives of the Fatima family contribute to the majority of the B-genome specific FISH pattern for BAC_2383A24. Phylogenetic analysis of various Fatima elements available from databases in combination with the data on their insertion dates demonstrated that the Fatima elements fall into several groups. One of these groups, containing Fatima_2383A24-3, is more specific to the B-genome and proliferated around 0.5-2.5 MYA, prior to allopolyploid wheat formation. CONCLUSION: The B-genome specificity of the gypsy-like Fatima, as determined by FISH, is explained to a great degree by the appearance of a genome-specific element within this family for Ae. speltoides. Moreover, its proliferation mainly occurred in this diploid species before it entered into allopolyploidy.Most likely, this scenario of emergence and proliferation of the genome-specific variants of retroelements, mainly in the diploid species, is characteristic of the evolution of all three genomes of hexaploid wheat.
Project description:The environment can have a decisive influence on the structure of the genome, changing it in a certain direction. Therefore, the genomic distribution of environmentally sensitive transposable elements may vary measurably across a species area. In the present research, we aimed to detect and evaluate the level of LTR retrotransposon intraspecific variability in Aegilops speltoides (2n = 2x = 14), a wild cross-pollinated relative of cultivated wheat. The interretrotransposon amplified polymorphism (IRAP) protocol was applied to detect and evaluate the level of retrotransposon intraspecific variability in Ae. speltoides and closely related species. IRAP analysis revealed significant diversity in TE distribution. Various genotypes from the 13 explored populations significantly differ with respect to the patterns of the four explored LTR retrotransposons (WIS2, Wilma, Daniela, and Fatima). This diversity points to a constant ongoing process of LTR retrotransposon fraction restructuring in populations of Ae. speltoides throughout the species' range and within single populations in time. Maximum changes were recorded in genotypes from small stressed populations. Principal component analysis showed that the dynamics of the Fatima element significantly differ from those of WIS2, Wilma, and Daniela. In terms of relationships between Sitopsis species, IRAP analysis revealed a grouping with Ae. sharonensis and Ae. longissima forming a separate unit, Ae. speltoides appearing as a dispersed group, and Ae. bicornis being in an intermediate position. IRAP display data revealed dynamic changes in LTR retrotransposon fractions in the genome of Ae. speltoides. The process is permanent and population specific, ultimately leading to the separation of small stressed populations from the main group.
Project description:From a wild diploid species that is a relative of wheat, Aegilops speltoides, a 301-bp repeat containing 16 copies of a CAA microsatellite was isolated. Southern blot and fluorescence in situ hybridization revealed that approximately 250 bp of the sequence is tandemly arrayed at the centromere regions of A- and B-genome chromosomes of common wheat and rye chromosomes. Although the DNA sequence of this 250-bp repeat showed no notable homology in the databases, the flanking or intervening sequences between the repeats showed high homologies (>82%) to two separate sequences of the gag gene and its upstream region in cereba, a Ty3/gypsy-like retroelement of Hordeum vulgare. Since the amino acid sequence deduced from the 250 bp with seven CAAs showed some similarity ( approximately 53%) to that of the gag gene, we concluded that the 250-bp repeats had also originated from the cereba-like retroelements in diploid wheat such as Ae. speltoides and had formed tandem arrays, whereas the 300-bp repeats were dispersed as a part of cereba-like retroelements. This suggests that some tandem repeats localized at the centromeric regions of cereals and other plant species originated from parts of retrotransposons.
Project description:Five diploid Aegilops species of the Sitopsis section: Ae. speltoides, Ae. longissima, Ae. sharonensis, Ae. searsii, and Ae. bicornis, two tetraploid species Ae. peregrina (= Ae. variabilis) and Ae. kotschyi (Aegilops section) and hexaploid Ae. vavilovii (Vertebrata section) carry the S-genomes. The B- and G-genomes of polyploid wheat are also the derivatives of the S-genome. Evolution of the S-genome species was studied using Giemsa C-banding and fluorescence in situ hybridization (FISH) with DNA probes representing 5S (pTa794) and 18S-5.8S-26S (pTa71) rDNAs as well as nine tandem repeats: pSc119.2, pAesp_SAT86, Spelt-1, Spelt-52, pAs1, pTa-535, and pTa-s53. To correlate the C-banding and FISH patterns we used the microsatellites (CTT)10 and (GTT)9, which are major components of the C-banding positive heterochromatin in wheat. According to the results obtained, diploid species split into two groups corresponding to Emarginata and Truncata sub-sections, which differ in the C-banding patterns, distribution of rDNA and other repeats. The B- and G-genomes of polyploid wheat are most closely related to the S-genome of Ae. speltoides. The genomes of allopolyploid wheat have been evolved as a result of different species-specific chromosome translocations, sequence amplification, elimination and re-patterning of repetitive DNA sequences. These events occurred independently in different wheat species and in Ae. speltoides . The 5S rDNA locus of chromosome 1S was probably lost in ancient Ae. speltoides prior to formation of Timopheevii wheat, but after the emergence of ancient emmer. Evolution of Emarginata species was associated with an increase of C-banding and (CTT)10-positive heterochromatin, amplification of Spelt-52, re-pattering of the pAesp_SAT86, and a gradual decrease in the amount of the D-genome-specific repeats pAs1, pTa-535, and pTa-s53. The emergence of Ae. peregrina and Ae. kotschyi did not lead to significant changes of the S*-genomes. However, partial elimination of 45S rDNA repeats from 5S* and 6S* chromosomes and alterations of C-banding and FISH-patterns have been detected. Similarity of the Sv-genome of Ae. vavilovii with the Ss genome of diploid Ae. searsii confirmed the origin of this hexaploid. A model of the S-genome evolution is suggested.
Project description:Repetitive DNA-specifically, transposable elements (TEs)-is a prevailing genomic fraction in cereals that underlies extensive genome reshuffling and intraspecific diversification in the wild. Although large amounts of data have been accumulated, the effect of TEs on the genome architecture and functioning is not fully understood. Here, plant genome organization was addressed by means of cloning and sequencing TE fragments of different types, which compose the largest portion of the Aegilops speltoides genome. Individual genotypes were analyzed cytogenetically using the cloned TE fragments as the DNA probes for fluorescence in situ hybridization (FISH). The obtained TE sequences of the Ty1-copia, Ty3-gypsy, LINE, and CACTA superfamilies showed the relatedness of the Ae. speltoides genome to the Triticeae tribe and similarities to evolutionarily distant species. A significant number of clones consisted of intercalated fragments of TEs of various types, in which Fatima (Ty3-gypsy) sequences predominated. At the chromosomal level, different TE clones demonstrated sequence-specific patterning, emphasizing the effect of the TE fraction on the Ae. speltoides genome architecture and intraspecific diversification. Altogether, the obtained data highlight the current species-specific organization and patterning of the mobile element fraction and point to ancient evolutionary events in the genome of Ae. speltoides.
Project description:Following allopolyploidization, nascent polyploid wheat species react with massive genomic rearrangements, including deletion of transposable element-containing sequences. While such massive rearrangements are considered to be a prominent process in wheat genome evolution and speciation, their structure, extent, and underlying mechanisms remain poorly understood. In this study, we retrieved ~3500 insertions of a specific variant of Fatima, one of the most dynamic gypsy long-terminal repeat retrotransposons in wheat from the recently available high-quality genome drafts of Triticum aestivum (bread wheat) and Triticum turgidum ssp. dicoccoides or wild emmer, the allotetraploid mother of all modern wheats. The dynamic nature of Fatima facilitated the identification of large (i.e., up to ~ 1 million bases) Fatima-containing insertions/deletions (indels) upon comparison of bread wheat and wild emmer genomes. We characterized 11 such indels using computer-assisted analysis followed by PCR validation, and found that they might have occurred via unequal intra-strand recombination or double-strand break (DSB) events. Additionally, we observed one case of introgression of novel DNA fragments from an unknown source into the wheat genome. Our data thus indicate that massive large-scale DNA rearrangements might play a prominent role in wheat speciation.
Project description:BACKGROUND:Long terminal repeat (LTR) retrotransposons constitute a major fraction of the genomes of higher plants. For example, retrotransposons comprise more than 50% of the maize genome and more than 90% of the wheat genome. LTR retrotransposons are believed to have contributed significantly to the evolution of genome structure and function. The genome sequencing of selected experimental and agriculturally important species is providing an unprecedented opportunity to view the patterns of variation existing among the entire complement of retrotransposons in complete genomes. RESULTS:Using a new data-mining program, LTR_STRUC, (LTR retrotransposon structure program), we have mined the GenBank rice (Oryza sativa) database as well as the more extensive (259 Mb) Monsanto rice dataset for LTR retrotransposons. Almost two-thirds (37) of the 59 families identified consist of copia-like elements, but gypsy-like elements outnumber copia-like elements by a ratio of approximately 2:1. At least 17% of the rice genome consists of LTR retrotransposons. In addition to the ubiquitous gypsy- and copia-like classes of LTR retrotransposons, the rice genome contains at least two novel families of unusually small, non-coding (non-autonomous) LTR retrotransposons. CONCLUSIONS:Each of the major clades of rice LTR retrotransposons is more closely related to elements present in other species than to the other clades of rice elements, suggesting that horizontal transfer may have occurred over the evolutionary history of rice LTR retrotransposons. Like LTR retrotransposons in other species with relatively small genomes, many rice LTR retrotransposons are relatively young, indicating a high rate of turnover.
Project description:Long terminal repeat (LTR) retrotransposons are highly abundant in plant genomes and require transcriptional activity for their proliferative mode of replication. These sequences exist in plant genomes as diverse sublineages within the main element superfamilies (i.e., gypsy and copia). While transcriptional activity of these elements is increasingly recognized as a regular attribute of plant transcriptomes, it is currently unknown the extent to which different sublineages of these elements are transcriptionally active both within and across species. In the current report, we utilize next generation sequencing methods to examine genomic copy number abundance of diverse LTR retrotransposon sublineages and their corresponding levels of transcriptional activity in three diploid wild sunflower species, Helianthus agrestis, H. carnosus and H. porteri.The diploid sunflower species under investigation differ in genome size 2.75-fold, with 2C values of 22.93 for H. agrestis, 12.31 for H. carnosus and 8.33 for H. porteri. The same diverse gypsy and copia sublineages of LTR retrotransposons were identified across species, but with gypsy sequences consistently more abundant than copia and with global gypsy sequence abundance positively correlated with nuclear genome size. Transcriptional activity was detected for multiple copia and gypsy sequences, with significantly higher activity levels detected for copia versus gypsy. Interestingly, of 11 elements identified as transcriptionally active, 5 exhibited detectable expression in all three species and 3 exhibited detectable expression in two species.Combined analyses of LTR retrotransposon genomic abundance and transcriptional activity across three sunflower species provides novel insights into genome size evolution and transposable element dynamics in this group. Despite considerable variation in nuclear genome size among species, relatively conserved patterns of LTR retrotransposon transcriptional activity were observed, with a highly overlapping set of copia and gypsy sequences observed to be transcriptionally active across species. A higher proportion of copia versus gypsy elements were found to be transcriptionally active and these sequences also were expressed at higher levels.
Project description:Centromeric regions of plants are generally composed of large array of satellites from a specific lineage of Gypsy LTR-retrotransposons, called Centromeric Retrotransposons. Repeated sequences interact with a specific H3 histone, playing a crucial function on kinetochore formation. To study the structure and composition of centromeric regions in the genus Coffea, we annotated and classified Centromeric Retrotransposons sequences from the allotetraploid C. arabica genome and its two diploid ancestors: Coffea canephora and C. eugenioides. Ten distinct CRC (Centromeric Retrotransposons in Coffea) families were found. The sequence mapping and FISH experiments of CRC Reverse Transcriptase domains in C. canephora, C. eugenioides, and C. arabica clearly indicate a strong and specific targeting mainly onto proximal chromosome regions, which can be associated also with heterochromatin. PacBio genome sequence analyses of putative centromeric regions on C. arabica and C. canephora chromosomes showed an exceptional density of one family of CRC elements, and the complete absence of satellite arrays, contrasting with usual structure of plant centromeres. Altogether, our data suggest a specific centromere organization in Coffea, contrasting with other plant genomes.
Project description:Long Terminal Repeat (LTR) retrotransposons constitute a significant part of eukaryotic genomes and play an important role in genome evolution especially in plants. Jute is an important fiber crop with a large genome of 1,250 Mbps. This genome is still mostly unexplored. In this study we aimed at identifying and characterizing the LTR retrotransposons of jute with a view to understanding the jute genome better. In this study, the Reverse Transcriptase domain of Ty1-copia and Ty3-gypsy LTR retrotransposons of jute were amplified by degenerate primers and their expressions were examined by reverse transcription PCR. Copy numbers of reverse transcriptase (RT) genes of Ty1-copia and Ty3-gypsy elements were determined by dot blot analysis. Sequence analysis revealed higher heterogeneity among Ty1-copia retrotransposons than Ty3-gypsy and clustered each of them in three groups. Copy number of RT genes in Ty1-copia was found to be higher than that of Ty3-gypsy elements from dot blot hybridization. Cumulatively Ty1-copia and Ty3-gypsy may constitute around 19% of the jute genome where two groups of Ty1-copia were found to be transcriptionally active. Since the LTR retrotransposons constitute a large portion of jute genome, these findings imply the importance of these elements in the evolution of jute genome.
Project description:The most abundant transposable elements (TEs) in plant genomes are Class I long terminal repeat (LTR) retrotransposons represented by superfamilies gypsy and copia Amplification of these superfamilies directly impacts genome structure and contributes to differential patterns of genome size evolution among plant lineages. Utilizing short-read Illumina data and sequence information from a panel of Helianthus annuus (sunflower) full-length gypsy and copia elements, we explore the contribution of these sequences to genome size variation among eight diploid Helianthus species and an outgroup taxon, Phoebanthus tenuifolius We also explore transcriptional dynamics of these elements in both leaf and bud tissue via RT-PCR. We demonstrate that most LTR retrotransposon sublineages (i.e., families) display patterns of similar genomic abundance across species. A small number of LTR retrotransposon sublineages exhibit lineage-specific amplification, particularly in the genomes of species with larger estimated nuclear DNA content. RT-PCR assays reveal that some LTR retrotransposon sublineages are transcriptionally active across all species and tissue types, whereas others display species-specific and tissue-specific expression. The species with the largest estimated genome size, H. agrestis, has experienced amplification of LTR retrotransposon sublineages, some of which have proliferated independently in other lineages in the Helianthus phylogeny.