An anciently diverged family of RNA binding proteins maintain correct splicing of ultra-long exons through cryptic splice site repression [iCLIP-seq]
Ontology highlight
ABSTRACT: An anciently diverged family of RNA binding proteins maintain correct splicing of ultra-long exons through cryptic splice site repression [iCLIP-seq]
Project description:We previously showed that the germ cell specific nuclear protein RBMXL2 represses cryptic splicing patterns during meiosis and is required for male fertility. RBMXL2 evolved from the X-linked RBMX gene, which is silenced during meiosis due to sex chromosome inactivation. It has been unknown whether RBMXL2 provides a direct replacement for RBMX in meiosis, or whether RBMXL2 evolved to deal with the transcriptionally permissive environment of meiosis. Here we find that RBMX primarily operates as a splicing repressor in somatic cells, and specifically regulates a distinct class of exons that exceed the median human exon size. RBMX protein-RNA interactions are enriched within ultra-long exons, particularly within genes involved in genome stability, and repress the selection of cryptic splice sites that would compromise gene function. These similarities in overall function suggested that RBMXL2 might replace the function of RBMX during meiosis. To test this prediction we carried out inducible expression of RBMXL2 and the more distantly related RBMY protein in somatic cells, finding each could rescue aberrant patterns of RNA processing in response to RBMX depletion. The C-terminal disordered domain of RBMXL2 is sufficient to rescue proper splicing control after RBMX depletion. Our data indicate that RBMX and RBMXL2 have parallel roles in somatic tissues and the germline that must have have been conserved over at least 200 million years of mammalian evolution. We propose RBMX family proteins are particularly important for the splicing inclusion of ultra-long exons because these would be particularly susceptible to disruption by cryptic splice site selection.
Project description:An anciently diverged family of RNA binding proteins maintain correct splicing of ultra-long exons through cryptic splice site repression
Project description:An anciently diverged family of RNA binding proteins maintain correct splicing of ultra-long exons through cryptic splice site repression [RNA-seq]
Project description:We previously showed that the germ cell specific nuclear protein RBMXL2 represses cryptic splicing patterns during meiosis and is required for male fertility. RBMXL2 evolved from the X-linked RBMX gene, which is silenced during meiosis due to sex chromosome inactivation. It has been unknown whether RBMXL2 provides a direct replacement for RBMX in meiosis, or whether RBMXL2 evolved to deal with the transcriptionally permissive environment of meiosis. Here we find that RBMX primarily operates as a splicing repressor in somatic cells, and specifically regulates a distinct class of exons that exceed the median human exon size. RBMX protein-RNA interactions are enriched within ultra-long exons, particularly within genes involved in genome stability, and repress the selection of cryptic splice sites that would compromise gene function. These similarities in overall function suggested that RBMXL2 might replace the function of RBMX during meiosis. To test this prediction we carried out inducible expression of RBMXL2 and the more distantly related RBMY protein in somatic cells, finding each could rescue aberrant patterns of RNA processing in response to RBMX depletion. The C-terminal disordered domain of RBMXL2 is sufficient to rescue proper splicing control after RBMX depletion. Our data indicate that RBMX and RBMXL2 have parallel roles in somatic tissues and the germline that must have have been conserved over at least 200 million years of mammalian evolution. We propose RBMX family proteins are particularly important for the splicing inclusion of ultra-long exons because these would be particularly susceptible to disruption by cryptic splice site selection.
Project description:Previously, we showed that the germ cell-specific nuclear protein RBMXL2 represses cryptic splicing patterns during meiosis and is required for male fertility (Ehrmann et al., 2019). Here, we show that in somatic cells the similar yet ubiquitously expressed RBMX protein has similar functions. RBMX regulates a distinct class of exons that exceed the median human exon size. RBMX protein-RNA interactions are enriched within ultra-long exons, particularly within genes involved in genome stability, and repress the selection of cryptic splice sites that would compromise gene function. The RBMX gene is silenced during male meiosis due to sex chromosome inactivation. To test whether RBMXL2 might replace the function of RBMX during meiosis we induced expression of RBMXL2 and the more distantly related RBMY protein in somatic cells, finding each could rescue aberrant patterns of RNA processing caused by RBMX depletion. The C-terminal disordered domain of RBMXL2 is sufficient to rescue proper splicing control after RBMX depletion. Our data indicate that RBMX and RBMXL2 have parallel roles in somatic tissues and the germline that must have been conserved for at least 200 million years of mammalian evolution. We propose RBMX family proteins are particularly important for the splicing inclusion of some ultra-long exons with increased intrinsic susceptibility to cryptic splice site selection.
Project description:Eukaryotic cells express a large number of transcripts from a single gene due to alternative splicing. Despite hundreds of thousands of splice isoforms being annotated in databases, it has been reported that the current exon catalogs remain incomplete. At the same time, introns of human protein-coding genes contain a large number of evolutionarily conserved elements with unknown function. Here, we explore the possibility that some of them represent cryptic exons that are expressed in rare conditions. We identified a group of cryptic exons that are similar to the annotated exons in terms of evolutionary conservation and RNA-seq read coverage in the GTEx dataset. Most of them were poison, i.e. generated an NMD isoform upon inclusion, and many showed signs of tissue-specific and cancer-specific expression and regulation. We performed RNA-seq in A549 cell line treated with cycloheximide to inactivate NMD, and confirmed using qPCR that seven of eight exons tested are, indeed, expressed. This study shows that introns of human protein-coding genes contain cryptic poison exons, which reside in conserved intronic regions and remain not fully annotated due to insufficient representation in RNA-seq libraries.
Project description:Transactive response DNA binding protein 43 kDa (TDP-43) is a protein that regulates splicing, the loss of which underlies the pathophysiology of a variety of neurodegenerative disorders, including amyotrophic lateral sclerosis and frontotemporal dementia (ALS-FTD). In disease states, TDP-43 is cleared from the nucleus of neurons, leading to the loss of its splicing repressor function and the aberrant inclusion of cryptic exons. Frequent inclusion of premature termination codons (PTCs) by cryptic exons leads to degradation of cryptic mRNA by nonsense-mediated mRNA decay (NMD). However, detection of cryptic exons relies on inefficient NMD; therefore cryptic exons efficiently targeted and eliminated by NMD remain undetectable through conventional RNA-sequencing. Here we generated a comprehensive set of neuronal targets of TDP-43 in human IPSC-derived i3Neurons (i3N) by combining TDP-43 knockdown with inhibition of several factors regulating NMD, and identified various cryptic exons that were either underestimated or entirely undetected by TDP-43 knockdown alone.
Project description:PurposeWe aim to report noncoding pathogenic variants in patients with FRMD7-related infantile nystagmus (FIN).MethodsGenome sequencing (n = 2 families) and reanalysis of targeted panel next generation sequencing (n = 2 families) was performed in genetically unsolved cases of suspected FIN. Previous sequence analysis showed no pathogenic coding variants in genes associated with infantile nystagmus. SpliceAI, SpliceRover, and Alamut consensus programs were used to annotate noncoding variants. Minigene splicing assay was performed to confirm aberrant splicing. In silico analysis of exonic splicing enhancer and silencer was also performed.ResultsFRMD7 intronic variants were identified based on genome sequencing and targeted next-generation sequencing analysis. These included c.285-12A>G (pedigree 1), c.284+63T>A (pedigrees 2 and 3), and c. 383-1368A>G (pedigree 4). All variants were absent in gnomAD, and the both c.285-12A>G and c.284+63T>A variants were predicted to enhance new splicing acceptor gains with SpliceAI, SpliceRover, and Alamut consensus approaches. However, the c.383-1368 A>G variant only had a significant impact score on the SpliceRover program. The c.383-1368A>G variant was predicted to promote pseudoexon inclusion by binding of exonic splicing enhancer. Aberrant exonizations were validated through minigene constructs, and all variants were segregated in the families.ConclusionsDeep learning-based annotation of noncoding variants facilitates the discovery of hidden genetic variations in patients with FIN. This study provides evidence of effectiveness of combined deep learning-based splicing tools to identify hidden pathogenic variants in previously unsolved patients with infantile nystagmus.Translational relevanceThese results demonstrate robust analysis using two deep learning splicing predictions and in vitro functional study can lead to finding hidden genetic variations in unsolved patients.