Project description:Although large exons cannot be readily recognized by the spliceosome, many are evolutionarily conserved and constitutively spliced for inclusion in the processed transcript. Furthermore, whether large exons may be enriched in a certain subset of proteins, or mediate specific functions, has remained unclear. Here, we identify a set of nearly 3,000 SRSF3-dependent large constitutive exons (S3-LCEs) in human and mouse cells. These exons are enriched for cytidine-rich sequence motifs, which bind and recruit the splicing factors hnRNP K and SRSF3. We find that hnRNP K suppresses S3-LCE splicing, an effect that is mitigated by SRSF3 to thus achieve constitutive splicing of S3-LCEs. S3-LCEs are enriched in genes for components of transcription machineries, including mediator and BAF complexes, and frequently contain intrinsically disordered regions (IDRs). In a subset of analyzed S3-LCE-containing transcription factors, SRSF3 depletion leads to deletion of the IDRs due to S3-LCE exon skipping, thereby disrupting phase-separated assemblies of these factors. Cytidine enrichment in large exons introduces proline/serine codon bias in intrinsically disordered regions and appears to have been evolutionarily acquired in vertebrates. We propose that layered splicing regulation by hnRNP K and SRSF3 ensures proper phase-separation of these S3-LCE-containing transcription factors in vertebrates.
Project description:The tightly regulated production of distinct erythrocyte protein 4.1R isoforms involves differential splicing of 3 mutually exclusive first exons (1A, 1B, 1C) to the alternative 3' splice sites (ss) of exon 2'/2. Here, we demonstrate that exon 1 and 2'/2 splicing diversity is regulated by a transcription-coupled splicing mechanism. We also implicate distinctive regulatory elements that promote the splicing of exon 1A to the distal 3' ss and exon 1B to the proximal 3' ss in murine erythroleukemia cells. A hybrid minigene driven by cytomegalovirus promoter mimicked 1B-promoter-driven splicing patterns but differed from 1A-promoter-driven splicing patterns, suggesting that promoter identity affects exon 2'/2 splicing. Furthermore, splicing factor SF2/ASF ultraviolet (UV) cross-linked to the exon 2'/2 junction CAGAGAA, a sequence that overlaps the distal U2AF(35)-binding 3' ss. Consequently, depletion of SF2/ASF allowed exon 1B to splice to the distal 3' ss but had no effect on exon 1A splicing. These findings identify for the first time that an SF2/ASF binding site also can serve as a 3' ss in a transcript-dependent manner. Taken together, our results suggest that 4.1R gene expression involves transcriptional regulation coupled with a complex splicing regulatory network.
Project description:Tight regulation of gene transcription and mRNA splicing is essential for plant growth and development. Here we demonstrate that a plant-specific protein, EMBRYO DEFECTIVE 1579 (EMB1579), controls multiple growth and developmental processes in Arabidopsis. We demonstrate that EMB1579 forms liquid-like condensates both in vitro and in vivo, and the formation of normal-sized EMB1579 condensates is crucial for its cellular functions. We found that some chromosomal and RNA-related proteins interact with EMB1579 compartments, and loss of function of EMB1579 affects global gene transcription and mRNA splicing. Using floral transition as a physiological process, we demonstrate that EMB1579 is involved in FLOWERING LOCUS C (FLC)-mediated repression of flowering. Interestingly, we found that EMB1579 physically interacts with a homologue of Drosophila nucleosome remodeling factor 55-kDa (p55) called MULTIPLE SUPPRESSOR OF IRA 4 (MSI4), which has been implicated in repressing the expression of FLC by forming a complex with DNA Damage Binding Protein 1 (DDB1) and Cullin 4 (CUL4). This complex, named CUL4-DDB1MSI4, physically associates with a CURLY LEAF (CLF)-containing Polycomb Repressive Complex 2 (CLF-PRC2). We further demonstrate that EMB1579 interacts with CUL4 and DDB1, and EMB1579 condensates can recruit and condense MSI4 and DDB1. Furthermore, emb1579 phenocopies msi4 in terms of the level of H3K27 trimethylation on FLC. This allows us to propose that EMB1579 condensates recruit and condense CUL4-DDB1MSI4 complex, which facilitates the interaction of CUL4-DDB1MSI4 with CLF-PRC2 and promotes the role of CLF-PRC2 in establishing and/or maintaining the level of H3K27 trimethylation on FLC. Thus, we report a new mechanism for regulating plant gene transcription, mRNA splicing, and growth and development.
Project description:One challenge faced by scientists from the alternative RNA splicing field is to decode the cooperative or antagonistic effects of splicing factors (SFs) to understand and eventually predict splicing outcomes on a genome-wide scale. In this manuscript, we introduce SplicingLore, an open-access database and web resource that help to fill this gap in a straightforward manner. The database contains a collection of RNA-sequencing-derived lists of alternative exons regulated by a total of 75 different SFs. All datasets were processed in a standardized manner, ensuring valid comparisons and correlation analyses. The user can easily retrieve a factor-specific set of differentially included exons from the database or provide a list of exons and search which SF(s) control(s) their inclusion. Our simple workflow is fast and easy to run, and it ensures a reliable calculation of correlation scores between the tested datasets. As a proof of concept, we predicted and experimentally validated a novel functional cooperation between the RNA helicases DDX17 and DDX5 and the heterogeneous nuclear ribonucleoprotein C (HNRNPC) protein. SplicingLore is available at https://splicinglore.ens-lyon.fr/. Database URL: https://splicinglore.ens-lyon.fr/.
Project description:Transcription and splicing are intrinsically coupled. Alternative splicing of internal exons can fine-tune gene expression through a recently described phenomenon called exon-mediated activation of transcription starts (EMATS). However, the association of this phenomenon with human diseases remains unknown. Here, we develop a strategy to activate gene expression through EMATS and demonstrate its potential for treatment of genetic diseases caused by loss of expression of essential genes. We first identified a catalog of human EMATS genes and provide a list of their pathological variants. To test if EMATS can be used to activate gene expression, we constructed stable cell lines expressing a splicing reporter based on the alternative splicing of motor neuron 2 (SMN2) gene. Using small molecules and antisense oligonucleotides (ASOs) currently used for treatment of spinal muscular atrophy, we demonstrated that increase of inclusion of alternative exons can trigger an activation of gene expression up to 45-fold by enhancing transcription in EMATS-like genes. We observed the strongest effects in genes under the regulation of weak human promoters located proximal to highly included skipped exons.
Project description:Components of the transcription machinery can undergo liquid-liquid phase separation, but the functional importance of phase-separated condensates in transcriptional control is not well understood. Here we report that disease-causing mutations in several transcription factors (TFs) alter the phase separation capacity of those TFs. We first demonstrate that the Hoxd13 TF, and its intrinsically disordered N-terminus form phase-separated condensates. Expansions of a polyalanine repeat, which cause hereditary synpolydactyly in humans, facilitate phase separation of Hoxd13, and alter the transcriptional program of several cell types in a cell-specific manner in vivo. Disease-associated expansions of aminoacid repeats in intrinsically disordered regions of other TFs were similarly found to alter phase separation. These results suggest that aberrant phase separation of transcriptional regulators may underlie a spectrum of human pathologies. The paper is available at https://doi.org/10.1016/j.cell.2020.04.018
Project description:BackgroundAlternatively spliced exons play an important role in the diversification of gene function in most metazoans and are highly regulated by conserved motifs in exons and introns. Two contradicting properties have been associated to evolutionary conserved alternative exons: higher sequence conservation and higher rate of non-synonymous substitutions, relative to constitutive exons. In order to clarify this issue, we have performed an analysis of the evolution of alternative and constitutive exons, using a large set of protein coding exons conserved between human and mouse and taking into account the conservation of the transcript exonic structure. Further, we have also defined a measure of the variation of the arrangement of exonic splicing enhancers (ESE-conservation score) to study the evolution of splicing regulatory sequences. We have used this measure to correlate the changes in the arrangement of ESEs with the divergence of exon and intron sequences.ResultsWe find evidence for a relation between the lack of conservation of the exonic structure and the weakening of the sequence evolutionary constraints in alternative and constitutive exons. Exons in transcripts with non-conserved exonic structures have higher synonymous (dS) and non-synonymous (dN) substitution rates than exons in conserved structures. Moreover, alternative exons in transcripts with non-conserved exonic structure are the least constrained in sequence evolution, and at high EST-inclusion levels they are found to be very similar to constitutive exons, whereas alternative exons in transcripts with conserved exonic structure have a dS significantly lower than average at all EST-inclusion levels. We also find higher conservation in the arrangement of ESEs in constitutive exons compared to alternative ones. Additionally, the sequence conservation at flanking introns remains constant for constitutive exons at all ESE-conservation values, but increases for alternative exons at high ESE-conservation values.ConclusionWe conclude that most of the differences in dN observed between alternative and constitutive exons can be explained by the conservation of the transcript exonic structure. Low dS values are more characteristic of alternative exons with conserved exonic structure, but not of those with non-conserved exonic structure. Additionally, constitutive exons are characterized by a higher conservation in the arrangement of ESEs, and alternative exons with an ESE-conservation similar to that of constitutive exons are characterized by a conservation of the flanking intron sequences higher than average, indicating the presence of more intronic regulatory signals.