Project description:Genome-wide association studies have linked common variation in ZNF804A with an increased risk of schizophrenia. However, little is known about the biology of ZNF804A and its role in schizophrenia. Here, we investigate the function of ZNF804A using a variety of complementary molecular techniques. We show that ZNF804A is a nuclear protein that interacts with neuronal RNA splicing factors and RNA-binding proteins including RBFOX1, which is also associated with schizophrenia, CELF3/4, components of the ubiquitin-proteasome system and the ZNF804A paralog, GPATCH8. GPATCH8 also interacts with splicing factors and is localized to nuclear speckles indicative of a role in pre-messenger RNA (mRNA) processing. Sequence analysis showed that GPATCH8 contains ultraconserved, alternatively spliced poison exons that are also regulated by RBFOX proteins. ZNF804A knockdown in SH-SY5Y cells resulted in robust changes in gene expression and pre-mRNA splicing converging on pathways associated with nervous system development, synaptic contact, and cell adhesion. We observed enrichment (P = 1.66 × 10-9) for differentially spliced genes in ZNF804A-depleted cells among genes that contain RBFOX-dependent alternatively spliced exons. Differentially spliced genes in ZNF804A-depleted cells were also enriched for genes harboring de novo loss of function mutations in autism spectrum disorder (P = 6.25 × 10-7, enrichment 2.16) and common variant alleles associated with schizophrenia (P = .014), bipolar disorder and schizophrenia (P = .003), and autism spectrum disorder (P = .005). These data suggest that ZNF804A and its paralogs may interact with neuronal-splicing factors and RNA-binding proteins to regulate the expression of a subset of synaptic and neurodevelopmental genes.
Project description:DDX5 and DDX17 are DEAD-box RNA helicase paralogs which regulate several aspects of gene expression, especially transcription and splicing, through incompletely understood mechanisms. A transcriptome analysis of DDX5/DDX17-depleted human cells confirmed the large impact of these RNA helicases on splicing and revealed a widespread deregulation of 3' end processing. In silico analyses and experiments in cultured cells showed the binding and functional contribution of the genome organizing factor CTCF to chromatin sites at or near a subset of DDX5/DDX17-dependent exons that are characterized by a high GC content and a high density of RNA Polymerase II. We propose the existence of an RNA helicase-dependent relationship between CTCF and the dynamics of transcription across DNA and/or RNA structured regions, that contributes to the processing of internal and terminal exons. Moreover, local DDX5/DDX17-dependent chromatin loops spatially connect RNA helicase-regulated exons with their cognate promoter, and we provide the first direct evidence that de novo gene looping modifies alternative splicing and polyadenylation. Overall our findings uncover the impact of DDX5/DDX17-dependent chromatin folding on pre-messenger RNA processing.
Project description:Messenger RNA (mRNA) processing plays important roles in gene expression in all domains of life. A number of cases of mRNA cleavage have been documented in Archaea, but available data are fragmentary. We have examined RNAs present in Methanocaldococcus (Methanococcus) jannaschii for evidence of RNA processing upstream of protein-coding genes. Of 123 regions covered by the data, 31 were found to be processed, with 30 including a cleavage site 12-16 nucleotides upstream of the corresponding translation start site. Analyses with 3'-RACE (rapid amplification of cDNA ends) and 5'-RACE indicate that the processing is endonucleolytic. Analyses of the sequences surrounding the processing sites for functional sites, sequence motifs, or potential RNA secondary structure elements did not reveal any recurring features except for an AUG translation start codon and (in most cases) a ribosome binding site. These properties differ from those of all previously described mRNA processing systems. Our data suggest that the processing alters the representation of various genes in the RNA pool and therefore, may play a significant role in defining the balance of proteins in the cell.
Project description:Retinitis pigmentosa (RP) is the most common inherited retinal disease characterized by progressive degeneration of photoreceptors and/or retinal pigment epithelium that eventually results in blindness. Mutations in pre-mRNA processing factors (PRPF3, 4, 6, 8, 31, SNRNP200, and RP9) have been linked to 15-20% of autosomal dominant RP (adRP) cases. Current evidence indicates that PRPF mutations cause retinal specific global spliceosome dysregulation, leading to mis-splicing of numerous genes that are involved in a variety of retina-specific functions and/or general biological processes, including phototransduction, retinol metabolism, photoreceptor disk morphogenesis, retinal cell polarity, ciliogenesis, cytoskeleton and tight junction organization, waste disposal, inflammation, and apoptosis. Importantly, additional PRPF functions beyond RNA splicing have been documented recently, suggesting a more complex mechanism underlying PRPF-RPs driven disease pathogenesis. The current review focuses on the key RP-PRPF genes, depicting the current understanding of their roles in RNA splicing, impact of their mutations on retinal cell's transcriptome and phenome, discussed in the context of model species including yeast, zebrafish, and mice. Importantly, information on PRPF functions beyond RNA splicing are discussed, aiming at a holistic investigation of PRPF-RP pathogenesis. Finally, work performed in human patient-specific lab models and developing gene and cell-based replacement therapies for the treatment of PRPF-RPs are thoroughly discussed to allow the reader to get a deeper understanding of the disease mechanisms, which we believe will facilitate the establishment of novel and better therapeutic strategies for PRPF-RP patients.
Project description:Adult-onset autosomal dominant leukodystrophy (ADLD) is a slowly progressive neurological disorder characterized by autonomic dysfunction, followed by cerebellar and pyramidal features. ADLD is caused by duplication of the lamin B1 gene (LMNB1), which leads to its increased expression. The molecular pathways involved in the disease are still poorly understood. Hence, we analyzed global gene expression in fibroblasts and whole blood of LMNB1 duplication carriers and used Gene Set Enrichment Analysis to explore their gene signatures. We found that LMNB1 duplication is associated with dysregulation of genes involved in the immune system, neuronal and skeletal development. Genes with an altered transcriptional profile clustered in specific genomic regions. Among the dysregulated genes, we further studied the role of RAVER2, which we found to be overexpressed at mRNA and protein level. RAVER2 encodes a putative trans regulator of the splicing repressor polypyrimidine tract binding protein (PTB) and is likely implicated in alternative splicing regulation. Functional studies demonstrated an abnormal splicing pattern of several PTB-target genes and of the myelin protein gene PLP1, previously demonstrated to be involved in ADLD. Mutant mice with different lamin B1 expression levels confirmed that Raver2 expression is dependent on lamin B1 in neural tissue and determines an altered splicing pattern of PTB-target genes and Plp1. Overall our results demonstrate that deregulation of lamin B1 expression induces modified splicing of several genes, likely driven by raver-2 overexpression, and suggest that an alteration of mRNA processing could be a pathogenic mechanism in ADLD.
Project description:Splicing and nuclear export are vital components of eukaryotic gene expression. Defects in splicing due to cis mutations are known to cause a number of human diseases. Here we present a dual reporter system that can be used to look at splicing or export deficiencies resulting from an insufficiency in components of the cotranscriptional machinery. The constructs use a bidirectional promoter to coexpress a test reporter and a control reporter. In the splicing construct, maximal expression of the test reporter is dependent on efficient splicing and splicing-related nuclear export, whereas the control reporter is an intronless complementary DNA expression cassette. The dual reporters allow a robust ratiometric output that is independent of cell number or transfection efficiency. Therefore, our construct is internally controlled and amenable to high-throughput analysis. As a counterscreen, we have a nonsplicing control construct in which neither reporter bears an intron. We demonstrate the sensitivity of our construct to defects in nuclear export by depleting UAP56 and NXF1, essential components of the cotranscriptional machinery.
Project description:A major pathway of eukaryotic messenger RNA (mRNA) turnover begins with deadenylation, followed by decapping and 5' to 3' exonucleolytic decay. We provide evidence that mRNA decapping and 5' to 3' degradation occur in discrete cytoplasmic foci in yeast, which we call processing bodies (P bodies). First, proteins that activate or catalyze decapping are concentrated in P bodies. Second, inhibiting mRNA turnover before decapping leads to loss of P bodies; however, inhibiting turnover at, or after, decapping, increases the abundance and size of P bodies. Finally, mRNA degradation intermediates are localized to P bodies. These results define the flux of mRNAs between polysomes and P bodies as a critical aspect of cytoplasmic mRNA metabolism and a possible site for regulation of mRNA degradation.
Project description:Sickle cell disease results from a point mutation in exon 1 of the β-globin gene (total 3 exons). Replacing sickle β-globin exon 1 (and exon 2) with a normal sequence by trans-splicing is a potential therapeutic strategy. Therefore, this study sought to develop trans-splicing targeting β-globin pre-messenger RNA among human erythroid cells. Binding domains from random β-globin sequences were comprehensively screened. Six candidates had optimal binding, and all targeted intron 2. Next, lentiviral vectors encoding RNA trans-splicing molecules were constructed incorporating a unique binding domain from these candidates, artificial 5' splice site, and γ-globin cDNA, and trans-splicing was evaluated in CD34+ cell-derived erythroid cells from healthy individuals. Lentiviral transduction was efficient, with vector copy numbers of 9.7 to 15.3. The intended trans-spliced RNA product, including exon 3 of endogenous β-globin and γ-globin, was detected at the molecular level. Trans-splicing efficiency was improved to 0.07-0.09% by longer binding domains, including the 5' splice site of intron 2. In summary, screening was performed to select efficient binding domains for trans-splicing. Detectable levels of trans-splicing were obtained for endogenous β-globin RNA in human erythroid cells. These methods provide the basis for future trans-splicing directed gene therapy.
Project description:BackgroundAnnotation of eukaryotic genomes is a complex endeavor that requires the integration of evidence from multiple, often contradictory, sources. With the ever-increasing amount of genome sequence data now available, methods for accurate identification of large numbers of genes have become urgently needed. In an effort to create a set of very high-quality gene models, we used the sequence of 5,000 full-length gene transcripts from Arabidopsis to re-annotate its genome. We have mapped these transcripts to their exact chromosomal locations and, using alignment programs, have created gene models that provide a reference set for this organism.ResultsApproximately 35% of the transcripts indicated that previously annotated genes needed modification, and 5% of the transcripts represented newly discovered genes. We also discovered that multiple transcription initiation sites appear to be much more common than previously known, and we report numerous cases of alternative mRNA splicing. We include a comparison of different alignment software and an analysis of how the transcript data improved the previously published annotation.ConclusionsOur results demonstrate that sequencing of large numbers of full-length transcripts followed by computational mapping greatly improves identification of the complete exon structures of eukaryotic genes. In addition, we are able to find numerous introns in the untranslated regions of the genes.