Project description:Compared to other model organisms and despite the clinical relevance of the pathogenic yeast Candida albicans, no comprehensive analysis has been done to provide experimental support of its in silico-based genome annotation. Here we have undertaken a genome-wide experimental annotation to accurately uncover the transcriptional landscape of the pathogenic yeast C. albicans using strand-specific high-density tiling arrays. RNAs were purified from cells growing under conditions relevant to C. albicans pathogenicity, including biofilm, lab-grown yeast and serum-induced hyphae as well as cells isolated from the mouse caecum. This work provides a genome-wide experimental validation for a large number of predicted ORFs for which transcription had not been detected by other approaches. Additionally, we identified more than 2000 novel transcriptional segments, including new ORFs and exons, non-coding RNAs (ncRNA) as well as convincing cases of antisense gene transcription. We also characterized the 5’- and 3’-untranslated regions (UTR) of expressed ORFs, and established that genes with long 5’UTRs are significantly enriched in regulatory functions controlling filamentous growth. Furthermore, we found that genomic regions adjacent to telomeres harbor a cluster of expressed ncRNAs. To validate and confirm new ncRNA candidates, we adapted an iterative strategy combining both genome-wide occupancy of the different subunits of RNA polymerases I, II and III, and expression data. This comprehensive approach allowed the identification of different families of ncRNA. In summary, we provide a comprehensive expression atlas that covers relevant C. albicans pathogenic developmental stages in addition to a discovery of new ORF and non-coding genetic elements.
Project description:Compared to other model organisms and despite the clinical relevance of the pathogenic yeast Candida albicans, no comprehensive analysis has been done to provide experimental support of its in silico-based genome annotation. Here we have undertaken a genome-wide experimental annotation to accurately uncover the transcriptional landscape of the pathogenic yeast C. albicans using strand-specific high-density tiling arrays. RNAs were purified from cells growing under conditions relevant to C. albicans pathogenicity, including biofilm, lab-grown yeast and serum-induced hyphae as well as cells isolated from the mouse caecum. This work provides a genome-wide experimental validation for a large number of predicted ORFs for which transcription had not been detected by other approaches. Additionally, we identified more than 2000 novel transcriptional segments, including new ORFs and exons, non-coding RNAs (ncRNA) as well as convincing cases of antisense gene transcription. We also characterized the 5’- and 3’-untranslated regions (UTR) of expressed ORFs, and established that genes with long 5’UTRs are significantly enriched in regulatory functions controlling filamentous growth. Furthermore, we found that genomic regions adjacent to telomeres harbor a cluster of expressed ncRNAs. To validate and confirm new ncRNA candidates, we adapted an iterative strategy combining both genome-wide occupancy of the different subunits of RNA polymerases I, II and III, and expression data. This comprehensive approach allowed the identification of different families of ncRNA. In summary, we provide a comprehensive expression atlas that covers relevant C. albicans pathogenic developmental stages in addition to a discovery of new ORF and non-coding genetic elements. We have undertaken a genome-wide experimental annotation to accurately uncover the transcriptional landscape of the pathogenic yeast C. albicans using strand-specific high-density tiling arrays. RNAs were purified from cells growing under conditions relevant to Candida albicans pathogenicity, including biofilm, lab-grown yeast and serum-induced hyphae as well as cells isolated from the mouse caecum. We also adapted a strategy in which genome-wide occupancy of different subunits of RNA polymerases (RNAP) I, II and III, is combined with expression data to annotate ncRNAs resulting from real transcriptional events. For this purpose we have performed ChIP-chip of subunits that represent the three RNAP machines in C. albicans cells growing in rich media (YPD) at 30°C. In this study, we performed peak detection only for RNA Polymerase III (Rpc82p). All detected peaks and their genomic features are included as a supplementary file on the Sample record (GSM561024).
Project description:Experiments conducted on this tiling array are used to (1) validate the frozen gene sets of the current genome annotation, (2) improve the predicted gene structures by empirically determining UTRs and intron-exon boundaries, identifying missing upstream, internal, and downstream exons and alternative transcripts, (3) propose gene structure models in transcribed regions containing no predicted genes and (4) delineate transcriptionally active regions of the genome from intergenic, intronic and genic regions. Signal to background ratios were determined by first calling probes that fluoresced at intensities greater than 99% of the random probes’ signal intensities; therefore only 1% of fluorescing experimental probes should be false positives.
Project description:Experiments conducted on this tiling array are used to (1) validate the frozen gene sets of the current genome annotation, (2) improve the predicted gene structures by empirically determining UTRs and intron-exon boundaries, identifying missing upstream, internal, and downstream exons and alternative transcripts, (3) propose gene structure models in transcribed regions containing no predicted genes and (4) delineate transcriptionally active regions of the genome from intergenic, intronic and genic regions. Signal to background ratios were determined by first calling probes that fluoresced at intensities greater than 99% of the random probes’ signal intensities; therefore only 1% of fluorescing experimental probes should be false positives.
Project description:Experiments conducted on this tiling array are used to (1) validate the frozen gene sets of the current genome annotation, (2) improve the predicted gene structures by empirically determining UTRs and intron-exon boundaries, identifying missing upstream, internal, and downstream exons and alternative transcripts, (3) propose gene structure models in transcribed regions containing no predicted genes and (4) delineate transcriptionally active regions of the genome from intergenic, intronic and genic regions. Signal to background ratios were determined by first calling probes that fluoresced at intensities greater than 99% of the random probes’ signal intensities; therefore, only 1% of fluorescing experimental probes should be false positives.
Project description:The sea urchin S. purpuratus is a model organism for study of the genomic control circuitry underlying embryonic development. We examined the complete repertoire of genes expressed in the S. purpuratus embryo, up to late gastrula stage, by means of high-resolution custom tiling arrays covering the whole genome. We detected complete spliced structures even for genes known to be expressed at low leveles in only a few cells. At least 11,000 to 12,000 genes are used in embryogenesis. These include most of the genes encoding transcription factors and signaling proteins, as well as some classes of general cytoskeletal and metabolic proteins, but only a minor fraction of genes encoding immune functions and sensory receptors. Thousands of small asymmetric transcripts of unknown function were also detected in intergenic regions throughout the genome. The tiling array data were used to correct and authenticate several thousand gene models during the genome annotation process. Keywords: high-resolution tiling array, sea urchin, embryo Keywords: other
Project description:Epigenetic modifications of chromatin serve an important role in regulating the expression and accessibility of genomic DNA. We report here a genomewide approach for fractionating yeast chromatin into two functionally distinct parts, one containing RNA polymerase II transcribed sequences, and the other comprising noncoding sequences and genes transcribed by RNA polymerases I and III. Noncoding regions could be further fractionated into promoters and segments lacking promoters. The observed separations were apparently based on differential crosslinking efficiency of chromatin in different genomic regions. The results reveal a genomewide molecular mechanism for marking promoters and genomic regions that have a license to be transcribed by RNA polymerase II, a previously unrecognized level of genomic complexity that may exist in all eukaryotes. Our approach has broad potential use as a tool for genome annotation and for the characterization of global changes in chromatin structure that accompany different genetic, environmental, and disease states. Keywords: Genomewide demarcation of RNA PolII transcription units by physical fractionation of chromatin- Intergenic enrichment
Project description:Epigenetic modifications of chromatin serve an important role in regulating the expression and accessibility of genomic DNA. We report here a genomewide approach for fractionating yeast chromatin into two functionally distinct parts, one containing RNA polymerase II transcribed sequences, and the other comprising noncoding sequences and genes transcribed by RNA polymerases I and III. Noncoding regions could be further fractionated into promoters and segments lacking promoters. The observed separations were apparently based on differential crosslinking efficiency of chromatin in different genomic regions. The results reveal a genomewide molecular mechanism for marking promoters and genomic regions that have a license to be transcribed by RNA polymerase II, a previously unrecognized level of genomic complexity that may exist in all eukaryotes. Our approach has broad potential use as a tool for genome annotation and for the characterization of global changes in chromatin structure that accompany different genetic, environmental, and disease states. Keywords: Genomewide mapping of regulatory elements through differential fractionation of crosslinked chromatin based on nucleosome occupancy.