Project description:The human transcriptome consists of various RNA biotypes including multiple types of non-coding RNAs (ncRNAs). Current ncRNA compendia remain incomplete partially because they are almost exclusively derived from the interrogation of small- and polyadenylated RNAs. Here, we present a more comprehensive atlas of the human transcriptome that is derived from matching polyA-, total-, and small-RNA profiles of a heterogenous collection of nearly 300 human tissues and cell lines. We report thousands of novel RNA species across all major RNA biotypes, including a hitherto poorly-cataloged class of non-polyadenylated single-exon long non-coding RNAs. In addition, we exploit intron abundance estimates from total RNA-sequencing to predict the regulatory potential of various non-coding RNAs. Our study represents a substantial expansion of the current catalogue of human ncRNAs and their regulatory interactions. All data and results are accessible through the R2 webtool and serve as a basis to further explore RNA biology and function.
Project description:Non-coding RNAs play an important role in the pathogenesis of human malignancies. So far, microRNAs have been investigated in detail in clear cell renal cell carcinoma, but the knowledge about other small non-coding RNAs like snoRNA, tRNA and piRNA remains small. There is increasing evidence that these non-coding RNAs are also involved in regulation of gene expression, and we therefore performed small RNA sequencing in a cohort of 18 corresponding normal and malignant tissue samples from patients with clear cell renal cell carcinoma. We observed differential expression of microRNAs, but also some dysregulated tRNA and snoRNA in clear cell renal cell carcinoma tissue
Project description:Small RNAs, including microRNAs (miRNAs), phased secondary small interfering RNAs (phasiRNA), and heterochromatic small interfering RNAs (hc-siRNA) are an essential component of gene regulation. To establish a broad potato small RNA atlas, we constructed an expression atlas of leaves, flowers, roots, and tubers of Desiree and Eva, which are commercially important potato (Solanum tuberosum) cultivars. All small RNAs identified were observed to be conserved between both cultivars, supporting the hypothesis that small RNAs have a low evolutionary rate and are mostly conserved between lineages. However, we also found that a few miRNAs showed differential accumulation between the two potato cultivars, and that hc-siRNAs have a tissue specific expression. We further identified dozens of reproductive and non-reproductive phasiRNAs originating from coding and noncoding regions that appeared to exhibit tissue-specific expression. Together, this study provides an extensive small RNA profiling of different potato tissues that might be used as a resource for future investigations.
Project description:small non-coding RNA-seq was performed to study the transcriptomic changes in human lung tissue post infection with either Influenza A virus (IAV), or Pseudomonas aeruginosa (PA) or Mycobacterium bovis (BCG)
Project description:Long non-coding RNAs (lncRNAs) are defined as non-protein-coding transcripts that are at least 200 nucleotides long. They are known to play pivotal roles in regulating gene expression, especially during stress responses in plants. We used a large collection of in-house transcriptome data from various soybean (Glycine max and Glycine soja) tissues treated under different conditions to perform a comprehensive identification of soybean lncRNAs. We also retrieved publicly available soybean transcriptome data that were of sufficient quality and sequencing depth to enrich our analysis. In total, RNA-seq data of 332 samples were used for this analysis. An integrated reference-based, de novo transcript assembly was developed that identified ~69,000 lncRNA gene loci. We showed that lncRNAs are distinct from both protein-coding transcripts and genomic background noise in terms of length, number of exons, transposable element composition, and sequence conservation level across legume species. The tissue-specific and time-specific transcriptional responses of the lncRNA genes under some stress conditions may suggest their biological relevance. The transcription start sites of lncRNA gene loci tend to be close to their nearest protein-coding genes, and they may be transcriptionally related to the protein-coding genes, particularly for antisense and intronic lncRNAs. A previously unreported subset of small peptide-coding transcripts was identified from these lncRNA loci via tandem mass spectrometry, which paved the way for investigating their functional roles. Our results also highlight the current inadequacy of the bioinformatic definition of lncRNA, which excludes those lncRNA gene loci with small open reading frames (ORFs) from being regarded as protein-coding.