Genomics

Dataset Information

0

Direct full-length RNA sequencing reveals unexpected transcriptome complexity during C. elegans development


ABSTRACT: High throughput RNA sequencing (RNA-seq) using cDNA has played a key role in delineating transcriptome complexity, including alternative transcription initiation, splicing, polyadenylation and base modification. However, the reads derived from current RNA-seq technologies are usually short and deprived of information on modification during reverse transcription, compromising their potential in defining transcriptome complexity. Here we applied a direct RNA sequencing method with ultra-long reads from Oxford Nanopore Technologies (ONT) to study the transcriptome complexity in C. elegans. We sequenced native poly-A tailed mRNAs by generating approximately six million reads from embryos, L1 larvae and young adult animals, with average read lengths ranging from 900 to 1,100 bps across stages. Around half of the reads represent full-length transcripts, judged by the presence of a splicing-leader or their full coverage of an existing transcript. To take advantage of the full-length transcripts in defining transcriptome complexity, we devised a novel algorithm to predict novel isoforms or group them with exiting isoforms using their mapping tracks rather than the existing intron/exon structures, which allowed us to identify roughly 57,000 novel isoforms and recover at least 26,000 out of the 33,500 existing isoforms. Intriguingly, stage-specific expression at the level of gene and isoform demonstrates little correlation. Finally, we observed an elevated level of modification in all bases in the coding region relative to the UTR. Taken together, the ONT long reads are expected to deliver new insights into RNA processing and modification and their underlying biology.

ORGANISM(S): Caenorhabditis elegans

PROVIDER: GSE130044 | GEO | 2019/09/10

REPOSITORIES: GEO

Similar Datasets

2023-09-18 | GSE212569 | GEO
2023-09-18 | GSE212570 | GEO
2023-09-18 | GSE212573 | GEO
2023-09-18 | GSE212572 | GEO
2023-09-18 | GSE212571 | GEO
2020-08-02 | GSE144828 | GEO
2020-03-18 | GSE147118 | GEO
2019-12-15 | GSE141641 | GEO
2019-06-15 | GSE132766 | GEO
2019-03-14 | E-MTAB-7778 | biostudies-arrayexpress