Transcriptomics

Dataset Information

0

ESPRESSO: Robust discovery and quantification of transcript isoforms from error-prone long-read RNA-seq data


ABSTRACT: Long-read RNA sequencing (RNA-seq) holds great potential for characterizing transcriptome variation and full-length transcript isoforms, but the relatively high error rate of current long-read sequencing platforms poses a major challenge. We present ESPRESSO, a computational tool for robust discovery and quantification of transcript isoforms from error-prone long reads. ESPRESSO jointly considers alignments of all long reads aligned to a gene and uses error profiles of individual reads to improve the identification of splice junctions and the discovery of their corresponding transcript isoforms. On both a synthetic spike-in RNA sample and human RNA samples, ESPRESSO outperforms multiple contemporary tools in not only transcript isoform discovery but also transcript isoform quantification. In total, we generated and analyzed ~1.1 billion nanopore RNA-seq reads covering 30 human tissue samples and three human cell lines. ESPRESSO and its companion dataset provide a useful resource for studying the RNA repertoire of eukaryotic transcriptomes.

ORGANISM(S): Homo sapiens

PROVIDER: GSE192955 | GEO | 2022/10/14

REPOSITORIES: GEO

Similar Datasets

2020-03-18 | GSE147118 | GEO
2019-06-15 | GSE132766 | GEO
2021-05-05 | GSE155375 | GEO
2021-05-05 | GSE155920 | GEO
2021-05-05 | GSE155919 | GEO
2021-03-12 | GSE168776 | GEO
2023-09-18 | GSE212569 | GEO
2023-09-18 | GSE212571 | GEO
2023-09-18 | GSE212570 | GEO
2023-09-18 | GSE212573 | GEO