Dataset Information

Rnnotator: an automated de novo transcriptome assembly pipeline from stranded RNA-Seq reads.


ABSTRACT:

Background

Comprehensive annotation and quantification of transcriptomes are outstanding problems in functional genomics. While high-throughput mRNA sequencing (RNA-Seq) has emerged as a powerful tool for addressing these problems, its success depends on the availability and quality of reference genome sequences, which limits the organisms to which it can be applied.

Results

Here, we describe Rnnotator, an automated software pipeline that generates transcript models by de novo assembly of RNA-Seq data without the need for a reference genome. We have applied the Rnnotator assembly pipeline to two yeast transcriptomes and compared the results to the reference gene catalogs of these organisms. The contigs produced by Rnnotator are highly accurate (95%) and reconstruct full-length genes for the majority of the existing gene models (54.3%). Furthermore, our analyses revealed many novel transcribed regions that are absent from well-annotated genomes, suggesting that Rnnotator serves as a complementary approach to reference-based analysis for comprehensive transcriptomics.

Conclusions

These results demonstrate that the Rnnotator pipeline is able to reconstruct full-length transcripts in the absence of a complete reference genome.

SUBMITTER: Martin J 

PROVIDER: S-EPMC3152782 | biostudies-literature | 2010 Nov

REPOSITORIES: biostudies-literature

Publications

Rnnotator: an automated de novo transcriptome assembly pipeline from stranded RNA-Seq reads.

Jeffrey Martin, Vincent M Bruno, Zhide Fang, Xiandong Meng, Matthew Blow, Tao Zhang, Gavin Sherlock, Michael Snyder, Zhong Wang

BMC Genomics, 2010-11-24


Similar Datasets

| S-EPMC5322684 | biostudies-literature
| S-EPMC4760927 | biostudies-literature
| S-EPMC3910276 | biostudies-literature
| S-EPMC4537571 | biostudies-literature
| S-EPMC4342890 | biostudies-literature
| S-EPMC3485621 | biostudies-literature
| S-EPMC4134189 | biostudies-literature
| S-EPMC4332758 | biostudies-literature
| S-EPMC4222968 | biostudies-literature
| S-EPMC3287467 | biostudies-literature