Dataset Information


Evaluating de Bruijn graph assemblers on 454 transcriptomic data.

ABSTRACT: Next generation sequencing (NGS) technologies have greatly changed the landscape of transcriptomic studies of non-model organisms. Since there is no reference genome available, de novo assembly methods play key roles in the analysis of these data sets. Because of the huge amount of data generated by NGS technologies for each run, many assemblers, e.g., ABySS, Velvet and Trinity, are developed based on a de Bruijn graph due to its time- and space-efficiency. However, most of these assemblers were developed initially for the Illumina/Solexa platform. The performance of these assemblers on 454 transcriptomic data is unknown. In this study, we evaluated and compared the relative performance of these de Bruijn graph based assemblers on both simulated and real 454 transcriptomic data. The results suggest that Trinity, the Illumina/Solexa-specialized transcriptomic assembler, performs the best among the multiple de Bruijn graph assemblers, comparable to or even outperforming the standard 454 assembler Newbler which is based on the overlap-layout-consensus algorithm. Our evaluation is expected to provide helpful guidance for researchers to choose assemblers when analyzing 454 transcriptomic data.


PROVIDER: S-EPMC3517413 | BioStudies | 2012-01-01

REPOSITORIES: biostudies

Similar Datasets

2011-01-01 | S-EPMC3056720 | BioStudies
2015-01-01 | S-EPMC4449708 | BioStudies
2012-01-01 | S-EPMC3488206 | BioStudies
2013-01-01 | S-EPMC3848682 | BioStudies
2014-01-01 | S-EPMC4304221 | BioStudies
1000-01-01 | S-EPMC5411769 | BioStudies
2012-01-01 | S-EPMC3290790 | BioStudies
2013-01-01 | S-EPMC3799473 | BioStudies
2020-01-01 | S-EPMC7499882 | BioStudies
2016-01-01 | S-EPMC5103826 | BioStudies