Unknown

Dataset Information

0

A near complete snapshot of the Zea mays seedling transcriptome revealed from ultra-deep sequencing.


ABSTRACT: RNA-sequencing (RNA-seq) enables in-depth exploration of transcriptomes, but typical sequencing depth often limits its comprehensiveness. In this study, we generated nearly 3 billion RNA-Seq reads, totaling 341 Gb of sequence, from a Zea mays seedling sample. At this depth, a near complete snapshot of the transcriptome was observed consisting of over 90% of the annotated transcripts, including lowly expressed transcription factors. A novel hybrid strategy combining de novo and reference-based assemblies yielded a transcriptome consisting of 126,708 transcripts with 88% of expressed known genes assembled to full-length. We improved current annotations by adding 4,842 previously unannotated transcript variants and many new features, including 212 maize transcripts, 201 genes, 10 genes with undocumented potential roles in seedlings as well as maize lineage specific gene fusion events. We demonstrated the power of deep sequencing for large transcriptome studies by generating a high quality transcriptome, which provides a rich resource for the research community.

SUBMITTER: Martin JA 

PROVIDER: S-EPMC3970191 | biostudies-literature | 2014

REPOSITORIES: biostudies-literature

altmetric image

Publications

A near complete snapshot of the Zea mays seedling transcriptome revealed from ultra-deep sequencing.

Martin Jeffrey A JA   Johnson Nicole V NV   Gross Stephen M SM   Schnable James J   Meng Xiandong X   Wang Mei M   Coleman-Derr Devin D   Lindquist Erika E   Wei Chia-Lin CL   Kaeppler Shawn S   Chen Feng F   Wang Zhong Z  

Scientific reports 20140331


RNA-sequencing (RNA-seq) enables in-depth exploration of transcriptomes, but typical sequencing depth often limits its comprehensiveness. In this study, we generated nearly 3 billion RNA-Seq reads, totaling 341 Gb of sequence, from a Zea mays seedling sample. At this depth, a near complete snapshot of the transcriptome was observed consisting of over 90% of the annotated transcripts, including lowly expressed transcription factors. A novel hybrid strategy combining de novo and reference-based as  ...[more]

Similar Datasets

| S-EPMC3306378 | biostudies-literature
| S-EPMC3911899 | biostudies-literature
| S-EPMC5006096 | biostudies-literature
| S-EPMC5447894 | biostudies-literature
| S-EPMC5331654 | biostudies-literature
2010-12-10 | GSE25994 | GEO
2015-03-29 | E-MTAB-4246 | biostudies-arrayexpress