Dataset Information


RNA-seq of wild type Arabidopsis seedlings to compare cross-replicate variability and false discovery rate in differential gene expression tools

ABSTRACT: RNA-seq data from 17 wild-type biological replicates of Arabidopsis thaliana were used to explore read count measurements across replicates, along with the False Discovery Rate of Differential Gene Expression (DGE) tools. Although A. thaliana has a relatively small genome, its transcriptome is similar in scale and complexity to that of model mammal species, and its genome is extensively annotated. The conclusions presented here provide useful guidance for work in other complex eukaryotes. The findings show that the negative binomial and log-normal distributions are both good choices as models for the cross-replicate variability of RNA-seq read counts. Six of the nine DGE tools controlled their identification of false positives well, even with only 3 replicates. Our results reinforce the conclusions reached by Schurch et al. (2015 RNA) in yeast.

INSTRUMENT(S): ERCC Spike-in Kit (Ambion) & Illumina TruSeq Stranded Total RNA with Ribo-Zero Plant kit, Illumina HiSeq 2000, RNeasy Plant Mini Kit & TURBO™ DNase (Ambion)

ORGANISM(S): Arabidopsis thaliana  

SUBMITTER: Geoff J Barton   Gordon G Simpson   Céline Duc   Nick Schurch   Marek Gierliński   Katarzyna Mackinnon   Kimon Froussios  

PROVIDER: E-MTAB-5446 | ArrayExpress | 2019-02-08



Similar Datasets

2019-05-29 | E-MTAB-7990 | ArrayExpress
2011-01-11 | E-MEXP-2539 | ArrayExpress
| PRJNA218873 | ENA
| PRJNA302625 | ENA
2014-09-25 | E-GEOD-57862 | ArrayExpress
| GSE57862 | GEO
| GSE75138 | GEO
2011-01-01 | E-MEXP-2574 | ArrayExpress
| GSE71802 | GEO
2019-03-13 | PXD012535 | Pride