Dataset Information


SubSeq: determining appropriate sequencing depth through efficient read subsampling.



Next-generation sequencing experiments, such as RNA-Seq, play an increasingly important role in biological research. One complication is that the power and accuracy of such experiments depend substantially on the number of reads sequenced, so it is important and challenging to determine the optimal read depth for an experiment or to verify whether one has adequate depth in an existing experiment.


By randomly sampling lower depths from a sequencing experiment and determining where the saturation of power and accuracy occurs, one can determine what the most useful depth should be for future experiments, and furthermore, confirm whether an existing experiment had sufficient depth to justify its conclusions. We introduce the subSeq R package, which uses a novel efficient approach to perform this subsampling and to calculate informative metrics at each depth.

Availability and implementation

The subSeq R package is available at http://github.com/StoreyLab/subSeq/.


PROVIDER: S-EPMC4296149 | BioStudies | 2014-01-01

REPOSITORIES: biostudies

Similar Datasets

2009-01-01 | S-EPMC2697147 | BioStudies
1000-01-01 | S-EPMC5418619 | BioStudies
2018-01-01 | S-EPMC6075720 | BioStudies
2012-01-01 | S-EPMC3601603 | BioStudies
2019-01-01 | S-EPMC6636137 | BioStudies
2019-01-01 | S-EPMC6694302 | BioStudies
2011-01-01 | S-EPMC3179661 | BioStudies
2014-01-01 | S-EPMC3957067 | BioStudies
2009-01-01 | S-EPMC3087348 | BioStudies
2013-01-01 | S-EPMC3605148 | BioStudies