Unknown

Dataset Information

0

DREAMSeq: An Improved Method for Analyzing Differentially Expressed Genes in RNA-seq Data.


ABSTRACT: RNA sequencing (RNA-seq) has become a widely used technology for analyzing global gene-expression changes during certain biological processes. It is generally acknowledged that RNA-seq data displays equidispersion and overdispersion characteristics; therefore, most RNA-seq analysis methods were developed based on a negative binomial model capable of capturing both equidispersed and overdispersed data. In this study, we reported that in addition to equidispersion and overdispersion, RNA-seq data also displays underdispersion characteristics that cannot be adequately captured by general RNA-seq analysis methods. Based on a double Poisson model capable of capturing all data characteristics, we developed a new RNA-seq analysis method (DREAMSeq). Comparison of DREAMSeq with five other frequently used RNA-seq analysis methods using simulated datasets showed that its performance was comparable to or exceeded that of other methods in terms of type I error rate, statistical power, receiver operating characteristics (ROC) curve, area under the ROC curve, precision-recall curve, and the ability to detect the number of differentially expressed genes, especially in situations involving underdispersion. These results were validated by quantitative real-time polymerase chain reaction using a real Foxtail dataset. Our findings demonstrated DREAMSeq as a reliable, robust, and powerful new method for RNA-seq data mining. The DREAMSeq R package is available at http://tanglab.hebtu.edu.cn/tanglab/Home/DREAMSeq.

SUBMITTER: Gao Z 

PROVIDER: S-EPMC6284200 | biostudies-literature | 2018

REPOSITORIES: biostudies-literature

altmetric image

Publications

DREAMSeq: An Improved Method for Analyzing Differentially Expressed Genes in RNA-seq Data.

Gao Zhihua Z   Zhao Zhiying Z   Tang Wenqiang W  

Frontiers in genetics 20181130


RNA sequencing (RNA-seq) has become a widely used technology for analyzing global gene-expression changes during certain biological processes. It is generally acknowledged that RNA-seq data displays equidispersion and overdispersion characteristics; therefore, most RNA-seq analysis methods were developed based on a negative binomial model capable of capturing both equidispersed and overdispersed data. In this study, we reported that in addition to equidispersion and overdispersion, RNA-seq data  ...[more]

Similar Datasets

| S-EPMC8234728 | biostudies-literature
| S-EPMC5592911 | biostudies-literature
| S-EPMC5151178 | biostudies-literature
| S-EPMC3381971 | biostudies-literature
| S-EPMC5178351 | biostudies-literature