Dataset Information


Dynamic Model for RNA-seq Data Analysis.

ABSTRACT: By measuring messenger RNA levels for all genes in a sample, RNA-seq provides an attractive option to characterize the global changes in transcription. RNA-seq is becoming the widely used platform for gene expression profiling. However, real transcription signals in the RNA-seq data are confounded with measurement and sequencing errors and other random biological/technical variation. To extract biologically useful transcription process from the RNA-seq data, we propose to use the second ODE for modeling the RNA-seq data. We use differential principal analysis to develop statistical methods for estimation of location-varying coefficients of the ODE. We validate the accuracy of the ODE model to fit the RNA-seq data by prediction analysis and 5-fold cross validation. To further evaluate the performance of the ODE model for RNA-seq data analysis, we used the location-varying coefficients of the second ODE as features to classify the normal and tumor cells. We demonstrate that even using the ODE model for single gene we can achieve high classification accuracy. We also conduct response analysis to investigate how the transcription process responds to the perturbation of the external signals and identify dozens of genes that are related to cancer.


PROVIDER: S-EPMC4539434 | BioStudies | 2015-01-01

REPOSITORIES: biostudies

Similar Datasets

1000-01-01 | S-EPMC3623791 | BioStudies
2019-01-01 | S-EPMC6547432 | BioStudies
2020-01-01 | S-EPMC7326199 | BioStudies
2017-01-01 | S-EPMC5667649 | BioStudies
2011-01-01 | S-EPMC3167048 | BioStudies
1000-01-01 | S-EPMC3091629 | BioStudies
2017-01-01 | S-EPMC5677165 | BioStudies
1000-01-01 | S-EPMC4271460 | BioStudies
2019-01-01 | S-EPMC6821224 | BioStudies
1000-01-01 | S-EPMC3232367 | BioStudies