Unknown

Dataset Information

0

A novel algorithm for calling mRNA m6A peaks by modeling biological variances in MeRIP-seq data.


ABSTRACT: N(6)-methyl-adenosine (m(6)A) is the most prevalent mRNA methylation but precise prediction of its mRNA location is important for understanding its function. A recent sequencing technology, known as Methylated RNA Immunoprecipitation Sequencing technology (MeRIP-seq), has been developed for transcriptome-wide profiling of m(6)A. We previously developed a peak calling algorithm called exomePeak. However, exomePeak over-simplifies data characteristics and ignores the reads' variances among replicates or reads dependency across a site region. To further improve the performance, new model is needed to address these important issues of MeRIP-seq data.We propose a novel, graphical model-based peak calling method, MeTPeak, for transcriptome-wide detection of m(6)A sites from MeRIP-seq data. MeTPeak explicitly models read count of an m(6)A site and introduces a hierarchical layer of Beta variables to capture the variances and a Hidden Markov model to characterize the reads dependency across a site. In addition, we developed a constrained Newton's method and designed a log-barrier function to compute analytically intractable, positively constrained Beta parameters. We applied our algorithm to simulated and real biological datasets and demonstrated significant improvement in detection performance and robustness over exomePeak. Prediction results on publicly available MeRIP-seq datasets are also validated and shown to be able to recapitulate the known patterns of m(6)A, further validating the improved performance of MeTPeak.The package 'MeTPeak' is implemented in R and C?++, and additional details are available at https://github.com/compgenomics/MeTPeakyufei.huang@utsa.edu or xdchoi@gmail.comSupplementary data are available at Bioinformatics online.

SUBMITTER: Cui X 

PROVIDER: S-EPMC4908365 | biostudies-literature | 2016 Jun

REPOSITORIES: biostudies-literature

altmetric image

Publications

A novel algorithm for calling mRNA m6A peaks by modeling biological variances in MeRIP-seq data.

Cui Xiaodong X   Meng Jia J   Zhang Shaowu S   Chen Yidong Y   Huang Yufei Y  

Bioinformatics (Oxford, England) 20160601 12


<h4>Motivation</h4>N(6)-methyl-adenosine (m(6)A) is the most prevalent mRNA methylation but precise prediction of its mRNA location is important for understanding its function. A recent sequencing technology, known as Methylated RNA Immunoprecipitation Sequencing technology (MeRIP-seq), has been developed for transcriptome-wide profiling of m(6)A. We previously developed a peak calling algorithm called exomePeak. However, exomePeak over-simplifies data characteristics and ignores the reads' vari  ...[more]

Similar Datasets

| S-EPMC7170965 | biostudies-literature
| S-EPMC7320601 | biostudies-literature
| S-EPMC5001242 | biostudies-literature
| S-EPMC6396939 | biostudies-literature
| S-EPMC4364623 | biostudies-literature
2019-04-05 | E-MTAB-6791 | biostudies-arrayexpress
2019-04-05 | E-MTAB-7783 | biostudies-arrayexpress
| S-EPMC4357668 | biostudies-literature
| S-ECPF-GEOD-53370 | biostudies-other