Methylation profiling

Dataset Information

An evaluation of analysis pipelines for DNA methylation profiling using the Illumina Human Methylation 450k platform


ABSTRACT: The proper identification of differentially methylated CpGs is central to most epigenetic studies. The Illumina Human Methylation 450k BeadChip is widely used to quantify DNA methylation; nevertheless, the design of an appropriate analysis pipeline faces severe challenges due to the convolution of biological and technical variability and the presence of a signal bias between the Infinium I and II probe design types. Despite recent attempts to investigate how to analyze DNA methylation data with such an array design, it has not been possible to perform a comprehensive comparison between different bioinformatics pipelines due to the lack of appropriate datasets having both a large sample size and a sufficient number of technical replicates. Here we perform such a comparative analysis, targeting the problems of reducing the technical variability, eliminating the probe design bias and reducing the batch effect, by exploiting two unpublished datasets, which included technical replicates and were profiled for DNA methylation in peripheral blood, monocytes or muscle biopsies. The blood samples included individuals with Multiple Sclerosis (MS). We evaluated the performance of different analysis pipelines and demonstrated that a) it is critical to correct for the probe design type, since the amplitude of the measured methylation change depends on the underlying chemistry; b) the effect of different normalization schemes is mixed, and the most effective methods in our hands were quantile normalization and Beta Mixture Quantile dilation (BMIQ); c) it is beneficial to correct for batch effects. In conclusion, our comparative analysis using a comprehensive dataset suggests an efficient pipeline for proper identification of differentially methylated CpGs using the Illumina 450k arrays.
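
For orientation, quantile normalization, one of the two schemes the abstract singles out, can be sketched in a few lines. This is a generic illustration and not the authors' pipeline: the function name `quantile_normalize` and the toy beta values are assumptions made for the example, and the sketch does not address the Infinium I/II probe design bias (which BMIQ targets) or batch effects.

```python
import numpy as np


def quantile_normalize(values):
    """Quantile-normalize the columns of a probes x samples matrix.

    Hypothetical helper for illustration only: every sample (column) is
    forced onto the same reference distribution, namely the mean of the
    per-sample sorted values.
    """
    values = np.asarray(values, dtype=float)
    # Rank of each entry within its own column (0 = smallest value).
    order = np.argsort(values, axis=0, kind="stable")
    ranks = np.argsort(order, axis=0, kind="stable")
    # Reference distribution: average of the sorted columns, rank by rank.
    reference = np.sort(values, axis=0).mean(axis=1)
    # Replace each entry with the reference value at its rank.
    return reference[ranks]


# Toy beta values (made up): 3 probes x 2 samples.
beta = np.array([
    [0.10, 0.80],
    [0.50, 0.20],
    [0.90, 0.40],
])
normalized = quantile_normalize(beta)
```

After the call, both columns of `normalized` contain the same set of values, so the samples share one distribution while each probe keeps its within-sample rank. Ties are broken by row order here; production implementations usually average tied ranks.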

ORGANISM(S): Homo sapiens

PROVIDER: GSE43976 | GEO | 2013/03/14

SECONDARY ACCESSION(S): PRJNA188414

REPOSITORIES: GEO

Similar Datasets

2013-03-14 | E-GEOD-43976 | biostudies-arrayexpress
2016-01-01 | GSE52635 | GEO
2013-08-16 | GSE49908 | GEO
2018-07-01 | GSE108567 | GEO
2016-07-03 | E-GEOD-52635 | biostudies-arrayexpress
2015-11-19 | GSE75153 | GEO
2013-08-16 | GSE49907 | GEO
2013-08-16 | GSE49905 | GEO
2015-11-19 | E-GEOD-75153 | biostudies-arrayexpress
2020-07-16 | GSE148425 | GEO