Genomics

Dataset Information

Longitudinal dataset: An optimized library for reference-based deconvolution of whole-blood biospecimens assayed using the Illumina HumanMethylationEPIC BeadArray (I)


ABSTRACT: DNA methylation assessments of peripheral blood DNA can be used to accurately estimate the relative proportions of the underlying leukocyte subtypes. Such cell deconvolution analysis relies on libraries of discriminating differentially methylated regions (DMRs) developed for each cell type measured. The estimated cell type proportions can then be tested for association with phenotypes, disease states, and subject outcomes, or used as adjustment terms in multivariable models in epigenome-wide association studies (EWAS). We obtained purified neutrophils, monocytes, B-lymphocytes, natural killer (NK) cells, CD4+ T-cells, and CD8+ T-cells from healthy subjects and measured DNA methylation with the Illumina HumanMethylationEPIC array platform. In addition, we measured DNA methylation with the EPIC array in two sets of artificial DNA mixtures comprising the above cell types. We compared three approaches for selecting reference DMR libraries for cell type proportion inference. The IDOL algorithm identified an optimal DMR library of 450 CpG sites for inferring leukocyte subtype proportions (average R² = 99.2). Importantly, the majority of CpG sites (69%) in the IDOL DMR library were unique to the new EPIC methylation array, in that they were not present on the 450K array. Our new reference DMR library is available as a Bioconductor package, has the potential to reduce unintended technical differences arising from combining different generations of array platforms, and may be helpful in generating larger DMR libraries that include novel cell subtypes. We applied the new DMR library to a longitudinal dataset of 12 measurements from a single healthy subject (aged 33 at the start of blood collection). The subject was followed for approximately 400 days, with samples collected every 40 days. One sample was excluded from the final analysis because of a potential mix-up.
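
The idea behind reference-based deconvolution described in the abstract can be illustrated with a small sketch. It is not the IDOL algorithm and not the code in the Bioconductor package. It assumes a plain non-negative least-squares fit, solved here by projected gradient descent, followed by rescaling so the proportions sum to 1. The function name estimate_proportions, the toy reference matrix, and all beta values are hypothetical and were made up for illustration only.

```python
# Toy sketch of reference-based cell type deconvolution (illustrative only).
# Assumption: a sample's methylation profile is a weighted average of the
# reference profiles of its constituent cell types.

CELL_TYPES = ["Neu", "Mono", "Bcell", "NK", "CD4T", "CD8T"]


def estimate_proportions(reference, sample, n_iter=2000):
    """Estimate cell type proportions by non-negative least squares.

    reference: one row per CpG, one mean beta value per cell type.
    sample:    one beta value per CpG for the mixed sample.
    Returns non-negative proportions rescaled to sum to 1.
    """
    n_cpg = len(reference)
    n_types = len(reference[0])
    # Normal-equation terms: gram = R^T R and rhs = R^T y.
    gram = [[sum(reference[i][a] * reference[i][b] for i in range(n_cpg))
             for b in range(n_types)] for a in range(n_types)]
    rhs = [sum(reference[i][a] * sample[i] for i in range(n_cpg))
           for a in range(n_types)]
    # The trace bounds the largest eigenvalue of gram, so 1/trace is a safe step.
    step = 1.0 / sum(gram[a][a] for a in range(n_types))
    weights = [1.0 / n_types] * n_types
    for _ in range(n_iter):
        grad = [sum(gram[a][b] * weights[b] for b in range(n_types)) - rhs[a]
                for a in range(n_types)]
        # Gradient step, then projection onto the non-negative orthant.
        weights = [max(0.0, weights[a] - step * grad[a])
                   for a in range(n_types)]
    total = sum(weights)
    return [w / total for w in weights]


# Hypothetical reference library: 5 CpGs per cell type, each highly
# methylated (0.9) in one cell type and lowly methylated (0.1) in the others.
reference = [[0.9 if k == marker else 0.1 for k in range(len(CELL_TYPES))]
             for marker in range(len(CELL_TYPES)) for _ in range(5)]

# Hypothetical artificial mixture with known proportions (noise-free).
true_props = [0.60, 0.08, 0.05, 0.05, 0.15, 0.07]
mixture = [sum(row[k] * true_props[k] for k in range(len(CELL_TYPES)))
           for row in reference]

estimated = estimate_proportions(reference, mixture)
for name, value in zip(CELL_TYPES, estimated):
    print(f"{name}: {value:.3f}")
```

With real array data, the reference matrix would hold the mean beta values of the library CpGs in the purified cell types, and the estimates would be compared against the known composition of the artificial mixtures.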

ORGANISM(S): Homo sapiens

PROVIDER: GSE110530 | GEO | 2018/05/08

REPOSITORIES: GEO

Similar Datasets

2018-05-08 | GSE110554 | GEO
2018-05-08 | GSE112618 | GEO
2022-02-07 | GSE182379 | GEO
2022-02-07 | GSE180970 | GEO
2022-02-07 | GSE180683 | GEO
2022-02-07 | GSE167998 | GEO
2021-12-31 | E-MTAB-11279 | biostudies-arrayexpress
| PRJNA546267 | ENA
2020-12-15 | GSE154566 | GEO
| PRJNA512879 | ENA