Genomics

Dataset Information

0

Accurate ethnicity prediction from placental DNA methylation data


ABSTRACT: The influence of genetics on DNA methylation (DNAme) variation is well documented, yet confounding from population stratification is often unaccounted for in DNAme association studies. Existing approaches have been developed to address confounding by population stratification by directly using DNAme data, but have not been validated in additional human populations or tissues, such as the placenta. Results: To aid future placental DNAme studies in assessing population stratification, we developed an ethnicity classifier, PLANET (placental elastic net DNAme ethnicity classifier), on combined Infinium Human Methylation 450k BeadChip array (HM450k) data from placental samples. We used data from five North American cohorts from private and public repositories (n = 509) and show that PLANET can not only accurately predict (accuracy = 0.9379, kappa = 0.8227) major classes of self-reported ethnicity/race (African: n = 58, Asian: n = 53, Caucasian: n = 389), but can also produce probabilities that are highly correlated with genetic ancestry inferred from genome-wide SNP (>2.5 million SNP) and ancestry informative markers (n=50) data. We found that PLANET’s ethnicity classification relies on 1860 DNAme microarray sites, and over half of these were also linked to nearby genetic polymorphisms (n=955). Lastly, we found our placental-optimized method outperforms existing approaches in assessing population stratification in our placental samples from individuals of Asian, African, and Caucasian ethnicities. Conclusion: PLANET outperforms existing methods and heavily relies on the genetic signal present in DNAme microarray data. PLANET can be used to address population stratification in future placental DNAme association studies, and will be especially useful when ethnicity information is missing and genotyping markers are unavailable.

ORGANISM(S): Homo sapiens

PROVIDER: GSE128827 | GEO | 2020/10/14

REPOSITORIES: GEO

Similar Datasets

2016-02-29 | GSE77716 | GEO
2016-02-29 | E-GEOD-77716 | biostudies-arrayexpress
2016-04-11 | GSE69749 | GEO
2016-08-16 | GSE85594 | GEO
2015-08-10 | E-GEOD-70044 | biostudies-arrayexpress
2023-11-09 | GSE234729 | GEO
| PRJNA529078 | ENA
2020-10-21 | GSE159526 | GEO
| EGAS00001004682 | EGA
| MSV000086147 | MassIVE