Unknown

Dataset Information

0

Variations in the Intragene Methylation Profiles Hallmark Induced Pluripotency.


ABSTRACT: We demonstrate the potential of differentiating embryonic and induced pluripotent stem cells by the regularized linear and decision tree machine learning classification algorithms, based on a number of intragene methylation measures. The resulting average accuracy of classification has been proven to be above 95%, which overcomes the earlier achievements. We propose a constructive and transparent method of feature selection based on classifier accuracy. Enrichment analysis reveals statistically meaningful presence of stemness group and cancer discriminating genes among the selected best classifying features. These findings stimulate the further research on the functional consequences of these differences in methylation patterns. The presented approach can be broadly used to discriminate the cells of different phenotype or in different state by their methylation profiles, identify groups of genes constituting multifeature classifiers, and assess enrichment of these groups by the sets of genes with a functionality of interest.

SUBMITTER: Druzhkov P 

PROVIDER: S-EPMC4651640 | biostudies-literature | 2015

REPOSITORIES: biostudies-literature

altmetric image

Publications

Variations in the Intragene Methylation Profiles Hallmark Induced Pluripotency.

Druzhkov Pavel P   Zolotykh Nikolay N   Meyerov Iosif I   Alsaedi Ahmed A   Shutova Maria M   Ivanchenko Mikhail M   Zaikin Alexey A  

BioMed research international 20151105


We demonstrate the potential of differentiating embryonic and induced pluripotent stem cells by the regularized linear and decision tree machine learning classification algorithms, based on a number of intragene methylation measures. The resulting average accuracy of classification has been proven to be above 95%, which overcomes the earlier achievements. We propose a constructive and transparent method of feature selection based on classifier accuracy. Enrichment analysis reveals statistically  ...[more]

Similar Datasets

| S-EPMC4053963 | biostudies-literature
| S-EPMC4284806 | biostudies-literature
| S-EPMC4977272 | biostudies-literature
2019-07-01 | GSE117448 | GEO
| S-EPMC4809190 | biostudies-literature
| S-EPMC5641056 | biostudies-literature
| S-EPMC3712966 | biostudies-literature
| S-DIXA-D-1161 | biostudies-other
| S-EPMC7294618 | biostudies-literature
| S-EPMC3353120 | biostudies-literature