Unknown

Dataset Information

0

Identifying Effective Feature Selection Methods for Alzheimer's Disease Biomarker Gene Detection Using Machine Learning.


ABSTRACT: Alzheimer's disease (AD) is a complex genetic disorder that affects the brain and has been the focus of many bioinformatics research studies. The primary objective of these studies is to identify and classify genes involved in the progression of AD and to explore the function of these risk genes in the disease process. The aim of this research is to identify the most effective model for detecting biomarker genes associated with AD using several feature selection methods. We compared the efficiency of feature selection methods with an SVM classifier, including mRMR, CFS, the Chi-Square Test, F-score, and GA. We calculated the accuracy of the SVM classifier using validation methods such as 10-fold cross-validation. We applied these feature selection methods with SVM to a benchmark AD gene expression dataset consisting of 696 samples and 200 genes. The results indicate that the mRMR and F-score feature selection methods with SVM classifier achieved a high accuracy of around 84%, with a number of genes between 20 and 40. Furthermore, the mRMR and F-score feature selection methods with SVM classifier outperformed the GA, Chi-Square Test, and CFS methods. Overall, these findings suggest that the mRMR and F-score feature selection methods with SVM classifier are effective in identifying biomarker genes related to AD and could potentially lead to more accurate diagnosis and treatment of the disease.

SUBMITTER: Alshamlan H 

PROVIDER: S-EPMC10217314 | biostudies-literature | 2023 May

REPOSITORIES: biostudies-literature

altmetric image

Publications

Identifying Effective Feature Selection Methods for Alzheimer's Disease Biomarker Gene Detection Using Machine Learning.

Alshamlan Hala H   Omar Samar S   Aljurayyad Rehab R   Alabduljabbar Reham R  

Diagnostics (Basel, Switzerland) 20230517 10


Alzheimer's disease (AD) is a complex genetic disorder that affects the brain and has been the focus of many bioinformatics research studies. The primary objective of these studies is to identify and classify genes involved in the progression of AD and to explore the function of these risk genes in the disease process. The aim of this research is to identify the most effective model for detecting biomarker genes associated with AD using several feature selection methods. We compared the efficien  ...[more]

Similar Datasets

| S-EPMC11380031 | biostudies-literature
| S-EPMC10191821 | biostudies-literature
| S-EPMC8286592 | biostudies-literature
| S-EPMC11914577 | biostudies-literature
| S-EPMC5481359 | biostudies-other
| S-EPMC7719177 | biostudies-literature
| S-EPMC10746470 | biostudies-literature
| S-EPMC8683500 | biostudies-literature
| S-EPMC9684182 | biostudies-literature
| S-EPMC6917601 | biostudies-literature