
Dataset Information


A machine learning-based data mining in medical examination data: a biological features-based biological age prediction model.


ABSTRACT:

Background

Biological age (BA) has been recognized as a more accurate indicator of aging than chronological age (CA). However, current work has three limitations: insufficient attention to the incompleteness of the medical data used to construct BA; a lack of machine learning-based BA (ML-BA) models for the Chinese population; and neglect of how the degree of model overfitting affects the stability of the association results.

Methods and results

Using medical examination data from a Chinese population (aged 45-90 years), we first evaluated which missing-data interpolation method was most suitable, then constructed 14 ML-BAs from biomarkers, and finally explored the associations between the ML-BAs and health statuses (health risk indicators and disease). Round-robin linear regression interpolation performed best, while AutoEncoder showed the highest interpolation stability. We further illustrated a potential overfitting problem in ML-BAs, which affected the stability of the ML-BAs' associations with health statuses. We then proposed a composite ML-BA based on the Stacking method with a simple meta-model (STK-BA), which overcame the overfitting problem and was more strongly associated with CA (r = 0.66, P < 0.001), health risk indicators, disease counts, and six types of disease.
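The round-robin linear regression interpolation named above can be illustrated with a minimal sketch. It assumes the usual reading of the term: each biomarker with missing values is regressed in turn on all the other biomarkers, and its gaps are refilled from that fit over several rounds. The abstract does not give the authors' implementation, so the function names, the number of rounds, and the tiny ridge term (added only for numerical stability) are all illustrative assumptions.

```python
from statistics import fmean


def solve_linear_system(a, b):
    """Solve a @ x = b by Gauss-Jordan elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(n):
            if r != col:
                factor = m[r][col] / m[col][col]
                m[r] = [x - factor * y for x, y in zip(m[r], m[col])]
    return [m[i][n] / m[i][i] for i in range(n)]


def fit_linear(xs, ys, ridge=1e-8):
    """Least squares with an intercept, solved through the normal equations.

    The small ridge term on the slopes is an assumption of this sketch; it
    only guards against collinear predictors.
    """
    rows = [[1.0] + list(x) for x in xs]
    k = len(rows[0])
    xtx = [[sum(r[i] * r[j] for r in rows) for j in range(k)] for i in range(k)]
    for i in range(1, k):
        xtx[i][i] += ridge
    xty = [sum(r[i] * y for r, y in zip(rows, ys)) for i in range(k)]
    return solve_linear_system(xtx, xty)


def predict_linear(coef, x):
    return coef[0] + sum(c * v for c, v in zip(coef[1:], x))


def without(row, j):
    """Return the row with column j dropped (the predictors for column j)."""
    return [v for k, v in enumerate(row) if k != j]


def round_robin_impute(data, n_rounds=10):
    """Fill None entries column by column; the input is left unmodified."""
    n_cols = len(data[0])
    missing = [[v is None for v in row] for row in data]
    # Start from column means, then refine each column in turn.
    means = [fmean(r[j] for r in data if r[j] is not None) for j in range(n_cols)]
    filled = [[means[j] if v is None else float(v) for j, v in enumerate(row)]
              for row in data]
    for _ in range(n_rounds):
        for j in range(n_cols):
            if not any(m[j] for m in missing):
                continue  # nothing to impute in this column
            # Fit only on rows where column j was actually observed.
            train = [(without(r, j), r[j])
                     for r, m in zip(filled, missing) if not m[j]]
            coef = fit_linear([x for x, _ in train], [y for _, y in train])
            for r, m in zip(filled, missing):
                if m[j]:
                    r[j] = predict_linear(coef, without(r, j))
    return filled
```

Calling `round_robin_impute(rows)` on a list of equal-length rows, with `None` marking each missing biomarker value, returns a completed copy. Observed values are never overwritten; only the originally missing cells are updated in each round.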

Conclusion

We provided an improved aging measurement method for middle-aged and elderly groups in China, which can more stably capture aging characteristics other than CA, supporting the emerging application potential of machine learning in aging research.

SUBMITTER: Yang Q 

PROVIDER: S-EPMC9528174 | biostudies-literature | 2022 Oct

REPOSITORIES: biostudies-literature


Publications

A machine learning-based data mining in medical examination data: a biological features-based biological age prediction model.

Yang Qing, Gao Sunan, Lin Junfen, Lyu Ke, Wu Zexu, Chen Yuhao, Qiu Yinwei, Zhao Yanrong, Wang Wei, Lin Tianxiang, Pan Huiyun, Chen Ming

BMC Bioinformatics, 2022-10-03, issue 1



Similar Datasets

| S-EPMC9501106 | biostudies-literature
| S-EPMC9798507 | biostudies-literature
| S-EPMC9310626 | biostudies-literature
| S-EPMC10879664 | biostudies-literature
| S-EPMC8774637 | biostudies-literature
| S-EPMC10307844 | biostudies-literature
| S-EPMC7244024 | biostudies-literature
| S-EPMC9068780 | biostudies-literature
| S-EPMC7224118 | biostudies-literature
| S-EPMC10135598 | biostudies-literature