Unknown

Dataset Information

0

Using biological constraints to improve prediction in precision oncology.


ABSTRACT: Many gene signatures have been developed by applying machine learning (ML) on omics profiles, however, their clinical utility is often hindered by limited interpretability and unstable performance. Here, we show the importance of embedding prior biological knowledge in the decision rules yielded by ML approaches to build robust classifiers. We tested this by applying different ML algorithms on gene expression data to predict three difficult cancer phenotypes: bladder cancer progression to muscle-invasive disease, response to neoadjuvant chemotherapy in triple-negative breast cancer, and prostate cancer metastatic progression. We developed two sets of classifiers: mechanistic, by restricting the training to features capturing specific biological mechanisms; and agnostic, in which the training did not use any a priori biological information. Mechanistic models had a similar or better testing performance than their agnostic counterparts, with enhanced interpretability. Our findings support the use of biological constraints to develop robust gene signatures with high translational potential.

SUBMITTER: Omar M 

PROVIDER: S-EPMC9958363 | biostudies-literature | 2023 Mar

REPOSITORIES: biostudies-literature

altmetric image

Publications

Using biological constraints to improve prediction in precision oncology.

Omar Mohamed M   Dinalankara Wikum W   Mulder Lotte L   Coady Tendai T   Zanettini Claudio C   Imada Eddie Luidy EL   Younes Laurent L   Geman Donald D   Marchionni Luigi L  

iScience 20230202 3


Many gene signatures have been developed by applying machine learning (ML) on <i>omics</i> profiles, however, their clinical utility is often hindered by limited interpretability and unstable performance. Here, we show the importance of embedding prior biological knowledge in the decision rules yielded by ML approaches to build robust classifiers. We tested this by applying different ML algorithms on gene expression data to predict three difficult cancer phenotypes: bladder cancer progression to  ...[more]

Similar Datasets

| S-EPMC9369403 | biostudies-literature
| S-EPMC10860535 | biostudies-literature
| S-EPMC9060513 | biostudies-literature
2022-10-31 | GSE186566 | GEO
| S-EPMC11907791 | biostudies-literature
| S-EPMC6119176 | biostudies-other
| S-EPMC7446508 | biostudies-literature
| S-EPMC7971903 | biostudies-literature
| S-EPMC5321161 | biostudies-literature
| S-EPMC9415718 | biostudies-literature