Ontology highlight
ABSTRACT:
SUBMITTER: Orzechowski P
PROVIDER: S-EPMC9683726 | biostudies-literature | 2022 Nov
REPOSITORIES: biostudies-literature
Orzechowski Patryk P Moore Jason H JH
Science advances 20221123 47
Understanding the strengths and weaknesses of machine learning (ML) algorithms is crucial to determine their scope of application. Here, we introduce the Diverse and Generative ML Benchmark (DIGEN), a collection of synthetic datasets for comprehensive, reproducible, and interpretable benchmarking of ML algorithms for classification of binary outcomes. The DIGEN resource consists of 40 mathematical functions that map continuous features to binary targets for creating synthetic datasets. These 40 ...[more]