Ontology highlight
ABSTRACT: Background
The quality of predictive modeling in biomedicine depends on the amount of data available for model building.Objective
To study the effect of combining microarray data sets on feature selection and predictive modeling performance.Methods
Empirical evaluation of stability of feature selection and discriminatory power of classifiers using three previously published gene expression data sets, analyzed both individually and in combination.Results
Feature selection was not robust for the individual as well as for the combined data sets. The classification performance of models built on individual and combined data sets was heavily dependent on the data set from which the features were extracted.Conclusion
We identified volatility of feature selection as contributing factor to some of the problems faced by predictive modeling using microarray data.
SUBMITTER: Osl M
PROVIDER: S-EPMC3041313 | biostudies-literature | 2010 Nov
REPOSITORIES: biostudies-literature

Osl Melanie M Dreiseitl Stephan S Kim Jihoon J Patel Kiltesh K Baumgartner Christian C Ohno-Machado Lucila L
AMIA ... Annual Symposium proceedings. AMIA Symposium 20101113
<h4>Background</h4>The quality of predictive modeling in biomedicine depends on the amount of data available for model building.<h4>Objective</h4>To study the effect of combining microarray data sets on feature selection and predictive modeling performance.<h4>Methods</h4>Empirical evaluation of stability of feature selection and discriminatory power of classifiers using three previously published gene expression data sets, analyzed both individually and in combination.<h4>Results</h4>Feature se ...[more]