Dataset Information


How many separable sources? Model selection in independent components analysis.

ABSTRACT: Unlike mixtures consisting solely of non-Gaussian sources, mixtures including two or more Gaussian components cannot be separated using standard independent components analysis methods that are based on higher order statistics and independent observations. The mixed Independent Components Analysis/Principal Components Analysis (mixed ICA/PCA) model described here accommodates one or more Gaussian components in the independent components analysis model and uses principal components analysis to characterize contributions from this inseparable Gaussian subspace. Information theory can then be used to select from among potential model categories with differing numbers of Gaussian components. Based on simulation studies, the assumptions and approximations underlying the Akaike Information Criterion do not hold in this setting, even with a very large number of observations. Cross-validation is a suitable, though computationally intensive alternative for model selection. Application of the algorithm is illustrated using Fisher's iris data set and Howells' craniometric data set. Mixed ICA/PCA is of potential interest in any field of scientific investigation where the authenticity of blindly separated non-Gaussian sources might otherwise be questionable. Failure of the Akaike Information Criterion in model selection also has relevance in traditional independent components analysis where all sources are assumed non-Gaussian.


PROVIDER: S-EPMC4374758 | BioStudies | 2015-01-01

REPOSITORIES: biostudies

Similar Datasets

2020-01-01 | S-EPMC7181150 | BioStudies
1000-01-01 | S-EPMC5789861 | BioStudies
2020-01-01 | S-EPMC7530342 | BioStudies
2009-01-01 | S-EPMC6870574 | BioStudies
2017-01-01 | S-EPMC5437155 | BioStudies
2016-01-01 | S-EPMC5568547 | BioStudies
2017-01-01 | S-EPMC6877114 | BioStudies
2018-01-01 | S-EPMC5801305 | BioStudies
2009-01-01 | S-EPMC3120963 | BioStudies
2013-01-01 | S-EPMC3709093 | BioStudies