Dataset Information


Fast Bayesian Inference in Dirichlet Process Mixture Models.

ABSTRACT: There has been increasing interest in applying Bayesian nonparametric methods in large samples and high dimensions. As Markov chain Monte Carlo (MCMC) algorithms are often infeasible, there is a pressing need for much faster algorithms. This article proposes a fast approach for inference in Dirichlet process mixture (DPM) models. Viewing the partitioning of subjects into clusters as a model selection problem, we propose a sequential greedy search algorithm for selecting the partition. Then, when conjugate priors are chosen, the resulting posterior conditionally on the selected partition is available in closed form. This approach allows testing of parametric models versus nonparametric alternatives based on Bayes factors. We evaluate the approach using simulation studies and compare it with four other fast nonparametric methods in the literature. We apply the proposed approach to three datasets including one from a large epidemiologic study. Matlab codes for the simulation and data analyses using the proposed approach are available online in the supplemental materials.


PROVIDER: S-EPMC3812957 | BioStudies | 2011-01-01

REPOSITORIES: biostudies

Similar Datasets

2016-01-01 | S-EPMC5915294 | BioStudies
1000-01-01 | S-EPMC4550296 | BioStudies
2018-01-01 | S-EPMC6035010 | BioStudies
2014-01-01 | S-EPMC4225571 | BioStudies
2017-01-01 | S-EPMC5587402 | BioStudies
1000-01-01 | S-EPMC3590929 | BioStudies
2019-01-01 | S-EPMC6582336 | BioStudies
2016-01-01 | S-EPMC5036949 | BioStudies
2010-01-01 | S-EPMC2861699 | BioStudies
2016-01-01 | S-EPMC5200948 | BioStudies