Dataset Information

MULTIVARIATE MIXED MEMBERSHIP MODELING: INFERRING DOMAIN-SPECIFIC RISK PROFILES.


ABSTRACT: Characterizing the shared memberships of individuals in a classification scheme poses severe interpretability issues, even when using a moderate number of classes (say 4). Mixed membership models quantify this phenomenon, but they typically focus on goodness-of-fit more than on interpretable inference. To achieve a good numerical fit, these models may in fact require many extreme profiles, making the results difficult to interpret. We introduce a new class of multivariate mixed membership models that, when variables can be partitioned into subject-matter-based domains, can provide a good fit to the data using fewer profiles than standard formulations. The proposed model explicitly accounts for the blocks of variables corresponding to the distinct domains along with a cross-domain correlation structure, which provides new information about shared membership of individuals in a complex classification scheme. We specify a multivariate logistic normal distribution for the membership vectors, which allows easy introduction of auxiliary information through a latent multivariate logistic regression. A Bayesian approach to inference, relying on Pólya-Gamma data augmentation, facilitates efficient posterior computation via Markov chain Monte Carlo. We apply this methodology to a spatially explicit study of malaria risk over time on the Brazilian Amazon frontier.
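As a rough illustration of the logistic normal construction mentioned in the abstract, the sketch below draws one membership vector per domain from jointly Gaussian latent scores whose mean depends on covariates. It is not the authors' implementation: the function name `sample_domain_memberships`, the reference-category parameterization, and all numeric values are assumptions made for this example only, and the paper's exact specification may differ.

```python
import numpy as np

def sample_domain_memberships(X, B, Sigma, profiles_per_domain, rng):
    """Draw one membership vector per domain for each individual.

    Hypothetical helper, for illustration only.
    X: (n, p) covariates.
    B: (p, D) regression coefficients, D = sum of (H_j - 1) over domains.
    Sigma: (D, D) covariance; off-diagonal blocks induce cross-domain correlation.
    profiles_per_domain: list of H_j, the number of profiles in each domain.
    Returns a list of (n, H_j) arrays whose rows lie on the simplex.
    """
    n = X.shape[0]
    dims = [h - 1 for h in profiles_per_domain]
    # Latent Gaussian scores with a covariate-dependent mean.
    noise = rng.multivariate_normal(np.zeros(sum(dims)), Sigma, size=n)
    eta = X @ B + noise
    memberships = []
    start = 0
    for d in dims:
        block = eta[:, start:start + d]
        # Assumed convention: last profile is the reference, with score 0.
        full = np.hstack([block, np.zeros((n, 1))])
        full -= full.max(axis=1, keepdims=True)  # numerical stability
        w = np.exp(full)
        memberships.append(w / w.sum(axis=1, keepdims=True))
        start += d
    return memberships

rng = np.random.default_rng(0)
n = 5
X = np.column_stack([np.ones(n), rng.normal(size=n)])  # intercept + one covariate
profiles = [3, 2]                       # two domains with 3 and 2 profiles
D = sum(h - 1 for h in profiles)        # latent dimension, here 3
B = np.array([[0.5, -0.5, 0.0],
              [1.0, 0.0, -1.0]])
Sigma = 0.5 * np.eye(D) + 0.5           # unit variances, correlation 0.5
lam = sample_domain_memberships(X, B, Sigma, profiles, rng)
```

Each element of `lam` holds the membership weights for one domain. Because the latent scores share a single covariance matrix, an individual's memberships in different domains are correlated rather than independent.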

SUBMITTER: Russo M 

PROVIDER: S-EPMC9222983 | biostudies-literature | 2022 Mar

REPOSITORIES: biostudies-literature

Publications

MULTIVARIATE MIXED MEMBERSHIP MODELING: INFERRING DOMAIN-SPECIFIC RISK PROFILES.

Massimiliano Russo, Burton H. Singer, David B. Dunson

The Annals of Applied Statistics, 2022-03-28, 1

Similar Datasets

| S-EPMC4159106 | biostudies-literature
| S-EPMC3119541 | biostudies-literature
| S-EPMC11623444 | biostudies-literature
| S-EPMC8233119 | biostudies-literature
| S-EPMC387299 | biostudies-literature
| S-EPMC7322631 | biostudies-literature
| S-EPMC2553439 | biostudies-literature
| S-EPMC11423932 | biostudies-literature
| S-EPMC4548941 | biostudies-literature
| S-EPMC6365942 | biostudies-literature