Expanding a lower-dimensional problem to a higher-dimensional space and then projecting back is often beneficial. This article rigorously investigates this perspective in the context of finite mixture models, specifically how to improve inference for mixture models by using auxiliary variables. Despite the large literature in mixture models and several empirical examples, there is no previous work that gives general theoretical justification for including auxiliary variables in mixture models, even for special cases. We provide a theoretical basis for comparing inference for mixture multivariate models with the corresponding inference for marginal univariate mixture models. Analytical results for several special cases are established. We show that the probability of correctly allocating mixture memberships and the information number for the means of the primary outcome in a bivariate model with two Gaussian mixtures are generally larger than those in each univariate model. Simulations under a range of scenarios, including mis-specified models, are conducted to examine the improvement. The method is illustrated by two real applications in ecology and causal inference.

Improving inference of Gaussian mixtures using auxiliary variables / Mercatanti, Andrea; Li, Fan; Mealli, Fabrizia. - In: STATISTICAL ANALYSIS AND DATA MINING. - ISSN 1932-1864. - STAMPA. - 8:(2015), pp. 34-48. [10.1002/sam.11256]

Improving inference of Gaussian mixtures using auxiliary variables

MERCATANTI, ANDREA;MEALLI, FABRIZIA
2015

Abstract

Expanding a lower-dimensional problem to a higher-dimensional space and then projecting back is often beneficial. This article rigorously investigates this perspective in the context of finite mixture models, specifically how to improve inference for mixture models by using auxiliary variables. Despite the large literature in mixture models and several empirical examples, there is no previous work that gives general theoretical justification for including auxiliary variables in mixture models, even for special cases. We provide a theoretical basis for comparing inference for mixture multivariate models with the corresponding inference for marginal univariate mixture models. Analytical results for several special cases are established. We show that the probability of correctly allocating mixture memberships and the information number for the means of the primary outcome in a bivariate model with two Gaussian mixtures are generally larger than those in each univariate model. Simulations under a range of scenarios, including mis-specified models, are conducted to examine the improvement. The method is illustrated by two real applications in ecology and causal inference.
2015
8
34
48
Mercatanti, Andrea; Li, Fan; Mealli, Fabrizia
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in FLORE sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificatore per citare o creare un link a questa risorsa: https://hdl.handle.net/2158/1013345
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 10
  • ???jsp.display-item.citation.isi??? 9
social impact