The paper deals with the problem of unsupervised learning with structured data, proposing a mixture model approach to cluster tree samples. First, we discuss how to use the Switching-Parent Hidden Tree Markov Model, a compositional model for learning tree distributions, to define a finite mixture model where the number of components is fixed by a hyperparameter. Then, we show how to relax such an assumption by introducing a Bayesian non-parametric mixture model where the number of necessary hidden tree components is learned from data. Experimental validation on synthetic and real datasets show the benefit of mixture models over simple hidden tree models in clustering applications. Further, we provide a characterization of the behaviour of the two mixture models for different choices of their hyperparameters.

Bayesian Mixtures of Hidden Tree Markov Models for Structured Data Clustering / Davide Bacciu; Daniele Castellana. - In: NEUROCOMPUTING. - ISSN 0925-2312. - ELETTRONICO. - 342:(2019), pp. 49-59. [10.1016/j.neucom.2018.11.091]

Bayesian Mixtures of Hidden Tree Markov Models for Structured Data Clustering

Daniele Castellana
2019

Abstract

The paper deals with the problem of unsupervised learning with structured data, proposing a mixture model approach to cluster tree samples. First, we discuss how to use the Switching-Parent Hidden Tree Markov Model, a compositional model for learning tree distributions, to define a finite mixture model where the number of components is fixed by a hyperparameter. Then, we show how to relax such an assumption by introducing a Bayesian non-parametric mixture model where the number of necessary hidden tree components is learned from data. Experimental validation on synthetic and real datasets show the benefit of mixture models over simple hidden tree models in clustering applications. Further, we provide a characterization of the behaviour of the two mixture models for different choices of their hyperparameters.
2019
342
49
59
Davide Bacciu; Daniele Castellana
File in questo prodotto:
File Dimensione Formato  
bayesian-mixtures-hidden.pdf

accesso aperto

Licenza: Open Access
Dimensione 1.02 MB
Formato Adobe PDF
1.02 MB Adobe PDF

I documenti in FLORE sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificatore per citare o creare un link a questa risorsa: https://hdl.handle.net/2158/1304186
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 7
  • ???jsp.display-item.citation.isi??? 3
social impact