The paper deals with the problem of unsupervised learning with structured data, proposing a mixture model approach to cluster tree samples. First, we discuss how to use the Switching-Parent Hidden Tree Markov Model, a compositional model for learning tree distributions, to define a finite mixture model where the number of components is fixed by a hyperparameter. Then, we show how to relax such an assumption by introducing a Bayesian non-parametric mixture model where the number of necessary hidden tree components is learned from data. Experimental validation on synthetic and real datasets show the benefit of mixture models over simple hidden tree models in clustering applications. Further, we provide a characterization of the behaviour of the two mixture models for different choices of their hyperparameters.
Bayesian Mixtures of Hidden Tree Markov Models for Structured Data Clustering / Davide Bacciu; Daniele Castellana. - In: NEUROCOMPUTING. - ISSN 0925-2312. - ELETTRONICO. - 342:(2019), pp. 49-59. [10.1016/j.neucom.2018.11.091]
Bayesian Mixtures of Hidden Tree Markov Models for Structured Data Clustering
Daniele Castellana
2019
Abstract
The paper deals with the problem of unsupervised learning with structured data, proposing a mixture model approach to cluster tree samples. First, we discuss how to use the Switching-Parent Hidden Tree Markov Model, a compositional model for learning tree distributions, to define a finite mixture model where the number of components is fixed by a hyperparameter. Then, we show how to relax such an assumption by introducing a Bayesian non-parametric mixture model where the number of necessary hidden tree components is learned from data. Experimental validation on synthetic and real datasets show the benefit of mixture models over simple hidden tree models in clustering applications. Further, we provide a characterization of the behaviour of the two mixture models for different choices of their hyperparameters.File | Dimensione | Formato | |
---|---|---|---|
bayesian-mixtures-hidden.pdf
accesso aperto
Licenza:
Open Access
Dimensione
1.02 MB
Formato
Adobe PDF
|
1.02 MB | Adobe PDF |
I documenti in FLORE sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.