When the target variable exhibits a semicontinuous behaviour (a point mass in a single value and a continuous distribution elsewhere) parametric ‘two-part models’ have been extensively used and investigated. The applications have mainly been related to non-negative variables with a point mass in zero (zero-inflated data). In this paper a semiparametric Bayesian twopart model for dealing with such variables is proposed. The model allows a semiparametric expression for the two parts of the model by using Dirichlet processes. A motivating example, based on grape wine production in Tuscany (an Italian region), is used to show the capabilities of the model. Finally, two simulation experiments evaluate the model. Results show a satisfactory performance of the suggested approach for modelling and predicting semicontinuous data when parametric assumptions are not reasonable.
A Bayesian semiparametric model for non-negative semicontinuous data / Dreassi, Emanuela; Rocco, Emilia. - In: COMMUNICATIONS IN STATISTICS, THEORY AND METHODS. - ISSN 1532-415X. - ELETTRONICO. - 46:(2017), pp. 5133-5146. [10.1080/03610926.2015.1096389]
A Bayesian semiparametric model for non-negative semicontinuous data
DREASSI, EMANUELA;ROCCO, EMILIA
2017
Abstract
When the target variable exhibits a semicontinuous behaviour (a point mass in a single value and a continuous distribution elsewhere) parametric ‘two-part models’ have been extensively used and investigated. The applications have mainly been related to non-negative variables with a point mass in zero (zero-inflated data). In this paper a semiparametric Bayesian twopart model for dealing with such variables is proposed. The model allows a semiparametric expression for the two parts of the model by using Dirichlet processes. A motivating example, based on grape wine production in Tuscany (an Italian region), is used to show the capabilities of the model. Finally, two simulation experiments evaluate the model. Results show a satisfactory performance of the suggested approach for modelling and predicting semicontinuous data when parametric assumptions are not reasonable.I documenti in FLORE sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.