Assessing input-output relations in environmental data by means of fuzzy clustering and Bayesian inference / Bigozzi, Lisa; EL BASRI, Emanuele; Iannicello, Francesca; MARSILI LIBELLI, Stefano; Simonetti, Irene. - ELECTRONIC. - (2016), pp. 583-592. (Paper presented at the 8th International Congress on Environmental Modelling and Software, held in Toulouse, France, 10-14 July 2016).

Assessing input-output relations in environmental data by means of fuzzy clustering and Bayesian inference

EL BASRI, EMANUELE;MARSILI LIBELLI, STEFANO;SIMONETTI, IRENE
2016

Abstract

When dealing with complex environmental datasets, it is often difficult to establish the strength of the input-output relations among variables. Correlation analysis may yield a preliminary indication, but it is limited to the linear case. Mutual Information (MI) is a more powerful measure, which can establish input-output dependence regardless of the nature of the interaction. However, to avoid the heavy computational demand of MI, a simple method is presented based on fuzzy clustering and Bayes’ rule. After a preliminary conditioning phase, the data are grouped by fuzzy clustering and each sample is approximated by the value of its most relevant centroid. The prior and likelihood probabilities are then estimated by frequentist methods, counting the occurrences of samples with respect to the precomputed clusters. In this way the MI can be computed quickly, yielding the relative importance of the informative content of each input.
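The procedure outlined in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. The one-dimensional fuzzy c-means routine, the fuzzifier m = 2, the choice of four clusters, the synthetic data, and all function names (`memberships`, `fuzzy_cmeans_1d`, `hard_labels`, `mutual_information`) are assumptions made for illustration, and the preliminary conditioning phase is omitted because the abstract does not describe it. Each variable is clustered, every sample is replaced by the index of its highest-membership centroid, and the prior and likelihood are estimated by counting before being combined into the MI.

```python
import math
import random
from collections import Counter


def memberships(x, centroids, m=2.0):
    """Fuzzy c-means membership of scalar x in each cluster (sums to 1)."""
    dists = [abs(x - c) for c in centroids]
    d_min = min(dists)
    if d_min == 0.0:
        # x coincides with a centroid: share full membership among exact hits.
        hits = [1.0 if d == 0.0 else 0.0 for d in dists]
        return [h / sum(hits) for h in hits]
    # Ratios to the smallest distance are <= 1, which avoids overflow.
    weights = [(d_min / d) ** (2.0 / (m - 1.0)) for d in dists]
    total = sum(weights)
    return [w / total for w in weights]


def fuzzy_cmeans_1d(values, n_clusters, m=2.0, n_iter=50):
    """Plain fuzzy c-means on scalar data; returns the list of centroids."""
    lo, hi = min(values), max(values)
    # Deterministic start: centroids spread evenly over the data range.
    centroids = [lo + (hi - lo) * (k + 0.5) / n_clusters for k in range(n_clusters)]
    for _ in range(n_iter):
        num = [0.0] * n_clusters
        den = [0.0] * n_clusters
        for x in values:
            u = memberships(x, centroids, m)
            for k in range(n_clusters):
                w = u[k] ** m
                num[k] += w * x
                den[k] += w
        centroids = [num[k] / den[k] if den[k] > 0 else centroids[k]
                     for k in range(n_clusters)]
    return centroids


def hard_labels(values, centroids, m=2.0):
    """Replace each sample by the index of its highest-membership centroid."""
    labels = []
    for x in values:
        u = memberships(x, centroids, m)
        labels.append(max(range(len(u)), key=u.__getitem__))
    return labels


def mutual_information(x_labels, y_labels):
    """MI in bits, with prior and likelihood estimated by counting."""
    n = len(x_labels)
    count_x = Counter(x_labels)
    count_y = Counter(y_labels)
    count_xy = Counter(zip(x_labels, y_labels))
    mi = 0.0
    for (i, j), c in count_xy.items():
        prior = count_y[j] / n        # P(y_j)
        likelihood = c / count_y[j]   # P(x_i | y_j)
        evidence = count_x[i] / n     # P(x_i)
        # Bayes' rule gives the joint P(x_i, y_j) = likelihood * prior.
        mi += likelihood * prior * math.log2(likelihood / evidence)
    return mi


def clustered(values, n_clusters=4):
    """Cluster one variable and return the hard label of every sample."""
    return hard_labels(values, fuzzy_cmeans_1d(values, n_clusters))


# Synthetic (hypothetical) data: y depends on x1 nonlinearly, x2 is irrelevant.
random.seed(0)
n = 2000
x1 = [random.uniform(-1.0, 1.0) for _ in range(n)]
x2 = [random.uniform(-1.0, 1.0) for _ in range(n)]
y = [a * a + random.gauss(0.0, 0.05) for a in x1]

y_lab = clustered(y)
mi_x1 = mutual_information(clustered(x1), y_lab)
mi_x2 = mutual_information(clustered(x2), y_lab)
print(f"MI(x1; y) = {mi_x1:.3f} bits, MI(x2; y) = {mi_x2:.3f} bits")
```

In this example the output is a symmetric quadratic function of `x1`, so their linear correlation is close to zero, yet the MI estimate for `x1` should clearly exceed that of the unrelated input `x2`. Ranking the inputs by these values gives the kind of relative importance the abstract refers to.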
2016
Environmental Modelling and Software for Supporting a Sustainable Future, Proceedings - 8th International Congress on Environmental Modelling and Software
8th International Congress on Environmental Modelling and Software
Toulouse, France
10-14 July 2016
Bigozzi, Lisa; EL BASRI, Emanuele; Iannicello, Francesca; MARSILI LIBELLI, Stefano; Simonetti, Irene
Use this identifier to cite or link to this resource: https://hdl.handle.net/2158/1101412