When dealing with complex environmental datasets, it is also difficult to establish the strength of the input-output relation among variables. Correlation analysis may yield a preliminary indication, but is limited to the linear case. Mutual Information (MI) is a more powerful method which can establish input-output dependence regardless of the nature of their interaction. However, to avoid the heavy computational demand of MI, a simple method is presented based on fuzzy clustering and Bayes’ rule. After a preliminary conditioning phase, the data are grouped by fuzzy clustering and approximated with the value of the most relevant centroid. Then the prior and likelihood probabilities are computed by frequentist methods by counting the occurrences of each sample with respect to the precomputed clusters. In this way the MI can be quickly computed, to yield the relative importance of the informative content of each input.
Assessing input-output relations in environemntal data by means of fuzzy clustering and Bayesian inference / Bigozzi, Lisa; EL BASRI, Emanuele; Iannicello, Francesca; MARSILI LIBELLI, Stefano; Simonetti, Irene. - ELETTRONICO. - (2016), pp. 583-592. (Intervento presentato al convegno 8th International Congress on Environmental Modelling and Software tenutosi a Toluose, France nel 10-14 July 2016).
Assessing input-output relations in environemntal data by means of fuzzy clustering and Bayesian inference
EL BASRI, EMANUELE;MARSILI LIBELLI, STEFANO;SIMONETTI, IRENE
2016
Abstract
When dealing with complex environmental datasets, it is also difficult to establish the strength of the input-output relation among variables. Correlation analysis may yield a preliminary indication, but is limited to the linear case. Mutual Information (MI) is a more powerful method which can establish input-output dependence regardless of the nature of their interaction. However, to avoid the heavy computational demand of MI, a simple method is presented based on fuzzy clustering and Bayes’ rule. After a preliminary conditioning phase, the data are grouped by fuzzy clustering and approximated with the value of the most relevant centroid. Then the prior and likelihood probabilities are computed by frequentist methods by counting the occurrences of each sample with respect to the precomputed clusters. In this way the MI can be quickly computed, to yield the relative importance of the informative content of each input.File | Dimensione | Formato | |
---|---|---|---|
ASSESSING INPUT-OUTPUT RELATIONS IN ENVIRONMENTAL DATA BY MEANS O.pdf
Accesso chiuso
Tipologia:
Pdf editoriale (Version of record)
Licenza:
Tutti i diritti riservati
Dimensione
285.34 kB
Formato
Adobe PDF
|
285.34 kB | Adobe PDF | Richiedi una copia |
I documenti in FLORE sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.