Ensuring sustainability in the agri-food sector requires comprehensive data analysis. This study examines missing data patterns in a large-scale survey of Italian agri-food companies within the Italian National Research Center for Technology in Agriculture (Agritech), focusing on sustainability variables. The underlying idea is that failure to report a value for these variables indicates low attention to sustainability. We employ graphical Ising models to infer the conditional independence structure among missingness indicators and fully observed farm characteristics, which are modeled as binary variables. The graph structure is selected through node-wise logistic regressions with variable selection based on a backward stepwise procedure guided by the Bayesian Information Criterion (BIC). This approach enables the recovery of sparse and interpretable graphs while controlling for model complexity. We are not interested in causal relationships between missingness indicators and fully observed variables; rather, our focus is on the dependence structure among these variables. The method is applied to the Agritech data, yielding both a national graph and macro-regional graphs, as well as differential networks that highlight structural differences between each macro-region and the national graph. These results provide new insights into systematic patterns of missing data, offering a rigorous framework for improving data quality in terms of completeness and reliability.

Graphical Ising Models for Missing Data Patterns Detection in Sustainability Surveys / Mecca, Andrea; Gottard, Anna; Gagliardi, Francesca. - In: SOCIAL INDICATORS RESEARCH. - ISSN 0303-8300. - ELETTRONICO. - 182:(2026), pp. 1-28. [10.1007/s11205-026-03844-6]

Graphical Ising Models for Missing Data Patterns Detection in Sustainability Surveys

Mecca, Andrea
;
Gottard, Anna;
2026

Abstract

Ensuring sustainability in the agri-food sector requires comprehensive data analysis. This study examines missing data patterns in a large-scale survey of Italian agri-food companies within the Italian National Research Center for Technology in Agriculture (Agritech), focusing on sustainability variables. The underlying idea is that failure to report a value for these variables indicates low attention to sustainability. We employ graphical Ising models to infer the conditional independence structure among missingness indicators and fully observed farm characteristics, which are modeled as binary variables. The graph structure is selected through node-wise logistic regressions with variable selection based on a backward stepwise procedure guided by the Bayesian Information Criterion (BIC). This approach enables the recovery of sparse and interpretable graphs while controlling for model complexity. We are not interested in causal relationships between missingness indicators and fully observed variables; rather, our focus is on the dependence structure among these variables. The method is applied to the Agritech data, yielding both a national graph and macro-regional graphs, as well as differential networks that highlight structural differences between each macro-region and the national graph. These results provide new insights into systematic patterns of missing data, offering a rigorous framework for improving data quality in terms of completeness and reliability.
2026
182
1
28
Mecca, Andrea; Gottard, Anna; Gagliardi, Francesca
File in questo prodotto:
File Dimensione Formato  
Mecca_et_al-2026-Social_Indicators_Research.pdf

Accesso chiuso

Tipologia: Pdf editoriale (Version of record)
Licenza: Open Access
Dimensione 3.55 MB
Formato Adobe PDF
3.55 MB Adobe PDF   Richiedi una copia

I documenti in FLORE sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificatore per citare o creare un link a questa risorsa: https://hdl.handle.net/2158/1463992
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact