Ensuring sustainability in the agri-food sector requires comprehensive data analysis. This study examines missing data patterns in a large-scale survey of Italian agri-food companies within the Italian National Research Center for Technology in Agriculture (Agritech), focusing on sustainability variables. The underlying idea is that failure to report a value for these variables indicates low attention to sustainability. We employ graphical Ising models to infer the conditional independence structure among missingness indicators and fully observed farm characteristics, which are modeled as binary variables. The graph structure is selected through node-wise logistic regressions with variable selection based on a backward stepwise procedure guided by the Bayesian Information Criterion (BIC). This approach enables the recovery of sparse and interpretable graphs while controlling for model complexity. We are not interested in causal relationships between missingness indicators and fully observed variables; rather, our focus is on the dependence structure among these variables. The method is applied to the Agritech data, yielding both a national graph and macro-regional graphs, as well as differential networks that highlight structural differences between each macro-region and the national graph. These results provide new insights into systematic patterns of missing data, offering a rigorous framework for improving data quality in terms of completeness and reliability.

Mecca, A., Gottard, A., Gagliardi, F. (2026). Graphical Ising Models for Missing Data Patterns Detection in Sustainability Surveys. SOCIAL INDICATORS RESEARCH, 182(3) [10.1007/s11205-026-03844-6].

Graphical Ising Models for Missing Data Patterns Detection in Sustainability Surveys

Andrea Mecca;Francesca Gagliardi
2026-01-01

Abstract

Ensuring sustainability in the agri-food sector requires comprehensive data analysis. This study examines missing data patterns in a large-scale survey of Italian agri-food companies within the Italian National Research Center for Technology in Agriculture (Agritech), focusing on sustainability variables. The underlying idea is that failure to report a value for these variables indicates low attention to sustainability. We employ graphical Ising models to infer the conditional independence structure among missingness indicators and fully observed farm characteristics, which are modeled as binary variables. The graph structure is selected through node-wise logistic regressions with variable selection based on a backward stepwise procedure guided by the Bayesian Information Criterion (BIC). This approach enables the recovery of sparse and interpretable graphs while controlling for model complexity. We are not interested in causal relationships between missingness indicators and fully observed variables; rather, our focus is on the dependence structure among these variables. The method is applied to the Agritech data, yielding both a national graph and macro-regional graphs, as well as differential networks that highlight structural differences between each macro-region and the national graph. These results provide new insights into systematic patterns of missing data, offering a rigorous framework for improving data quality in terms of completeness and reliability.
2026
Mecca, A., Gottard, A., Gagliardi, F. (2026). Graphical Ising Models for Missing Data Patterns Detection in Sustainability Surveys. SOCIAL INDICATORS RESEARCH, 182(3) [10.1007/s11205-026-03844-6].
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11365/1314254
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo