With reference to a binary outcome and a binary mediator, we derive identification bounds for natural effects under a reduced set of assumptions. Specifically, no assumptions about confounding are made that involve the outcome; we only assume no unobserved exposure-mediator confounding as well as a pair of conditions termed Partially Constant Cross-World Dependence (PC-CWD) and Logit Constancy (LC). These assumptions pose fewer constraints on the counterfactual probabilities than the set of assumptions they replace. The proposed strategy permits to achieve interval identification of the total effect, which is no longer point identified under the considered set of assumptions. Our derivations are based on postulating a logistic regression model for the mediator as well as for the outcome. However, in both cases the functional form governing the dependence on the explanatory variables is allowed to be arbitrary, thereby resulting in a semi-parametric approach. To account for sampling variability, we provide delta-method approximations of standard errors to build uncertainty intervals from identification bounds. The method is compared to some alternative ones in a simulation study and then applied to a dataset gathered from a Spanish prospective cohort study, with the aim to evaluate whether the effect of smoking on lung cancer risk is mediated by the onset of pulmonary emphysema.

Interval identification of natural effects in the presence of outcome‐related unmeasured confounding / Doretti, Marco; Stanghellini, Elena. - In: SCANDINAVIAN JOURNAL OF STATISTICS. - ISSN 0303-6898. - ELETTRONICO. - --:(2026), pp. 1-26. [10.1111/sjos.70055]

Interval identification of natural effects in the presence of outcome‐related unmeasured confounding

Doretti, Marco
;
2026

Abstract

With reference to a binary outcome and a binary mediator, we derive identification bounds for natural effects under a reduced set of assumptions. Specifically, no assumptions about confounding are made that involve the outcome; we only assume no unobserved exposure-mediator confounding as well as a pair of conditions termed Partially Constant Cross-World Dependence (PC-CWD) and Logit Constancy (LC). These assumptions pose fewer constraints on the counterfactual probabilities than the set of assumptions they replace. The proposed strategy permits to achieve interval identification of the total effect, which is no longer point identified under the considered set of assumptions. Our derivations are based on postulating a logistic regression model for the mediator as well as for the outcome. However, in both cases the functional form governing the dependence on the explanatory variables is allowed to be arbitrary, thereby resulting in a semi-parametric approach. To account for sampling variability, we provide delta-method approximations of standard errors to build uncertainty intervals from identification bounds. The method is compared to some alternative ones in a simulation study and then applied to a dataset gathered from a Spanish prospective cohort study, with the aim to evaluate whether the effect of smoking on lung cancer risk is mediated by the onset of pulmonary emphysema.
2026
--
1
26
Doretti, Marco; Stanghellini, Elena
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in FLORE sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificatore per citare o creare un link a questa risorsa: https://hdl.handle.net/2158/1450975
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? 0
social impact