UCL Discovery
UCL home » Library Services » Electronic resources » UCL Discovery

Bayesian Variable Selection Under Sample Selection and Model Misspecification

Rubio Alvarez, Javier; Iqbal, Adam; Ogundimu, Emmanuel; (2025) Bayesian Variable Selection Under Sample Selection and Model Misspecification. Bayesian Analysis 10.1214/25-BA1567. Green open access

[thumbnail of BASS.pdf]
Preview
Text
BASS.pdf - Published Version

Download (351kB) | Preview

Abstract

Sample selection bias arises when missingness in the outcome of interest correlates with the outcome itself, leading to non-randomly selected samples. A common approach to correct bias from sample selection is to use sample selection models that jointly model the selection mechanism and the outcome of interest. Formulating these models typically rely on exclusion restrictions (variables that are predictors of selection but not appearing in the outcome equation) to ensure identifiability of the parameters. However, the choice of exclusion restrictions often depends on heuristics or expert judgment, potentially leading to the inclusion of irrelevant variables or the omission of important ones. Additionally, distributional misspecification and omitted variable bias are frequent challenges in this framework. To formally address these issues, we propose a Bayesian variable selection (BVS) methodology that incorporates both local priors (LPs) and non-local priors (NLPs), enabling the identification of variables with predictive power for the outcome and selection processes. We develop computational tools to conduct BVS in sample selection models based on a Laplace approximation of the marginal likelihood, and characterize the resulting Bayes factor rates under model misspecification. We establish model selection consistency for both classes of priors, showing that the proposed methodology correctly identifies active variables for both the selection process and outcome process asymptotically. The priors are calibrated to account for the possibility of distributional misspecification and omitted variable bias. We present a simulation study and real-data applications to explore the finite-sample effects of model misspecification on BVS. We compare the performance of the proposed methodology against BVS based on spike-and-slab (SS) priors and the Adaptive LASSO (ALASSO), an adaptive weighting of the least absolute shrinkage and selection operator (LASSO).

Type: Article
Title: Bayesian Variable Selection Under Sample Selection and Model Misspecification
Open access status: An open access version is available from UCL Discovery
DOI: 10.1214/25-BA1567
Publisher version: http://doi.org/10.1214/25-BA1567
Language: English
Additional information: This work is licensed under a Creative Commons Attribution 4.0 International License (CC BY 4.0).
Keywords: Laplace approximation , local priors , Non-local priors , sample selection models
UCL classification: UCL
UCL > Provost and Vice Provost Offices > UCL BEAMS
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Maths and Physical Sciences
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Maths and Physical Sciences > Dept of Statistical Science
URI: https://discovery.ucl.ac.uk/id/eprint/10215630
Downloads since deposit
8Downloads
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item