Item

Quantifying selection bias in flash estimates derived from cumulative non-probability samples

Gomez-Echeverry,Santiago
Van Delden,Arnout
De Waal,Ton
Pavlopoulos,Dimitris
Stoel,Reinoud
Abstract
The increasing reliance on non-probability samples, including administrative data, big data, and surveys with selective non-response, has brought the issue of selection bias to the forefront of applied research and official statistics production. This study investigates selection bias in the context of flash estimates, focusing on their application in short-term economic indicators such as the Gross Domestic Product and the Consumer Price Index. Leveraging prominent theoretical frameworks for modeling selection bias, we develop and evaluate a series of estimators that incorporate information from a fully-observed lagged target variable, current auxiliary variables, or combined data sources. Through simulations and a case study using turnover (i.e., revenue) data from Statistics Netherlands, we assess these estimators across diverse scenarios, including variations in variable distribution, selectivity levels, and correlations between auxiliary and target variables. Results indicate that estimators combining lagged and current auxiliary information provide more consistent results than those relying on a single data source. Additionally, estimators based on combined data sources perform relatively well under high selectivity and non-normal target distributions. These findings provide practical and easily implementable tools to address selection bias in non-probability samples, enhancing the reliability and timeliness of official statistics.
Description
Date
2025-09
Journal Title
Journal ISSN
Volume Title
Publisher
Research Projects
Organizational Units
Journal Issue
Keywords
Flash estimates, Non-probabilistic sampling, Official statistics, Selection error, Survey non-response
Citation
Gomez-Echeverry, S, Van Delden, A, De Waal, T, Pavlopoulos, D & Stoel, R 2025, 'Quantifying selection bias in flash estimates derived from cumulative non-probability samples', Journal of Survey Statistics and Methodology. https://doi.org/10.1093/jssam/smaf016
Embedded videos