Loading...
Calibrated imputation for multivariate categorical data
de Waal,T. ; Daalmans,J.
de Waal,T.
Daalmans,J.
Abstract
Non-response is a major problem for anyone collecting and processing data. A commonly used technique to deal with missing data is imputation, where missing values are estimated and filled in into the dataset. Imputation can become challenging if the variable to be imputed has to comply with a known total. Even more challenging is the case where several variables in the same dataset need to be imputed and, in addition to known totals, logical restrictions between variables have to be satisfied. In our paper, we develop an approach for a broad class of imputation methods for multivariate categorical data such that previously published totals are preserved while logical restrictions on the data are satisfied. The developed approach can be used in combination with any imputation model that estimates imputation probabilities, i.e. the probability that imputation of a certain category for a variable in a certain unit leads to the correct value for this variable and unit.
Description
Date
2023
Journal Title
Journal ISSN
Volume Title
Publisher
Research Projects
Organizational Units
Journal Issue
Keywords
Edit rules, Fully conditional specification, Mass imputation, Non-response
Citation
de Waal, T & Daalmans, J 2023, 'Calibrated imputation for multivariate categorical data', Asta-advances in Statistical Analysis. https://doi.org/10.1007/s10182-023-00481-z
License
info:eu-repo/semantics/openAccess
