Multiple imputation of incomplete multilevel data using Heckman selection models

dc.contributor.authorMuñoz, Johanna
dc.contributor.authorEfthimiou, Orestis
dc.contributor.authorAudigier, Vincent
dc.contributor.authorJong, Valentijn M. T.
dc.contributor.authorDebray, Thomas P. A.
dc.date.accessioned2024-02-14T08:38:32Z
dc.date.available2024-02-14T08:38:32Z
dc.date.issued2024-02
dc.description.abstractAbstract Missing data is a common problem in medical research, and is commonly addressed using multiple imputation. Although traditional imputation methods allow for valid statistical inference when data are missing at random (MAR), their implementation is problematic when the presence of missingness depends on unobserved variables, that is, the data are missing not at random (MNAR). Unfortunately, this MNAR situation is rather common, in observational studies, registries and other sources of real‐world data. While several imputation methods have been proposed for addressing individual studies when data are MNAR, their application and validity in large datasets with multilevel structure remains unclear. We therefore explored the consequence of MNAR data in hierarchical data in‐depth, and proposed a novel multilevel imputation method for common missing patterns in clustered datasets. This method is based on the principles of Heckman selection models and adopts a two‐stage meta‐analysis approach to impute binary and continuous variables that may be outcomes or predictors and that are systematically or sporadically missing. After evaluating the proposed imputation model in simulated scenarios, we illustrate it use in a cross‐sectional community survey to estimate the prevalence of malaria parasitemia in children aged 2‐10 years in five regions in Uganda.en_US
dc.identifier.citationMuñoz, Johanna, Orestis Efthimiou, Vincent Audigier, et al. 'Multiple Imputation of Incomplete Multilevel Data using Heckman Selection Models', Statistics in Medicine, vol. 43/no. 3, (2024), pp. 514-533.en_US
dc.identifier.issnISSN 0277-6715
dc.identifier.issnEISSN 1097-0258
dc.identifier.urihttps://nru.uncst.go.ug/handle/123456789/9404
dc.language.isoenen_US
dc.publisherJohn Wiley & Sons, Ltden_US
dc.subjectHeckman model, IPDMA, missing not at random, selection models, multiple imputationen_US
dc.titleMultiple imputation of incomplete multilevel data using Heckman selection modelsen_US
dc.typeArticleen_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Statistics in Medicine - 2023 - Muñoz - Multiple imputation of incomplete multilevel data using Heckman selection models.pdf
Size:
3.97 MB
Format:
Adobe Portable Document Format
Description:
Journal Article

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: