Regression-based imputation of explanatory discrete missing data

Hernández-Herrera, Gilma; Navarro, Albert; Moriña, David

Author

Hernández-Herrera, Gilma

Navarro, Albert

Moriña, David

Publication date

2024-10-16T13:49:38Z

2024-09-01

Abstract

Imputation of missing values is a strategy for handling non-response in surveys or data loss in measurement processes, and it may be more effective than discarding the incomplete records. The characteristics of the variables with missing values must be considered when choosing an imputation method; in particular, the literature on imputing count variables is scarce. If the variable has an excess of zeros, models that include parameters for zero-inflation must be considered. Likewise, if over- or under-dispersion is observed, generalizations of the Poisson distribution, such as the Hermite or Conway-Maxwell-Poisson (COM-Poisson) distributions, are recommended for imputation. The aim of this study was to assess the performance of regression models based on Poisson generalizations for imputing a discrete variable, in comparison with classical count models, through a comprehensive simulation study covering a variety of scenarios and a real data example. To do so, we compared estimates obtained using only complete cases with estimates obtained after imputation based on the most common count models. In general, the COM-Poisson distribution gives better results in every dispersion scenario, especially when the amount of missing information is large.
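The general idea of regression-based imputation of a count variable can be illustrated with a minimal sketch. It is not the authors' implementation: it uses a plain Poisson regression rather than the Hermite or COM-Poisson models the study evaluates, a single stochastic imputation rather than multiple imputation, and simulated data. The helper names (`fit_poisson`, `rpois`, `impute_counts`), the auxiliary variable `z`, and all numeric settings are assumptions made for illustration.

```python
import math
import random


def fit_poisson(z, x, iters=50):
    """Fit log E[x] = b0 + b1 * z by Newton-Raphson (hypothetical helper)."""
    b0, b1 = math.log(sum(x) / len(x)), 0.0  # start at the intercept-only fit
    for _ in range(iters):
        mu = [math.exp(b0 + b1 * zi) for zi in z]
        # Score vector of the Poisson log-likelihood
        g0 = sum(xi - mi for xi, mi in zip(x, mu))
        g1 = sum((xi - mi) * zi for xi, mi, zi in zip(x, mu, z))
        # Fisher information matrix (2 x 2, symmetric)
        h00 = sum(mu)
        h01 = sum(mi * zi for mi, zi in zip(mu, z))
        h11 = sum(mi * zi * zi for mi, zi in zip(mu, z))
        det = h00 * h11 - h01 * h01
        # Newton step: inverse information times score
        d0 = (h11 * g0 - h01 * g1) / det
        d1 = (h00 * g1 - h01 * g0) / det
        b0, b1 = b0 + d0, b1 + d1
        if abs(d0) + abs(d1) < 1e-10:
            break
    return b0, b1


def rpois(mu, rng):
    """Draw one Poisson(mu) variate with Knuth's multiplication method."""
    limit, k, p = math.exp(-mu), 0, rng.random()
    while p > limit:
        k += 1
        p *= rng.random()
    return k


def impute_counts(z, x, rng):
    """Replace None entries of x with draws from the fitted Poisson model."""
    obs = [(zi, xi) for zi, xi in zip(z, x) if xi is not None]
    b0, b1 = fit_poisson([o[0] for o in obs], [o[1] for o in obs])
    filled = [xi if xi is not None else rpois(math.exp(b0 + b1 * zi), rng)
              for zi, xi in zip(z, x)]
    return filled, (b0, b1)


# Simulated example (assumed settings): the count x depends on z
rng = random.Random(2024)
z = [rng.gauss(0.0, 1.0) for _ in range(500)]
x_full = [rpois(math.exp(0.5 + 0.4 * zi), rng) for zi in z]
# Delete about 30% of the counts completely at random
x_miss = [None if rng.random() < 0.3 else xi for xi in x_full]
x_imp, (b0, b1) = impute_counts(z, x_miss, rng)
```

Drawing the imputed values from the fitted distribution, instead of plugging in the fitted mean, keeps them integer-valued and preserves some of the variability of the observed counts. A fuller procedure would also propagate the uncertainty in the fitted coefficients and repeat the imputation several times.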

Document type

Article

Accepted version

Language

English

Subjects and keywords

Regression analysis; Variables (Mathematics); Discrete mathematics

Published by

Taylor & Francis

Related documents

Postprint version of the document published at: https://doi.org/10.1080/03610918.2022.2149805

Communications in Statistics-Simulation and Computation, 2024, vol. 53, no. 9, p. 4363-4379

This item appears in the following collection(s)

Econometria, Estadística i Economia Aplicada [1099]

ISGlobal - Institut de Salut Global de Barcelona [61437]
