Regression-based imputation of explanatory discrete missing data

dc.contributor.author
Hernández-Herrera, Gilma
dc.contributor.author
Navarro, Albert
dc.contributor.author
Moriña, David
dc.date.issued
2024-10-16T13:49:38Z
dc.date.issued
2024-10-16T13:49:38Z
dc.date.issued
2024-09-01
dc.date.issued
2024-10-16T13:49:38Z
dc.identifier
0361-0918
dc.identifier
https://hdl.handle.net/2445/215820
dc.identifier
732907
dc.description.abstract
Imputation of missing values is a strategy for handling non-responses in surveys or data loss in measurement processes, which may be more effective than ignoring the losses and omitting them. The characteristics of variables presenting missing values must be considered when choosing the imputation method to be used; in particular when the variable is a count the literature dealing with this issue is scarce. If the variable has an excess of zeros it is necessary to consider models including parameters for handling zero-inflation. Likewise, if problems of over- or under-dispersion are observed, generalizations of the Poisson, such as the Hermite or Conway Maxwell Poisson distributions are recommended for carrying out imputation. The aim of this study was to assess the performance of various regression models in the imputation of a discrete variable based on Poisson generalizations, in comparison with classical counting models, through a comprehensive simulation study considering a variety of scenarios and a real data example. To do so we compared the results of estimations using only complete data, and using imputations based on the most common count models. The COMPoisson distribution provides in general better results in any dispersion scenario, especially when the amount of missing information is large.
dc.format
16 p.
dc.format
application/pdf
dc.language
eng
dc.publisher
Taylor & Francis
dc.relation
Versió postprint del document publicat a: https://doi.org/10.1080/03610918.2022.2149805
dc.relation
Communications in Statistics-Simulation and Computation, 2024, vol. 53, num.9, p. 4363-4379
dc.relation
https://doi.org/10.1080/03610918.2022.2149805
dc.rights
(c) Taylor & Francis, 2024
dc.rights
info:eu-repo/semantics/openAccess
dc.source
Articles publicats en revistes (Econometria, Estadística i Economia Aplicada)
dc.subject
Anàlisi de regressió
dc.subject
Variables (Matemàtica)
dc.subject
Matemàtica discreta
dc.subject
Regression analysis
dc.subject
Variables (Mathematics)
dc.subject
Discrete mathematics
dc.title
Regression-based imputation of explanatory discrete missing data
dc.type
info:eu-repo/semantics/article
dc.type
info:eu-repo/semantics/acceptedVersion


Files in this item

FilesSizeFormatView

There are no files associated with this item.

This item appears in the following Collection(s)