A method of correction for heaping error in the variables using validation data.

Amar S Ahmad, Munther Al-Hassan, Hamid Y Hussain, Nirmin F Juber, Fred N Kiwanuka, Mohammed Hag-Ali, Raghib Ali
Published in: Statistical papers (Berlin, Germany) (2023)
When self-reported data are used to estimate the mean, the variance, or regression parameters, the estimates are often biased. This is because interviewees tend to heap their answers at certain values. This paper examines the bias that heaping error induces in self-reported data and studies its effect on the mean and variance of a distribution and on regression parameters. A new method is then introduced that uses validation data to correct the bias due to heaping error. Using publicly available data and simulation studies, the paper shows that the method is practical and easy to apply for correcting bias in the estimated mean and variance, as well as in regression parameters estimated from self-reported data. The correction therefore helps researchers draw more accurate conclusions and make better-informed decisions, e.g. regarding health care planning and delivery.
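To make the idea concrete, the sketch below simulates heaping and a validation-based correction. It is not the method of the paper, which the abstract does not specify. It assumes a hypothetical heaping model (a share of respondents round down to a multiple of 5) and swaps in plain regression calibration: the true value is regressed on the reported value in a validation subsample, and the fitted line is applied to the remaining reports. All names, sample sizes, and distribution parameters are illustrative assumptions.

```python
import random
import statistics


def heap_down(value, base=5.0):
    """Hypothetical heaping model: round down to a multiple of `base`."""
    return base * (value // base)


def simulate(n=20000, heap_prob=0.6, seed=1):
    """Draw true values and self-reports, a share of which are heaped."""
    rng = random.Random(seed)
    true = [rng.gauss(70.0, 12.0) for _ in range(n)]
    reported = [heap_down(x) if rng.random() < heap_prob else x for x in true]
    return true, reported


def fit_line(x, y):
    """Ordinary least squares fit of y = a + b * x; returns (a, b)."""
    mx, my = statistics.fmean(x), statistics.fmean(y)
    sxx = sum((xi - mx) ** 2 for xi in x)
    sxy = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
    b = sxy / sxx
    return my - b * mx, b


true, reported = simulate()

# Assumption: the first 2000 respondents form the validation sample,
# i.e. both the self-report and the true measurement are available.
n_val = 2000
a, b = fit_line(reported[:n_val], true[:n_val])

# Apply the calibration line to the self-reports outside the validation sample.
calibrated = [a + b * r for r in reported[n_val:]]

true_mean = statistics.fmean(true[n_val:])
naive_bias = statistics.fmean(reported[n_val:]) - true_mean
calibrated_bias = statistics.fmean(calibrated) - true_mean
print(f"naive bias: {naive_bias:.3f}, calibrated bias: {calibrated_bias:.3f}")
```

In this setup the naive mean of the self-reports sits below the true mean because the assumed heaping only rounds downward, while the calibrated mean should land much closer. A linear calibration of this kind targets the mean; it does not by itself recover the variance, so correcting the variance or regression parameters would need a more careful model.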
Keyphrases
  • electronic health record
  • big data
  • healthcare
  • magnetic resonance imaging
  • data analysis
  • machine learning
  • mass spectrometry
  • high resolution
  • health insurance