Login / Signup

Robust Multiple Regression.

David W ScottZhipeng Wang
Published in: Entropy (Basel, Switzerland) (2021)
As modern data analysis pushes the boundaries of classical statistics, it is timely to reexamine alternate approaches to dealing with outliers in multiple regression. As sample sizes and the number of predictors increase, interactive methodology becomes less effective. Likewise, with limited understanding of the underlying contamination process, diagnostics are likely to fail as well. In this article, we advocate for a non-likelihood procedure that attempts to quantify the fraction of bad data as a part of the estimation step. These ideas also allow for the selection of important predictors under some assumptions. As there are many robust algorithms available, running several and looking for interesting differences is a sensible strategy for understanding the nature of the outliers.
Keyphrases
  • data analysis
  • machine learning
  • risk assessment
  • drinking water
  • electronic health record
  • deep learning
  • minimally invasive
  • high intensity
  • health risk
  • human health
  • heavy metals