Login / Signup

Adaptive Supervised Learning on Data Streams in Reproducing Kernel Hilbert Spaces with Data Sparsity Constraint.

Haodong WangQuefeng LiYufeng Liu
Published in: Stat (2022)
Data are generated at an unprecedented rate and scale these days across many disciplines. The field of streaming data analysis has emerged as a result of new data collection and storage technologies in various areas, such as air pollution monitoring, detection of traffic congestion, disease surveillance, and recommendation systems. In this paper, we consider the problem of model estimation for data streams in reproducing kernel Hilbert spaces. We propose an adaptive supervised learning method with a data sparsity constraint that uses limited storage spaces and can handle non-stationary models. We demonstrate the competitive performance of the proposed method using simulations and analysis of the bike sharing dataset.
Keyphrases
  • data analysis
  • electronic health record
  • air pollution
  • big data
  • machine learning
  • public health
  • healthcare
  • cystic fibrosis
  • social media
  • molecular dynamics
  • lung function
  • health information
  • quantum dots