Login / Signup

An additive Gaussian process regression model for interpretable non-parametric analysis of longitudinal data.

Lu ChengSiddharth RamchandranTommi VatanenNiina LietzénRiitta LahesmaaAki VehtariHarri Lähdesmäki
Published in: Nature communications (2019)
Biomedical research typically involves longitudinal study designs where samples from individuals are measured repeatedly over time and the goal is to identify risk factors (covariates) that are associated with an outcome value. General linear mixed effect models are the standard workhorse for statistical analysis of longitudinal data. However, analysis of longitudinal data can be complicated for reasons such as difficulties in modelling correlated outcome values, functional (time-varying) covariates, nonlinear and non-stationary effects, and model inference. We present LonGP, an additive Gaussian process regression model that is specifically designed for statistical analysis of longitudinal data, which solves these commonly faced challenges. LonGP can model time-varying random effects and non-stationary signals, incorporate multiple kernel learning, and provide interpretable results for the effects of individual covariates and their interactions. We demonstrate LonGP's performance and accuracy by analysing various simulated and real longitudinal -omics datasets.
Keyphrases
  • electronic health record
  • risk factors
  • cross sectional
  • big data
  • liquid chromatography