Login / Signup

Comparison of Methods for Estimating Temporal Topic Models From Primary Care Clinical Text Data: Retrospective Closed Cohort Study.

Christopher MeaneyMichael EscobarTherese A StukelPeter C AustinLiisa Jaakkimainen
Published in: JMIR medical informatics (2022)
Nonnegative matrix factorization, latent Dirichlet allocation, structural topic model, and BERTopic are based on different underlying statistical frameworks (eg, linear algebra and optimization, Bayesian graphical models, and neural embeddings), require tuning unique hyperparameters (optimizers, priors, etc), and have distinct computational requirements (data structures, computational hardware, etc). Despite the heterogeneity in statistical methodology, the learned latent topical summarizations and their temporal evolution over the study period were consistently estimated. Temporal topic models represent an interesting class of models for characterizing and monitoring the primary health care system.
Keyphrases
  • primary care
  • electronic health record
  • big data
  • high resolution
  • single cell
  • artificial intelligence
  • data analysis
  • neural network