Login / Signup

The first step is the hardest: pitfalls of representing and tokenizing temporal data for large language models.

Dimitris SpathisFahim Kawsar
Published in: Journal of the American Medical Informatics Association : JAMIA (2024)
We underscore that despite promising capabilities, LLMs cannot meaningfully process temporal data unless the input representation is addressed. We argue that this paradigm shift in how we leverage pretrained models will particularly affect the area of biomedical signals, given the lack of modality-specific foundation models.
Keyphrases
  • electronic health record
  • autism spectrum disorder
  • machine learning
  • data analysis
  • artificial intelligence
  • neural network