A large language model-based generative natural language processing framework fine-tuned on clinical notes accurately extracts headache frequency from electronic health records.

Chia-Chun Chiang, Man Luo, Gina M Dumkrieger, Shubham Trivedi, Yi-Chieh Chen, Chieh-Ju Chao, Todd J Schwedt, Abeed Sarker, Imon Banerjee

Published in: Headache (2024)

score. It overcame several challenges arising from the different ways clinicians document headache frequency, which traditional NLP models could not easily handle. We also showed that GPT-2-based frameworks outperformed ClinicalBERT in accuracy when extracting headache frequency from clinical notes. To facilitate research in the field, we released the GPT-2 generative model and inference code on GitHub under an open-source license for community use. Additional fine-tuning of the algorithm might be required when it is applied to different healthcare systems or to other clinical use cases.

Keyphrases

healthcare
electronic health record
autism spectrum disorder
machine learning
mental health
health insurance
neural network