Leveraging GPT-4 for identifying cancer phenotypes in electronic health records: a performance comparison between GPT-4, GPT-3.5-turbo, Flan-T5, Llama-3-8B, and spaCy's rule-based and machine learning-based methods.

Kriti Bhattarai Inez Y OhJonathan Moran SierraJonathan TangPhilip Richard Orrin PayneZach AbramsAlbert M Lai

Published in: JAMIA open (2024)

GPT-4 improves clinical phenotype identification due to its robust pre-training and remarkable pattern recognition capability on the embedded tokens. It demonstrates data-driven effectiveness even with limited context in the input. While rule-based models remain useful for some tasks, GPT models offer improved contextual understanding of the text, and robust clinical phenotype extraction.

Keyphrases

electronic health record
machine learning
randomized controlled trial
papillary thyroid
working memory
clinical decision support
squamous cell carcinoma
artificial intelligence
squamous cell