Investigating the Impact of Prompt Engineering on the Performance of Large Language Models for Standardizing Obstetric Diagnosis Text: Comparative Study.

Lei Wang, Wenshuai Bi, Suling Zhao, Yinyao Ma, Longting Lv, Chenwei Meng, Jingru Fu, Hanlin Lv
Published in: JMIR Formative Research (2024)
We applied LLMs to standardize diagnoses, designed 4 different prompts, and compared the results with those generated by the BERT model. Our findings indicate that QWEN prompts largely outperformed the other prompts, with precision comparable to that of the BERT model. These results demonstrate the potential of unsupervised approaches to improve the efficiency of aligning diagnostic terms in daily research and to uncover hidden informational value in patient data.
Keyphrases
  • machine learning
  • physical activity
  • autism spectrum disorder
  • electronic health record
  • case report
  • healthcare
  • smoking cessation
  • big data
  • risk assessment
  • climate change