Assessing the Ability of a Large Language Model to Score Free-Text Medical Student Clinical Notes: Quantitative Study.
Harry B Burke, Albert Hoang, Joseph O Lopreiato, Heidi King, Paul A Hemmer, Michael Montgomery, Viktoria Gagarin
Published in: JMIR Medical Education (2024)
ChatGPT demonstrated a significantly lower error rate than the standardized patients. This is the first study to assess the ability of a generative pretrained transformer (GPT) program to score medical students' standardized patient-based free-text clinical notes. It is expected that, in the near future, large language models will provide real-time feedback to practicing physicians regarding their free-text notes. GPT artificial intelligence programs represent an important advance in medical education and medical practice.
Keyphrases
- artificial intelligence
- primary care
- healthcare
- machine learning
- end stage renal disease
- smoking cessation
- medical education
- medical students
- autism spectrum disorder
- big data
- ejection fraction
- newly diagnosed
- deep learning
- chronic kidney disease
- high resolution
- case report
- prognostic factors
- patient reported outcomes