Evaluating the competency of ChatGPT in MRCP Part 1 and a systematic literature review of its capabilities in postgraduate medical assessments.

Oliver Vij, Henry Calver, Nikki Myall, Mrinalini Dey, Koushan Kouranloo

Published in: PLOS ONE (2024)

ChatGPT-4 performed above the passing level in the majority of the UK postgraduate medical examinations on which it was tested. ChatGPT is prone to hallucinations, fabrications, and reduced explanation accuracy, which could limit its potential as a learning tool. The potential for these errors is inherent to LLMs and may always be a limitation for medical applications of ChatGPT.

Keyphrases

healthcare
medical education
risk assessment
cross-sectional