Assessment of Resident and AI Chatbot Performance on the University of Toronto Family Medicine Residency Progress Test: Comparative Study.

Ryan St Huang Kevin Jia Qi Lu Christopher Meaney Joel Kemppainen Angela Punnett Fok-Han Leung

Published in: JMIR medical education (2023)

GPT-4 significantly outperforms both GPT-3.5 and Family Medicine residents on a multiple-choice medical knowledge test designed for Family Medicine residents. GPT-4 provides a logical rationale for its response choice, ruling out other answer choices efficiently and with concise justification. Its high degree of accuracy and advanced reasoning capabilities facilitate its potential applications in medical education, including the creation of exam questions and scenarios as well as serving as a resource for medical knowledge or information on community services.

Keyphrases

healthcare
medical education
climate change
mental health
patient safety
primary care
decision making
artificial intelligence
social media
machine learning
quality improvement
medical students