Comparison of ChatGPT-3.5, ChatGPT-4, and Orthopaedic Resident Performance on Orthopaedic Assessment Examinations.

Patrick Allan Massey, Carver Montgomery, Andrew S Zhang

Published in: The Journal of the American Academy of Orthopaedic Surgeons (2023)

Orthopaedic residents answered more questions correctly than ChatGPT-3.5 and GPT-4 on orthopaedic assessment examinations. GPT-4 outperformed ChatGPT-3.5 on orthopaedic resident assessment examination questions. Both ChatGPT-3.5 and GPT-4 performed better on text-only questions than on questions with images. It is unlikely that either GPT-4 or ChatGPT-3.5 would pass the American Board of Orthopaedic Surgery written examination.
