ChatGPT With GPT-4 Outperforms Emergency Department Physicians in Diagnostic Accuracy: Retrospective Analysis.

John Michael Hoppe Matthias K Auer Anna Strüven Steffen Massberg Christopher Stremmel

Published in: Journal of medical Internet research (2024)

In this study, which compared the diagnostic accuracy of GPT-3.5, GPT-4, and ED resident physicians against a discharge diagnosis gold standard, GPT-4 outperformed both the resident physicians and its predecessor, GPT-3.5. Despite the retrospective design of the study and its limited sample size, the results underscore the potential of AI as a supportive diagnostic tool in ED settings.

Keyphrases

emergency department
primary care
quality improvement
machine learning
drug induced