Quality and Accountability of ChatGPT in Health Care in Low- and Middle-Income Countries: Simulated Patient Study.

Yafei Si, Yuyi Yang, Xi Wang, Jiaqi Zu, Xi Chen, Xiaojing Fan, Ruopeng An, Sen Gong

Published in: Journal of Medical Internet Research (2024)

Using simulated patients to mimic 9 established noncommunicable and infectious diseases, we assessed ChatGPT's performance in treatment recommendations for common diseases in low- and middle-income countries. ChatGPT achieved high accuracy in both diagnosis (20/27, 74%) and medication prescription (22/27, 82%) but recommended unnecessary or harmful medications at a concerning rate (23/27, 85%), even when the diagnosis was correct. ChatGPT performed better in managing noncommunicable diseases than infectious ones. These results highlight the need for cautious integration of AI into health care systems to ensure quality and safety.

Keyphrases

healthcare
infectious diseases
end stage renal disease
ejection fraction
chronic kidney disease
newly diagnosed
prognostic factors
quality improvement
artificial intelligence
emergency department
machine learning
patient reported outcomes
clinical practice
social media
smoking cessation
deep learning
electronic health record