Quality and Accountability of ChatGPT in Health Care in Low- and Middle-Income Countries: Simulated Patient Study.
Yafei SiYuyi YangXi WangJiaqi ZuXi ChenXiaojing FanRuopeng AnSen GongPublished in: Journal of medical Internet research (2024)
Using simulated patients to mimic 9 established noncommunicable and infectious diseases, we assessed ChatGPT's performance in treatment recommendations for common diseases in low- and middle-income countries. ChatGPT had a high level of accuracy in both correct diagnoses (20/27, 74%) and medication prescriptions (22/27, 82%) but a concerning level of unnecessary or harmful medications (23/27, 85%) even with correct diagnoses. ChatGPT performed better in managing noncommunicable diseases than infectious ones. These results highlight the need for cautious AI integration in health care systems to ensure quality and safety.
Keyphrases
- healthcare
- infectious diseases
- end stage renal disease
- ejection fraction
- chronic kidney disease
- newly diagnosed
- prognostic factors
- quality improvement
- artificial intelligence
- emergency department
- machine learning
- patient reported outcomes
- clinical practice
- social media
- smoking cessation
- deep learning
- electronic health record