Comparing the performance of ChatGPT GPT-4, Bard, and Llama-2 in the Taiwan Psychiatric Licensing Examination and in differential diagnosis with multi-center psychiatrists.

Dian-Jeng Li, Yu-Chen Kao, Shih-Jen Tsai, Ya-Mei Bai, Ta-Chuan Yeh, Che-Sheng Chu, Chih-Wei Hsu, Szu-Wei Cheng, Tien-Wei Hsu, Chih-Sung Liang, Wen-Pang Su

Published in: Psychiatry and Clinical Neurosciences (2024)

Compared with Bard and Llama-2, GPT-4 demonstrated superior abilities in identifying psychiatric symptoms and making clinical judgments. In addition, GPT-4's ability in differential diagnosis closely approached that of experienced psychiatrists. Among the three LLMs, GPT-4 showed promising potential as a valuable tool in psychiatric practice.

Keyphrases

mental health
healthcare
primary care
single cell
sleep quality
depressive symptoms
human health
climate change