Comparing the performance of ChatGPT GPT-4, Bard, and Llama-2 in the Taiwan Psychiatric Licensing Examination and in differential diagnosis with multi-center psychiatrists.
Dian-Jeng Li, Yu-Chen Kao, Shih-Jen Tsai, Ya-Mei Bai, Ta-Chuan Yeh, Che-Sheng Chu, Chih-Wei Hsu, Szu-Wei Cheng, Tien-Wei Hsu, Chih-Sung Liang, Wen-Pang Su
Published in: Psychiatry and Clinical Neurosciences (2024)
Compared with Bard and Llama-2, GPT-4 demonstrated superior abilities in identifying psychiatric symptoms and making clinical judgments. In addition, GPT-4's performance in differential diagnosis closely approached that of experienced psychiatrists. Among the three LLMs, GPT-4 showed the most promise as a tool in psychiatric practice.