Doctor versus AI: Patient and physician evaluation of large language model responses to rheumatology patient questions, a cross sectional study.
Carrie YeElric ZweckZechen MaJustin SmithSteven J KatzPublished in: Arthritis & rheumatology (Hoboken, N.J.) (2023)
Rheumatology patients rated AI-generated responses to patient questions similarly to physician-generated responses in terms of comprehensiveness, readability, and overall preference. However, rheumatologists rated AI-responses significantly poorer than physician-generated responses, suggesting that LLM-chatbot responses are inferior to physician responses, a difference of which patients may not be aware. This article is protected by copyright. All rights reserved.