Doctor versus AI: Patient and physician evaluation of large language model responses to rheumatology patient questions, a cross sectional study.

Carrie YeElric ZweckZechen MaJustin SmithSteven J Katz

Published in: Arthritis & rheumatology (Hoboken, N.J.) (2023)

Rheumatology patients rated AI-generated responses to patient questions similarly to physician-generated responses in terms of comprehensiveness, readability, and overall preference. However, rheumatologists rated AI-responses significantly poorer than physician-generated responses, suggesting that LLM-chatbot responses are inferior to physician responses, a difference of which patients may not be aware. This article is protected by copyright. All rights reserved.

Keyphrases

end stage renal disease
primary care
emergency department
chronic kidney disease
newly diagnosed
case report
artificial intelligence
prognostic factors
autism spectrum disorder
rheumatoid arthritis
patient reported outcomes
juvenile idiopathic arthritis
deep learning
social media