Login / Signup

Evaluating the Efficacy of AI Chatbots as Tutors in Urology: A Comparative Analysis of Responses to the 2022 In-Service Assessment of the European Board of Urology.

Matthias MayKatharina Körner-RiffardLisa KollitschMaximilian BurgerSabine D Brookman-MayMichael RauchenwaldMartin MarszalekKlaus Eredics
Published in: Urologia internationalis (2024)
LLMs exhibit suboptimal urology knowledge and unsatisfactory proficiency for educational purposes. The overall accuracy did not significantly improve when combining EA to FA. The error rates remained high ranging from 16 to 35%. Proficiency levels vary substantially across subtopics. Further development of medicine-specific LLMs is required before integration into urological training programs.
Keyphrases
  • urinary tract
  • healthcare
  • mental health
  • public health
  • artificial intelligence
  • machine learning
  • deep learning