Evaluating a Large Language Model's Ability to Answer Clinicians' Requests for Evidence Summaries.

Mallory N Blasingame Taneya Y Koonce Annette M WilliamsDario A GiuseJing Su Poppy A Krump Nunzia Bettinsoli Giuse

Published in: medRxiv : the preprint server for health sciences (2024)

Overall, the performance of a generative AI tool was promising. However, many included references could not be independently verified, and attempts were not made to assess whether any additional concepts introduced by aiChat were factually accurate. Thus, we envision this being the first of a series of investigations designed to further our understanding of how current and future versions of generative AI can be used and integrated into medical librarians' workflow.

Keyphrases

artificial intelligence
healthcare
current status
autism spectrum disorder
palliative care
high resolution
machine learning
deep learning
electronic health record
mass spectrometry