Assessing the Accuracy and Reliability of AI-Generated Responses to Patient Questions Regarding Spine Surgery.
Viknesh S KasthuriJacob GlueckHan PhamMohammad DaherMariah Balmaceno-CrissChristopher L McDonaldBassel G DieboAlan H DanielsPublished in: The Journal of bone and joint surgery. American volume (2024)
Bing's answers were generally accurate and adequately complete, with incorrect responses rectified upon re-querying. The plurality of information was sourced from commercial websites. The type of source, number of sources, and mean JAMA benchmark score were not significantly correlated with answer accuracy. These findings underscore the importance of ongoing evaluation and improvement of large language models to ensure reliable and informative results for patients seeking information regarding spine surgery online amid the integration of these models in the search experience.