ChatGPT's diagnostic performance based on textual vs. visual information compared to radiologists' diagnostic performance in musculoskeletal radiology.

Daisuke HoriuchiHiroyuki TatekawaTatsushi OuraTaro ShimonoShannon L WalstonHirotaka TakitaShu MatsushitaYasuhito MitsuyamaYukio MikiDaiju Ueda

Published in: European radiology (2024)

This study compared the diagnostic performance of GPT-4-based ChatGPT, GPT-4V-based ChatGPT, and radiologists in musculoskeletal radiology. GPT-4-based ChatGPT was comparable to radiology residents, but did not reach the level of board-certified radiologists. When utilizing ChatGPT, it is crucial to input appropriate descriptions of imaging findings rather than the images.

Keyphrases

artificial intelligence
deep learning
machine learning
convolutional neural network
high resolution
optical coherence tomography
healthcare
fluorescence imaging