ChatGPT's diagnostic performance based on textual vs. visual information compared to radiologists' diagnostic performance in musculoskeletal radiology.
Daisuke HoriuchiHiroyuki TatekawaTatsushi OuraTaro ShimonoShannon L WalstonHirotaka TakitaShu MatsushitaYasuhito MitsuyamaYukio MikiDaiju UedaPublished in: European radiology (2024)
This study compared the diagnostic performance of GPT-4-based ChatGPT, GPT-4V-based ChatGPT, and radiologists in musculoskeletal radiology. GPT-4-based ChatGPT was comparable to radiology residents, but did not reach the level of board-certified radiologists. When utilizing ChatGPT, it is crucial to input appropriate descriptions of imaging findings rather than the images.