Comparing the Diagnostic Performance of GPT-4-based ChatGPT, GPT-4V-based ChatGPT, and Radiologists in Challenging Neuroradiology Cases.
Daisuke HoriuchiHiroyuki TatekawaTatsushi OuraSatoshi OueShannon L WalstonHirotaka TakitaShu MatsushitaYasuhito MitsuyamaTaro ShimonoYukio MikiDaiju UedaPublished in: Clinical neuroradiology (2024)
While GPT-4-based ChatGPT demonstrated relatively higher diagnostic performance than GPT-4V-based ChatGPT, the diagnostic performance of GPT‑4 and GPT-4V-based ChatGPTs did not reach the performance level of either radiology residents or board-certified radiologists in challenging neuroradiology cases.
Keyphrases