Performance of GPT-4V in Answering the Japanese Otolaryngology Board Certification Examination Questions: Evaluation Study.
Masao NodaTakayoshi UenoRyota KoshuYuji TakasoMari Dias ShimadaChizu SaitoHisashi SugimotoHiroaki FushikiMakoto ItoAkihiro NomuraTomokazu YoshizakiPublished in: JMIR medical education (2024)
Examination of artificial intelligence's answering capabilities for the otolaryngology board certification examination improves our understanding of its potential and limitations in this field. Although the improvement was noted with the addition of translation and prompts, the accuracy rate for image-based questions was lower than that for text-based questions, suggesting room for improvement in GPT-4V at this stage. Furthermore, text-plus-image input answers a higher rate in image-based questions. Our findings imply the usefulness and potential of GPT-4V in medicine; however, future consideration of safe use methods is needed.