Performance of GPT-4V in Answering the Japanese Otolaryngology Board Certification Examination Questions: Evaluation Study.

Masao Noda Takayoshi Ueno Ryota Koshu Yuji Takaso Mari Dias Shimada Chizu Saito Hisashi Sugimoto Hiroaki Fushiki Makoto Ito Akihiro Nomura Tomokazu Yoshizaki

Published in: JMIR medical education (2024)

Examination of artificial intelligence's answering capabilities for the otolaryngology board certification examination improves our understanding of its potential and limitations in this field. Although the improvement was noted with the addition of translation and prompts, the accuracy rate for image-based questions was lower than that for text-based questions, suggesting room for improvement in GPT-4V at this stage. Furthermore, text-plus-image input answers a higher rate in image-based questions. Our findings imply the usefulness and potential of GPT-4V in medicine; however, future consideration of safe use methods is needed.

Keyphrases

artificial intelligence
deep learning
machine learning
big data
smoking cessation
current status
risk assessment