Evaluating the Efficacy of ChatGPT in Navigating the Spanish Medical Residency Entrance Examination (MIR): Promising Horizons for AI in Clinical Medicine.

Francisco Guillen-Grima Sara Guillen-AguinagaLaura Guillen-AguinagaRosa Alas-BrunLuc OnambeleWilfrido OrtegaRocio MontejoEnrique Aguinaga-Ontoso Paul Barach Inés Aguinaga-Ontoso

Published in: Clinics and practice (2023)

GPT-4 performs robustly on the Spanish MIR examination, with varying capabilities to discriminate knowledge across specialties. While the model's high success rate is commendable, understanding the error severity is critical, especially when considering AI's potential role in real-world medical practice and its implications for patient safety.

Keyphrases

patient safety
healthcare
cell proliferation
quality improvement
long non coding rna
artificial intelligence
long noncoding rna
primary care
machine learning
deep learning