Evaluating the Efficacy of ChatGPT in Navigating the Spanish Medical Residency Entrance Examination (MIR): Promising Horizons for AI in Clinical Medicine.
Francisco Guillen-GrimaSara Guillen-AguinagaLaura Guillen-AguinagaRosa Alas-BrunLuc OnambeleWilfrido OrtegaRocio MontejoEnrique Aguinaga-OntosoPaul BarachInés Aguinaga-OntosoPublished in: Clinics and practice (2023)
GPT-4 performs robustly on the Spanish MIR examination, with varying capabilities to discriminate knowledge across specialties. While the model's high success rate is commendable, understanding the error severity is critical, especially when considering AI's potential role in real-world medical practice and its implications for patient safety.