Login / Signup

Reasoning with large language models for medical question answering.

Mary M LucasJustin YangJon K PomeroyChristopher C Yang
Published in: Journal of the American Medical Informatics Association : JAMIA (2024)
The proposed iterative ensemble reasoning has the potential to improve the performance of LLMs in medical question answering tasks, particularly with the less powerful LLMs like GPT-3.5 turbo and Med42-70B, which may suggest that this is a promising approach for LLMs with lower capabilities. Additionally, the findings show that our approach helps to refine the reasoning generated by the LLM and thereby improve consistency even with the more powerful GPT-4 turbo. We also identify the potential and need for human-artificial intelligence teaming to improve the reasoning beyond the limits of the model.
Keyphrases