Reasoning with large language models for medical question answering.

Mary M LucasJustin YangJon K PomeroyChristopher C Yang

Published in: Journal of the American Medical Informatics Association : JAMIA (2024)

The proposed iterative ensemble reasoning has the potential to improve the performance of LLMs in medical question answering tasks, particularly with the less powerful LLMs like GPT-3.5 turbo and Med42-70B, which may suggest that this is a promising approach for LLMs with lower capabilities. Additionally, the findings show that our approach helps to refine the reasoning generated by the LLM and thereby improve consistency even with the more powerful GPT-4 turbo. We also identify the potential and need for human-artificial intelligence teaming to improve the reasoning beyond the limits of the model.

Keyphrases

artificial intelligence
machine learning
healthcare
big data
endothelial cells
deep learning
autism spectrum disorder
working memory
induced pluripotent stem cells
magnetic resonance imaging
magnetic resonance
risk assessment