Login / Signup

Large language models for biomedicine: foundations, opportunities, challenges, and best practices.

Satya S SahooJoseph M PlasekHua XuÖzlem UzunerTrevor CohenMeliha YetisgenHongfang LiuStephane M MeystreYanshan Wang
Published in: Journal of the American Medical Informatics Association : JAMIA (2024)
We focus on 3 broad categories of NLP tasks, namely natural language understanding, natural language inferencing, and natural language generation. We review the emerging trends in prompt tuning, instruction fine-tuning, and evaluation metrics used for LLMs while drawing attention to several issues that impact biomedical NLP applications, including falsehoods in generated text (confabulation/hallucinations), toxicity, and dataset contamination leading to overfitting. We also review potential approaches to address some of these current challenges in LLMs, such as chain of thought prompting, and the phenomena of emergent capabilities observed in LLMs that can be leveraged to address complex NLP challenge in biomedical applications.
Keyphrases
  • autism spectrum disorder
  • working memory
  • healthcare
  • primary care
  • risk assessment
  • human health
  • air pollution
  • oxidative stress
  • drinking water
  • smoking cessation
  • climate change
  • health risk