Exploring the Efficacy of Large Language Models in Summarizing Mental Health Counseling Sessions: Benchmark Study.

Prottay Kumar Adhikary, Aseem Srivastava, Shivani Kumar, Salam Michael Singh, Puneet Manuja, Jini K Gopinath, Vijay Krishnan, Swati Kedia Gupta, Koushik Sinha Deb, Tanmoy Chakraborty

Published in: JMIR Mental Health (2024)

Although LLMs fine-tuned specifically on mental health domain data score higher on automatic evaluation metrics, expert assessments indicate that these models are not yet reliable for clinical application. Further refinement and validation are necessary before they are implemented in practice.

Keyphrases

mental health
primary care
healthcare
mental illness
quality improvement
air pollution
machine learning
autism spectrum disorder
deep learning
electronic health record
big data
smoking cessation
HIV infected
neural network