Exploring the Efficacy of Large Language Models in Summarizing Mental Health Counseling Sessions: Benchmark Study.
Prottay Kumar AdhikaryAseem SrivastavaShivani KumarSalam Michael SinghPuneet ManujaJini K GopinathVijay KrishnanSwati Kedia GuptaKoushik Sinha DebTanmoy ChakrabortyPublished in: JMIR mental health (2024)
While LLMs fine-tuned specifically on mental health domain data display better performance based on automatic evaluation scores, expert assessments indicate that these models are not yet reliable for clinical application. Further refinement and validation are necessary before their implementation in practice.