Measuring Alliance and Symptom Severity in Psychotherapy Transcripts Using Bert Topic Modeling.
Christopher LalkTobias SteinbrennerWeronika KaniaAlexander PopkoRobin WesterJana SchaffrathSteffen EberhardtBrian SchwartzWolfgang LutzJulian RubelPublished in: Administration and policy in mental health (2024)
We aim to use topic modeling, an approach for discovering clusters of related words ("topics"), to predict symptom severity and therapeutic alliance in psychotherapy transcripts, while also identifying the most important topics and overarching themes for prediction. We analyzed 552 psychotherapy transcripts from 124 patients. Using BERTopic (Grootendorst, 2022), we extracted 250 topics each for patient and therapist speech. These topics were used to predict symptom severity and alliance with various competing machine-learning methods. Sensitivity analyses were calculated for a model based on 50 topics, LDA-based topic modeling, and a bigram model. Additionally, we grouped topics into themes using qualitative analysis and identified key topics and themes with eXplainable Artificial Intelligence (XAI). Symptom severity could be predicted with highest accuracy by patient topics ( r =0.45, 95%-CI 0.40, 0.51), whereas alliance was better predicted by therapist topics ( r =0.20, 95%-CI 0.16, 0.24). Drivers for symptom severity were themes related to health and negative experiences. Lower alliance was correlated with various themes, especially psychotherapy framework, income, and everyday life. This analysis shows the potential of using topic modeling in psychotherapy research allowing to predict several treatment-relevant metrics with reasonable accuracy. Further, the use of XAI allows for an analysis of the individual predictive value of topics and themes. Limitations entail heterogeneity across different topic modeling hyperparameters and a relatively small sample size.