Large Language Models Versus Expert Clinicians in Crisis Prediction Among Telemental Health Patients: Comparative Study.

Christine Lee Matthew Mohebbi Erin O'Callaghan Mirene Winsberg

Published in: JMIR mental health (2024)

GPT-4, with a simple prompt design, produced results on some metrics that approached those of a trained clinician. Additional work must be done before such a model can be piloted in a clinical setting. The model should undergo safety checks for bias, given evidence that LLMs can perpetuate the biases of the underlying data on which they are trained. We believe that LLMs hold promise for augmenting the identification of higher-risk patients at intake and potentially delivering more timely care to patients.

Keyphrases

end stage renal disease
healthcare
newly diagnosed
chronic kidney disease
palliative care
prognostic factors
big data
patient reported outcomes
clinical practice
climate change
social media
artificial intelligence
health insurance