Login / Signup

Sociodemographic Variables in Offender and Non-Offender Patients Diagnosed with Schizophrenia Spectrum Disorders-An Explorative Analysis Using Machine Learning.

Andreas B HofmannMarc DörnerLena MachetanzJohannes Kirchebner
Published in: Healthcare (Basel, Switzerland) (2024)
With the growing availability of medical data and the enhanced performance of computers, new opportunities for data analysis in research are emerging. One of these modern approaches is machine learning (ML), an advanced form of statistics broadly defined as the application of complex algorithms. ML provides innovative methods for detecting patterns in complex datasets. This enables the identification of correlations or the prediction of specific events. These capabilities are especially valuable for multifactorial phenomena, such as those found in mental health and forensic psychiatry. ML also allows for the quantification of the quality of the emerging statistical model. The present study aims to examine various sociodemographic variables in order to detect differences in a sample of 370 offender patients and 370 non-offender patients, all with schizophrenia spectrum disorders, through discriminative model building using ML. In total, 48 variables were tested. Out of seven algorithms, gradient boosting emerged as the most suitable for the dataset. The discriminative model finally included three variables (regarding country of birth, residence status, and educational status) and yielded an area under the curve (AUC) of 0.65, meaning that the statistical discrimination of offender and non-offender patients based purely on the sociodemographic variables is rather poor.
Keyphrases