Resting-State EEG Signal for Major Depressive Disorder Detection: A Systematic Validation on a Large and Diverse Dataset.
Chien-Te WuHao-Chuan HuangShiuan HuangI-Ming ChenShih-Cheng LiaoChih-Ken ChenChemin LinShwu-Hua LeeMu-Hong ChenChia-Fen TsaiChang-Hsin WengLi-Wei KoTzyy-Ping JungYi-Hung LiuPublished in: Biosensors (2021)
Major depressive disorder (MDD) is a global healthcare issue and one of the leading causes of disability. Machine learning combined with non-invasive electroencephalography (EEG) has recently been shown to have the potential to diagnose MDD. However, most of these studies analyzed small samples of participants recruited from a single source, raising serious concerns about the generalizability of these results in clinical practice. Thus, it has become critical to re-evaluate the efficacy of various common EEG features for MDD detection across large and diverse datasets. To address this issue, we collected resting-state EEG data from 400 participants across four medical centers and tested classification performance of four common EEG features: band power (BP), coherence, Higuchi's fractal dimension, and Katz's fractal dimension. Then, a sequential backward selection (SBS) method was used to determine the optimal subset. To overcome the large data variability due to an increased data size and multi-site EEG recordings, we introduced the conformal kernel (CK) transformation to further improve the MDD as compared with the healthy control (HC) classification performance of support vector machine (SVM). The results show that (1) coherence features account for 98% of the optimal feature subset; (2) the CK-SVM outperforms other classifiers such as K-nearest neighbors (K-NN), linear discriminant analysis (LDA), and SVM; (3) the combination of the optimal feature subset and CK-SVM achieves a high five-fold cross-validation accuracy of 91.07% on the training set (140 MDD and 140 HC) and 84.16% on the independent test set (60 MDD and 60 HC). The current results suggest that the coherence-based connectivity is a more reliable feature for achieving high and generalizable MDD detection performance in real-life clinical practice.
Keyphrases
- major depressive disorder
- resting state
- functional connectivity
- machine learning
- bipolar disorder
- deep learning
- big data
- clinical practice
- healthcare
- artificial intelligence
- electronic health record
- loop mediated isothermal amplification
- label free
- multiple sclerosis
- risk assessment
- data analysis
- climate change
- health information