Sex-related patterns in the electroencephalogram and their relevance in machine learning classifiers.
Thomas JochmannMarc S SeibelElisabeth JochmannSheraz KhanMatti S HämäläinenJens HaueisenPublished in: Human brain mapping (2023)
Deep learning is increasingly being proposed for detecting neurological and psychiatric diseases from electroencephalogram (EEG) data but the method is prone to inadvertently incorporate biases from training data and exploit illegitimate patterns. The recent demonstration that deep learning can detect the sex from EEG implies potential sex-related biases in deep learning-based disease detectors for the many diseases with unequal prevalence between males and females. In this work, we present the male- and female-typical patterns used by a convolutional neural network that detects the sex from clinical EEG (81% accuracy in a separate test set with 142 patients). We considered neural sources, anatomical differences, and non-neural artifacts as sources of differences in the EEG curves. Using EEGs from 1140 patients, we found electrocardiac artifacts to be leaking into the supposedly brain activity-based classifiers. Nevertheless, the sex remained detectable after rejecting heart-related and other artifacts. In the cleaned data, EEG topographies were critical to detect the sex, but waveforms and frequencies were not. None of the traditional frequency bands was particularly important for sex detection. We were able to determine the sex even from EEGs with shuffled time points and therewith completely destroyed waveforms. Researchers should consider neural and non-neural sources as potential origins of sex differences in their data, they should maintain best practices of artifact rejection, even when datasets are large, and they should test their classifiers for sex biases.
Keyphrases
- deep learning
- machine learning
- convolutional neural network
- functional connectivity
- big data
- resting state
- artificial intelligence
- end stage renal disease
- newly diagnosed
- ejection fraction
- healthcare
- primary care
- risk factors
- computed tomography
- risk assessment
- quantum dots
- patient reported outcomes
- human health
- blood brain barrier
- cerebral ischemia
- label free
- drug induced