Data Augmentation Effects on Highly Imbalanced EEG Datasets for Automatic Detection of Photoparoxysmal Responses.
Fernando Moncada Martins, Víctor Manuel González Suárez, José Ramón Villar Flecha, Beatriz García López
Published in: Sensors (Basel, Switzerland) (2023)
Photosensitivity is a neurological disorder in which a person's brain produces epileptic discharges, known as Photoparoxysmal Responses (PPRs), when it receives certain visual stimuli. The current standardized diagnostic procedure used in hospitals consists of subjecting the patient to Intermittent Photic Stimulation in an attempt to trigger these phenomena. Brain activity is recorded with an Electroencephalogram (EEG), and clinical specialists manually look for the PPRs provoked during the session. Due to the nature of this disorder, long EEG recordings may contain very few PPR segments, so the resulting dataset is highly imbalanced. To tackle this problem, this research focused on applying Data Augmentation (DA) to create synthetic PPR segments from the real ones, improving the balance of the dataset and, thus, the overall performance of the Machine Learning techniques applied for automatic PPR detection. K-Nearest Neighbors and a One-Hidden-Dense-Layer Neural Network were employed to evaluate the performance of this DA stage. The results showed that DA improves the models, making them more robust and better able to generalize. A comparison with the results obtained from a previous experiment also showed a performance improvement of around 20% in Accuracy and Specificity, with no loss in Sensitivity. This project is currently being carried out with subjects at Burgos University Hospital, Spain.
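The balancing idea and the three reported measurements can be illustrated with a minimal Python sketch. The abstract does not say which augmentation transforms were used, so the amplitude scaling and Gaussian jitter below are assumptions chosen only for illustration, and every function and parameter name (`augment_minority`, `balance_minority`, `binary_metrics`, `noise_std`, `scale_range`) is hypothetical rather than taken from the paper.

```python
import random


def augment_minority(segments, n_new, noise_std=0.05, scale_range=(0.9, 1.1), seed=0):
    """Create n_new synthetic segments from real ones.

    Assumed transforms (not from the paper): random amplitude scaling
    followed by additive Gaussian jitter on every sample.
    """
    if n_new > 0 and not segments:
        raise ValueError("need at least one real segment to augment")
    rng = random.Random(seed)
    synthetic = []
    for _ in range(n_new):
        base = rng.choice(segments)        # pick a real PPR segment
        scale = rng.uniform(*scale_range)  # small amplitude change
        synthetic.append([scale * x + rng.gauss(0.0, noise_std) for x in base])
    return synthetic


def balance_minority(ppr_segments, non_ppr_segments, **kwargs):
    """Return real plus synthetic PPR segments, matching the majority class size."""
    deficit = max(0, len(non_ppr_segments) - len(ppr_segments))
    return list(ppr_segments) + augment_minority(ppr_segments, deficit, **kwargs)


def binary_metrics(y_true, y_pred):
    """Accuracy, Sensitivity and Specificity, with PPR encoded as label 1."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    return {
        "accuracy": (tp + tn) / len(pairs) if pairs else 0.0,
        "sensitivity": tp / (tp + fn) if (tp + fn) else 0.0,
        "specificity": tn / (tn + fp) if (tn + fp) else 0.0,
    }
```

In a sketch like this, augmentation would be applied only to the training split, so that synthetic segments derived from a real PPR never appear in the data used to compute the metrics.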
Keyphrases
- neural network
- resting state
- functional connectivity
- machine learning
- big data
- working memory
- deep learning
- electronic health record
- loop mediated isothermal amplification
- real time pcr
- high intensity
- label free
- artificial intelligence
- healthcare
- quality improvement
- white matter
- soft tissue
- cerebral ischemia
- multiple sclerosis
- high density
- blood brain barrier
- data analysis
- rna seq
- finite element