Generation of Individualized Synthetic Data for Augmentation of the Type 1 Diabetes Data Sets Using Deep Learning Models.
Josep NoguerIvan ContrerasOmer MujahidAleix BeneytoJosep VehíPublished in: Sensors (Basel, Switzerland) (2022)
In this paper, we present a methodology based on generative adversarial network architecture to generate synthetic data sets with the intention of augmenting continuous glucose monitor data from individual patients. We use these synthetic data with the aim of improving the overall performance of prediction models based on machine learning techniques. Experiments were performed on two cohorts of patients suffering from type 1 diabetes mellitus with significant differences in their clinical outcomes. In the first contribution, we have demonstrated that the chosen methodology is able to replicate the intrinsic characteristics of individual patients following the statistical distributions of the original data. Next, a second contribution demonstrates the potential of synthetic data to improve the performance of machine learning approaches by testing and comparing different prediction models for the problem of predicting nocturnal hypoglycemic events in type 1 diabetic patients. The results obtained for both generative and predictive models are quite encouraging and set a precedent in the use of generative techniques to train new machine learning models.
Keyphrases
- machine learning
- big data
- electronic health record
- type diabetes
- deep learning
- end stage renal disease
- ejection fraction
- newly diagnosed
- prognostic factors
- artificial intelligence
- obstructive sleep apnea
- cardiovascular disease
- metabolic syndrome
- physical activity
- data analysis
- adipose tissue
- insulin resistance
- high resolution
- skeletal muscle
- risk assessment
- climate change
- patient reported