Diabetes Detection Models in Mexican Patients by Combining Machine Learning Algorithms and Feature Selection Techniques for Clinical and Paraclinical Attributes: A Comparative Evaluation.
Antonio García-DomínguezCarlos E Galvan-TejadaRafael Magallanes-QuintanarHamurabi GamboaIrma Elizabeth Gonzalez-CurielJesús Peralta-RomeroMiguel Cruz-LópezPublished in: Journal of diabetes research (2023)
The development of medical diagnostic models to support healthcare professionals has witnessed remarkable growth in recent years. Among the prevalent health conditions affecting the global population, diabetes stands out as a significant concern. In the domain of diabetes diagnosis, machine learning algorithms have been widely explored for generating disease detection models, leveraging diverse datasets primarily derived from clinical studies. The performance of these models heavily relies on the selection of the classifier algorithm and the quality of the dataset. Therefore, optimizing the input data by selecting relevant features becomes essential for accurate classification. This research presents a comprehensive investigation into diabetes detection models by integrating two feature selection techniques: the Akaike information criterion and genetic algorithms. These techniques are combined with six prominent classifier algorithms, including support vector machine, random forest, k-nearest neighbor, gradient boosting, extra trees, and naive Bayes. By leveraging clinical and paraclinical features, the generated models are evaluated and compared to existing approaches. The results demonstrate superior performance, surpassing accuracies of 94%. Furthermore, the use of feature selection techniques allows for working with a reduced dataset. The significance of feature selection is underscored in this study, showcasing its pivotal role in enhancing the performance of diabetes detection models. By judiciously selecting relevant features, this approach contributes to the advancement of medical diagnostic capabilities and empowers healthcare professionals in making informed decisions regarding diabetes diagnosis and treatment.
Keyphrases
- machine learning
- deep learning
- type diabetes
- cardiovascular disease
- artificial intelligence
- big data
- glycemic control
- healthcare
- loop mediated isothermal amplification
- end stage renal disease
- public health
- label free
- chronic kidney disease
- ejection fraction
- adipose tissue
- climate change
- mental health
- dna methylation
- insulin resistance
- high resolution
- health information
- copy number
- sensitive detection
- social media
- neural network
- risk assessment