Login / Signup

A comparative analysis of converters of tabular data into image for the classification of Arboviruses using Convolutional Neural Networks.

Leonides Medeiros NetoSebastião Rogerio da Silva NetoPatricia Takako Endo
Published in: PloS one (2023)
Tabular data is commonly used in business and literature and can be analyzed using tree-based Machine Learning (ML) algorithms to extract meaningful information. Deep Learning (DL) excels in data such as image, sound, and text, but it is less frequently utilized with tabular data. However, it is possible to use tools to convert tabular data into images for use with Convolutional Neural Networks (CNNs) which are powerful DL models for image classification. The goal of this work is to compare the performance of converters for tabular data into images, select the best one, optimize a CNN using random search, and compare it with an optimized ML algorithm, the XGBoost. Results show that even a basic CNN, with only 1 convolutional layer, can reach comparable metrics to the XGBoost, which was trained on the original tabular data and optimized with grid search and feature selection. However, further optimization of the CNN with random search did not significantly improve its performance.
Keyphrases
  • deep learning
  • convolutional neural network
  • machine learning
  • artificial intelligence
  • big data
  • electronic health record
  • healthcare
  • oxidative stress
  • data analysis
  • neural network
  • social media