A Comparison Study of Deep Learning Methodologies for Music Emotion Recognition.
Pedro Lima LouroHugo RedinhoRicardo MalheiroRui Pedro PaivaRenato PandaPublished in: Sensors (Basel, Switzerland) (2024)
Classical machine learning techniques have dominated Music Emotion Recognition. However, improvements have slowed down due to the complex and time-consuming task of handcrafting new emotionally relevant audio features. Deep learning methods have recently gained popularity in the field because of their ability to automatically learn relevant features from spectral representations of songs, eliminating such necessity. Nonetheless, there are limitations, such as the need for large amounts of quality labeled data, a common problem in MER research. To understand the effectiveness of these techniques, a comparison study using various classical machine learning and deep learning methods was conducted. The results showed that using an ensemble of a Dense Neural Network and a Convolutional Neural Network architecture resulted in a state-of-the-art 80.20% F1 score, an improvement of around 5% considering the best baseline results, concluding that future research should take advantage of both paradigms, that is, combining handcrafted features with feature learning.
Keyphrases
- deep learning
- convolutional neural network
- machine learning
- artificial intelligence
- neural network
- big data
- autism spectrum disorder
- depressive symptoms
- randomized controlled trial
- optical coherence tomography
- computed tomography
- quality improvement
- electronic health record
- magnetic resonance
- working memory
- pet imaging
- positron emission tomography
- contrast enhanced
- current status