Login / Signup

Binaural Acoustic Scene Classification Using Wavelet Scattering, Parallel Ensemble Classifiers and Nonlinear Fusion.

Vahid HajihashemiAbdorreza Alavi GharahbaghPedro Miguel CruzMarta Campos FerreiraJosé J M MachadoJoão Manuel R S Tavares
Published in: Sensors (Basel, Switzerland) (2022)
The analysis of ambient sounds can be very useful when developing sound base intelligent systems. Acoustic scene classification (ASC) is defined as identifying the area of a recorded sound or clip among some predefined scenes. ASC has huge potential to be used in urban sound event classification systems. This research presents a hybrid method that includes a novel mathematical fusion step which aims to tackle the challenges of ASC accuracy and adaptability of current state-of-the-art models. The proposed method uses a stereo signal, two ensemble classifiers (random subspace), and a novel mathematical fusion step. In the proposed method, a stable, invariant signal representation of the stereo signal is built using Wavelet Scattering Transform (WST). For each mono, i.e., left and right, channel, a different random subspace classifier is trained using WST. A novel mathematical formula for fusion step was developed, its parameters being found using a Genetic algorithm. The results on the DCASE 2017 dataset showed that the proposed method has higher classification accuracy (about 95%), pushing the boundaries of existing methods.
Keyphrases
  • deep learning
  • machine learning
  • convolutional neural network
  • neural network
  • air pollution
  • risk assessment
  • human milk
  • body composition
  • human health
  • preterm infants
  • resistance training
  • high intensity