Login / Signup

Exploiting deep neural network and long short-term memory method-ologies in bioacoustic classification of LPC-based features.

Cihun-Siyong Alex GongChih-Hui Simon SuKuo-Wei ChaoYi-Chu ChaoChin-Kai SuWei-Hang Chiu
Published in: PloS one (2021)
The research describes the recognition and classification of the acoustic characteristics of amphibians using deep learning of deep neural network (DNN) and long short-term memory (LSTM) for biological applications. First, original data is collected from 32 species of frogs and 3 species of toads commonly found in Taiwan. Secondly, two digital filtering algorithms, linear predictive coding (LPC) and Mel-frequency cepstral coefficient (MFCC), are respectively used to collect amphibian bioacoustic features and construct the datasets. In addition, principal component analysis (PCA) algorithm is applied to achieve dimensional reduction of the training model datasets. Next, the classification of amphibian bioacoustic features is accomplished through the use of DNN and LSTM. The Pytorch platform with a GPU processor (NVIDIA GeForce GTX 1050 Ti) realizes the calculation and recognition of the acoustic feature classification results. Based on above-mentioned two algorithms, the sound feature datasets are classified and effectively summarized in several classification result tables and graphs for presentation. The results of the classification experiment of the different features of bioacoustics are verified and discussed in detail. This research seeks to extract the optimal combination of the best recognition and classification algorithms in all experimental processes.
Keyphrases
  • deep learning
  • machine learning
  • neural network
  • artificial intelligence
  • convolutional neural network
  • big data
  • working memory
  • magnetic resonance
  • magnetic resonance imaging