Login / Signup

A dataset for benchmarking Neotropical anuran calls identification in passive acoustic monitoring.

Juan Sebastián CañasMaría Paula Toro-GómezLarissa Sayuri Moreira SugaiHernán Darío Benítez RestrepoJorge RudasBreyner Posso BautistaLuís Felipe ToledoSimone DenaAdão Henrique Rosa DomingosFranco Leandro de SouzaSelvino Neckel-OliveiraAnderson da RosaVítor Carvalho-RochaJosé Vinícius BernardyJosé Luiz Massao Moreira SugaiCarolina Emília Dos SantosRogério Pereira BastosDiego LlusiaJuan Sebastián Ulloa
Published in: Scientific data (2023)
Global change is predicted to induce shifts in anuran acoustic behavior, which can be studied through passive acoustic monitoring (PAM). Understanding changes in calling behavior requires automatic identification of anuran species, which is challenging due to the particular characteristics of neotropical soundscapes. In this paper, we introduce a large-scale multi-species dataset of anuran amphibians calls recorded by PAM, that comprises 27 hours of expert annotations for 42 different species from two Brazilian biomes. We provide open access to the dataset, including the raw recordings, experimental setup code, and a benchmark with a baseline model of the fine-grained categorization problem. Additionally, we highlight the challenges of the dataset to encourage machine learning researchers to solve the problem of anuran call identification towards conservation policy. All our experiments and resources have been made available at https://soundclim.github.io/anuraweb/ .
Keyphrases
  • machine learning
  • bioinformatics analysis
  • healthcare
  • public health
  • deep learning
  • minimally invasive
  • mental health
  • air pollution
  • molecular dynamics