Computer Audition for Fighting the SARS-CoV-2 Corona Crisis-Introducing the Multitask Speech Corpus for COVID-19.
Kun QianMaximilian SchmittHuaiyuan ZhengTomoya KoikeJing HanJuan LiuWei JiJunjun DuanMeishu SongZijiang YangZhao RenShuo LiuZixing ZhangYoshiharu YamamotoBjörn W SchullerPublished in: IEEE internet of things journal (2021)
Computer audition (CA) has experienced a fast development in the past decades by leveraging advanced signal processing and machine learning techniques. In particular, for its noninvasive and ubiquitous character by nature, CA-based applications in healthcare have increasingly attracted attention in recent years. During the tough time of the global crisis caused by the coronavirus disease 2019 (COVID-19), scientists and engineers in data science have collaborated to think of novel ways in prevention, diagnosis, treatment, tracking, and management of this global pandemic. On the one hand, we have witnessed the power of 5G, Internet of Things, big data, computer vision, and artificial intelligence in applications of epidemiology modeling, drug and/or vaccine finding and designing, fast CT screening, and quarantine management. On the other hand, relevant studies in exploring the capacity of CA are extremely lacking and underestimated. To this end, we propose a novel multitask speech corpus for COVID-19 research usage. We collected 51 confirmed COVID-19 patients' in-the-wild speech data in Wuhan city, China. We define three main tasks in this corpus, i.e., three-category classification tasks for evaluating the physical and/or mental status of patients, i.e., sleep quality, fatigue, and anxiety. The benchmarks are given by using both classic machine learning methods and state-of-the-art deep learning techniques. We believe this study and corpus cannot only facilitate the ongoing research on using data science to fight against COVID-19, but also the monitoring of contagious diseases for general purpose.
Keyphrases
- big data
- artificial intelligence
- coronavirus disease
- deep learning
- machine learning
- sars cov
- sleep quality
- respiratory syndrome coronavirus
- public health
- convolutional neural network
- healthcare
- working memory
- physical activity
- end stage renal disease
- chronic kidney disease
- electronic health record
- depressive symptoms
- computed tomography
- ejection fraction
- patient reported outcomes
- risk factors
- newly diagnosed
- hearing loss
- magnetic resonance imaging
- health information
- peritoneal dialysis
- image quality