Quantifying the information in noisy epidemic curves.
Kris Varun ParagChristl Ann DonnellyAlexander Eugene ZarebskiPublished in: Nature computational science (2022)
Reliably estimating the dynamics of transmissible diseases from noisy surveillance data is an enduring problem in modern epidemiology. Key parameters are often inferred from incident time series, with the aim of informing policy-makers on the growth rate of outbreaks or testing hypotheses about the effectiveness of public health interventions. However, the reliability of these inferences depends critically on reporting errors and latencies innate to the time series. Here, we develop an analytical framework to quantify the uncertainty induced by under-reporting and delays in reporting infections, as well as a metric for ranking surveillance data informativeness. We apply this metric to two primary data sources for inferring the instantaneous reproduction number: epidemic case and death curves. We find that the assumption of death curves as more reliable, commonly made for acute infectious diseases such as COVID-19 and influenza, is not obvious and possibly untrue in many settings. Our framework clarifies and quantifies how actionable information about pathogen transmissibility is lost due to surveillance limitations.
Keyphrases
- public health
- infectious diseases
- adverse drug
- electronic health record
- big data
- immune response
- randomized controlled trial
- coronavirus disease
- sars cov
- healthcare
- liver failure
- global health
- physical activity
- systematic review
- health information
- cardiovascular disease
- type diabetes
- data analysis
- emergency department
- intensive care unit
- respiratory failure
- hepatitis b virus
- mass spectrometry
- deep learning