sourceR: Classification and source attribution of infectious agents among heterogeneous populations.
Poppy MillerJonathan C MarshallNigel FrenchChristopher P JewellPublished in: PLoS computational biology (2017)
Zoonotic diseases are a major cause of morbidity, and productivity losses in both human and animal populations. Identifying the source of food-borne zoonoses (e.g. an animal reservoir or food product) is crucial for the identification and prioritisation of food safety interventions. For many zoonotic diseases it is difficult to attribute human cases to sources of infection because there is little epidemiological information on the cases. However, microbial strain typing allows zoonotic pathogens to be categorised, and the relative frequencies of the strain types among the sources and in human cases allows inference on the likely source of each infection. We introduce sourceR, an R package for quantitative source attribution, aimed at food-borne diseases. It implements a Bayesian model using strain-typed surveillance data from both human cases and source samples, capable of identifying important sources of infection. The model measures the force of infection from each source, allowing for varying survivability, pathogenicity and virulence of pathogen strains, and varying abilities of the sources to act as vehicles of infection. A Bayesian non-parametric (Dirichlet process) approach is used to cluster pathogen strain types by epidemiological behaviour, avoiding model overfitting and allowing detection of strain types associated with potentially high "virulence". sourceR is demonstrated using Campylobacter jejuni isolate data collected in New Zealand between 2005 and 2008. Chicken from a particular poultry supplier was identified as the major source of campylobacteriosis, which is qualitatively similar to results of previous studies using the same dataset. Additionally, the software identifies a cluster of 9 multilocus sequence types with abnormally high 'virulence' in humans. sourceR enables straightforward attribution of cases of zoonotic infection to putative sources of infection. As sourceR develops, we intend it to become an important and flexible resource for food-borne disease attribution studies.
Keyphrases
- endothelial cells
- escherichia coli
- drinking water
- antimicrobial resistance
- biofilm formation
- staphylococcus aureus
- pseudomonas aeruginosa
- induced pluripotent stem cells
- healthcare
- climate change
- machine learning
- human health
- public health
- physical activity
- gene expression
- microbial community
- risk assessment
- high resolution
- social media
- dna methylation
- deep learning
- mass spectrometry
- genome wide
- gram negative