Categorizing sequences of concern by function to better assess mechanisms of microbial pathogenesis.

Gene D Godbold, Anthony D Kappell, Danielle S LeSassier, Todd J Treangen, Krista L Ternus
Published in: Infection and immunity (2021)
To identify sequences with a role in microbial pathogenesis, we assessed the adequacy of their annotation by existing controlled vocabularies and sequence databases. Our goal was to regularize descriptions of microbial pathogenesis for improved integration with bioinformatic applications. Here we review the challenges of annotating sequences for pathogenic activity. We describe the categorization of more than 2,750 sequences from pathogenic microbes using a controlled vocabulary called Functions of Sequences of Concern (FunSoCs). These terms make the descriptions easy for both humans and machines to use. We provide a subset of 220 fully annotated sequences in the supplementary material as examples. The use of this compact (∼30 terms) controlled vocabulary has potential benefits for research in microbial genomics, public health, biosecurity, biosurveillance, and the characterization of new and emerging pathogens.
Keyphrases
  • microbial community
  • public health
  • genetic diversity
  • risk assessment
  • climate change
  • artificial intelligence