Genomic analysis of 38 Legionella species identifies large and diverse effector repertoires.
David BursteinFrancisco AmaroTal ZusmanZiv LifshitzOfir CohenJack A GilbertTal PupkoHoward A ShumanGil SegalPublished in: Nature genetics (2016)
Infection by the human pathogen Legionella pneumophila relies on the translocation of ∼ 300 virulence proteins, termed effectors, which manipulate host cell processes. However, almost no information exists regarding effectors in other Legionella pathogens. Here we sequenced, assembled and characterized the genomes of 38 Legionella species and predicted their effector repertoires using a previously validated machine learning approach. This analysis identified 5,885 predicted effectors. The effector repertoires of different Legionella species were found to be largely non-overlapping, and only seven core effectors were shared by all species studied. Species-specific effectors had atypically low GC content, suggesting exogenous acquisition, possibly from the natural protozoan hosts of these species. Furthermore, we detected numerous new conserved effector domains and discovered new domain combinations, which allowed the inference of as yet undescribed effector functions. The effector collection and network of domain architectures described here can serve as a roadmap for future studies of effector function and evolution.
Keyphrases
- type iii
- regulatory t cells
- dendritic cells
- machine learning
- genetic diversity
- healthcare
- single cell
- antimicrobial resistance
- high resolution
- cystic fibrosis
- pseudomonas aeruginosa
- genome wide
- social media
- transcription factor
- dna methylation
- immune response
- candida albicans
- gram negative
- biofilm formation
- deep learning