Charting the Lipopeptidome of Nonpathogenic Pseudomonas.
Catherine Cesa-LunaNiels GeudensLéa GirardVic De RooHassan R MakladJosé C MartinsMonica HöfteRené De MotPublished in: mSystems (2023)
A major source of pseudomonad-specialized metabolites is the nonribosomal peptide synthetases (NRPSs) assembling siderophores and lipopeptides. Cyclic lipopeptides (CLPs) of the Mycin and Peptin families are frequently associated with, but not restricted to, phytopathogenic species. We conducted an in silico analysis of the NRPSs encoded by lipopeptide biosynthetic gene clusters in nonpathogenic Pseudomonas genomes, covering 13 chemically diversified families. This global assessment of lipopeptide production capacity revealed it to be confined to the Pseudomonas fluorescens lineage, with most strains synthesizing a single type of CLP. Whereas certain lipopeptide families are specific for a taxonomic subgroup, others are found in distant groups. NRPS activation domain-guided peptide predictions enabled reliable family assignments, including identification of novel members. Focusing on the two most abundant lipopeptide families (Viscosin and Amphisin), a portion of their uncharted diversity was mapped, including characterization of two novel Amphisin family members (nepenthesin and oakridgin). Using NMR fingerprint matching, known Viscosin-family lipopeptides were identified in 15 (type) species spread across different taxonomic groups. A bifurcate genomic organization predominates among Viscosin-family producers and typifies Xantholysin-, Entolysin-, and Poaeamide-family producers but most families feature a single NRPS gene cluster embedded between cognate regulator and transporter genes. The strong correlation observed between NRPS system phylogeny and rpoD -based taxonomic affiliation indicates that much of the structural diversity is linked to speciation, providing few indications of horizontal gene transfer. The grouping of most NRPS systems in four superfamilies based on activation domain homology suggests extensive module dynamics driven by domain deletions, duplications, and exchanges. IMPORTANCE Pseudomonas species are prominent producers of lipopeptides that support proliferation in a multitude of environments and foster varied lifestyles. By genome mining of biosynthetic gene clusters (BGCs) with lipopeptide-specific organization, we mapped the global Pseudomonas lipopeptidome and linked its staggering diversity to taxonomy of the producers, belonging to different groups within the major Pseudomonas fluorescens lineage. Activation domain phylogeny of newly mined lipopeptide synthetases combined with previously characterized enzymes enabled assignment of predicted BGC products to specific lipopeptide families. In addition, novel peptide sequences were detected, showing the value of substrate specificity analysis for prioritization of BGCs for further characterization. NMR fingerprint matching proved an excellent tool to unequivocally identify multiple lipopeptides bioinformatically assigned to the Viscosin family, by far the most abundant one in Pseudomonas and with stereochemistry of all its current members elucidated. In-depth analysis of activation domains provided insight into mechanisms driving lipopeptide structural diversification.
Keyphrases
- biofilm formation
- genome wide
- copy number
- plant growth
- genome wide identification
- magnetic resonance
- single cell
- staphylococcus aureus
- pseudomonas aeruginosa
- deep learning
- dna methylation
- machine learning
- gene expression
- lymph node
- randomized controlled trial
- optical coherence tomography
- genetic diversity
- amino acid
- open label