Comparative Genomics and Metabolomics in the Genus Nocardia.
Daniel MännleShaun M K McKinnieShrikant S MantriKatharina SteinkeZeyin LuBradley S MooreNadine ZiemertLeonard KaysserPublished in: mSystems (2020)
Using automated genome analysis tools, it is often unclear to what degree genetic variability in homologous biosynthetic pathways relates to structural variation. This hampers strain prioritization and compound identification and can lead to overinterpretation of chemical diversity. Here, we assessed the metabolic potential of Nocardia, an underinvestigated actinobacterial genus that is known to comprise opportunistic human pathogens. Our analysis revealed a plethora of putative biosynthetic gene clusters of various classes, including polyketide, nonribosomal peptide, and terpenoid pathways. Furthermore, we used the highly conserved biosynthetic pathway for nocobactin-like siderophores to investigate how gene cluster differences correlate to structural differences in the produced compounds. Sequence similarity networks generated by BiG-SCAPE (Biosynthetic Gene Similarity Clustering and Prospecting Engine) showed the presence of several distinct gene cluster families. Metabolic profiling of selected Nocardia strains using liquid chromatography-mass spectrometry (LC-MS) metabolomics data, nuclear magnetic resonance (NMR) spectroscopy, and GNPS (Global Natural Product Social molecular networking) revealed that nocobactin-like biosynthetic gene cluster (BGC) families above a BiG-SCAPE threshold of 70% can be assigned to distinct structural types of nocobactin-like siderophores.IMPORTANCE Our work emphasizes that Nocardia represent a prolific source for natural products rivaling better-characterized genera such as Streptomyces or Amycolatopsis Furthermore, we showed that large-scale analysis of biosynthetic gene clusters using similarity networks with high stringency allows the distinction and prediction of natural product structural variations. This will facilitate future genomics-driven drug discovery campaigns.
Keyphrases
- mass spectrometry
- genome wide
- copy number
- liquid chromatography
- single cell
- magnetic resonance
- drug discovery
- machine learning
- healthcare
- dna methylation
- mental health
- dna damage
- magnetic resonance imaging
- endothelial cells
- high resolution
- rna seq
- multidrug resistant
- transcription factor
- dna repair
- risk assessment
- ms ms
- gram negative
- induced pluripotent stem cells
- single molecule
- high performance liquid chromatography