Login / Signup

PCA, PC-CVA, and Random Forest of GCIB-SIMS Data for the Elucidation of Bacterial Envelope Differences in Antibiotic Resistance Research.

Alfred FranssonKelly Dimovska NilssonAlex HendersonAnne FarewellJohn Stephen Fletcher
Published in: Analytical chemistry (2024)
Antibiotic resistance can rapidly spread through bacterial populations via bacterial conjugation. The bacterial membrane has an important role in facilitating conjugation, thus investigating the effects on the bacterial membrane caused by conjugative plasmids, antibiotic resistance, and genes involved in conjugation is of interest. Analysis of bacterial membranes was conducted using gas cluster ion beam-secondary ion mass spectrometry (GCIB-SIMS). The complexity of the data means that data analysis is important for the identification of changes in the membrane composition. Preprocessing of data and several analytical methods for identification of changes in bacterial membranes have been investigated. GCIB-SIMS data from Escherichia coli samples were subjected to principal components analysis (PCA), principal components-canonical variate analysis (PC-CVA), and Random Forests (RF) data analysis with the aim of extracting the maximum biological information. The influence of increasing replicate data was assessed, and the effect of diminishing biological variation was studied. Optimized m / z region-specific scaling provided improved clustering, with an increase in biologically significant peaks contributing to the loadings. PC-CVA improved clustering, provided clearer loadings, and benefited from larger data sets collected over several months. RF required larger sample numbers and while showing overlap with the PC-CVA, produced additional peaks of interest. The combination of PC-CVA and RF allowed very subtle differences between bacterial strains and growth conditions to be elucidated for the first time. Specifically, comparative analysis of an E. coli strain with and without the F-plasmid revealed changes in cyclopropanation of fatty acids, where the addition of the F-plasmid led to a reduction in cyclopropanation.
Keyphrases
  • data analysis
  • escherichia coli
  • electronic health record
  • big data
  • mass spectrometry
  • single cell
  • healthcare
  • fatty acid
  • liquid chromatography
  • cystic fibrosis
  • ms ms
  • health information