Login / Signup

BERNN: Enhancing classification of Liquid Chromatography Mass Spectrometry data with batch effect removal neural networks.

Simon J PelletierMickaël LeclercqFlorence Roux-DalvaiMatthijs B de GeusShannon LeslieWeiwei WangTukiet T LamAngus C NairnSteven E ArnoldBecky C CarlyleFrédéric PreciosoArnaud Droit
Published in: Nature communications (2024)
Liquid Chromatography Mass Spectrometry (LC-MS) is a powerful method for profiling complex biological samples. However, batch effects typically arise from differences in sample processing protocols, experimental conditions, and data acquisition techniques, significantly impacting the interpretability of results. Correcting batch effects is crucial for the reproducibility of omics research, but current methods are not optimal for the removal of batch effects without compressing the genuine biological variation under study. We propose a suite of Batch Effect Removal Neural Networks (BERNN) to remove batch effects in large LC-MS experiments, with the goal of maximizing sample classification performance between conditions. More importantly, these models must efficiently generalize in batches not seen during training. A comparison of batch effect correction methods across five diverse datasets demonstrated that BERNN models consistently showed the strongest sample classification performance. However, the model producing the greatest classification improvements did not always perform best in terms of batch effect removal. Finally, we show that the overcorrection of batch effects resulted in the loss of some essential biological variability. These findings highlight the importance of balancing batch effect removal while preserving valuable biological diversity in large-scale LC-MS experiments.
Keyphrases
  • mass spectrometry
  • liquid chromatography
  • neural network
  • machine learning
  • deep learning
  • tandem mass spectrometry
  • capillary electrophoresis
  • gas chromatography
  • artificial intelligence
  • rna seq