Login / Signup

LeGenD: determining N-glycoprofiles using an explainable AI-leveraged model with lectin profiling.

Haining LiAngelo G PeraltaSanne SchoffelenAnders Holmgaard HansenJohnny ArnsdorfSong-Min SchinnJonathan SkidmoreBiswa ChoudhuryMousumi PaulchakrabartiBjorn G VoldborgAustin W T ChiangNathan E Lewis
Published in: bioRxiv : the preprint server for biology (2024)
Glycosylation affects many vital functions of organisms. Therefore, its surveillance is critical from basic science to biotechnology, including biopharmaceutical development and clinical diagnostics. However, conventional glycan structure analysis faces challenges with throughput and cost. Lectins offer an alternative approach for analyzing glycans, but they only provide glycan epitopes and not full glycan structure information. To overcome these limitations, we developed LeGenD, a lectin and AI-based approach to predict N -glycan structures and determine their relative abundance in purified proteins based on lectin-binding patterns. We trained the LeGenD model using 309 glycoprofiles from 10 recombinant proteins, produced in 30 glycoengineered CHO cell lines. Our approach accurately reconstructed experimentally-measured N -glycoprofiles of bovine Fetuin B and IgG from human sera. Explanatory AI analysis with SHapley Additive exPlanations (SHAP) helped identify the critical lectins for glycoprofile predictions. Our LeGenD approach thus presents an alternative approach for N -glycan analysis.
Keyphrases
  • cell surface
  • artificial intelligence
  • public health
  • healthcare
  • mass spectrometry
  • microbial community
  • cell free
  • binding protein
  • antibiotic resistance genes