Login / Signup

Integration of mechanistic immunological knowledge into a machine learning pipeline improves predictions.

Anthony CulosAmy S TsaiNatalie StanleyMartin BeckerMohammad S GhaemiDavid R McIlwainRamin FallahzadehAthena TanadaHuda NassarCamilo EspinosaMaria XenochristouEdward GanioLaura PetersonXiaoyuan HanIna Annelies StelzerKazuo AndoDyani K GaudillièreThanaphong PhongpreechaIvana MariéAlan L ChangGary M ShawDavid K StevensonSean C BendallKara L DavisWendy FantlGarry P NolanTrevor HastieRobert TibshiraniMartin S AngstBrice L GaudilliereNima Aghaeepour
Published in: Nature machine intelligence (2020)
The dense network of interconnected cellular signalling responses that are quantifiable in peripheral immune cells provides a wealth of actionable immunological insights. Although high-throughput single-cell profiling techniques, including polychromatic flow and mass cytometry, have matured to a point that enables detailed immune profiling of patients in numerous clinical settings, the limited cohort size and high dimensionality of data increase the possibility of false-positive discoveries and model overfitting. We introduce a generalizable machine learning platform, the immunological Elastic-Net (iEN), which incorporates immunological knowledge directly into the predictive models. Importantly, the algorithm maintains the exploratory nature of the high-dimensional dataset, allowing for the inclusion of immune features with strong predictive capabilities even if not consistent with prior knowledge. In three independent studies our method demonstrates improved predictions for clinically relevant outcomes from mass cytometry data generated from whole blood, as well as a large simulated dataset. The iEN is available under an open-source licence.
Keyphrases
  • single cell
  • high throughput
  • machine learning
  • rna seq
  • big data
  • healthcare
  • end stage renal disease
  • artificial intelligence
  • electronic health record
  • newly diagnosed
  • deep learning
  • ejection fraction
  • prognostic factors