Login / Signup

Practical guide to building machine learning-based clinical prediction models using imbalanced datasets.

Jacklyn LuuEvgenia BorisenkoValerie PrzekopAdvait PatilJoshua A VillarrealJeff Choi
Published in: Trauma surgery & acute care open (2024)
Clinical prediction models often aim to predict rare, high-risk events, but building such models requires robust understanding of imbalance datasets and their unique study design considerations. This practical guide highlights foundational prediction model principles for surgeon-data scientists and readers who encounter clinical prediction models, from feature engineering and algorithm selection strategies to model evaluation and design techniques specific to imbalanced datasets. We walk through a clinical example using readable code to highlight important considerations and common pitfalls in developing machine learning-based prediction models. We hope this practical guide facilitates developing and critically appraising robust clinical prediction models for the surgical community.
Keyphrases
  • machine learning
  • healthcare
  • deep learning
  • big data
  • mental health
  • robot assisted
  • clinical evaluation