Login / Signup

Uncertainty in lung cancer stage for survival estimation via set-valued classification.

Savannah L BergquistGabriel A BrooksMary Beth LandrumNancy L KeatingSherri Rose
Published in: Statistics in medicine (2022)
The difficulty in identifying cancer stage in health care claims data has limited oncology quality of care and health outcomes research. We fit prediction algorithms for classifying lung cancer stage into three classes (stages I/II, stage III, and stage IV) using claims data, and then demonstrate a method for incorporating the classification uncertainty in survival estimation. Leveraging set-valued classification and split conformal inference, we show how a fixed algorithm developed in one cohort of data may be deployed in another, while rigorously accounting for uncertainty from the initial classification step. We demonstrate this process using SEER cancer registry data linked with Medicare claims data.
Keyphrases
  • machine learning
  • deep learning
  • electronic health record
  • big data
  • healthcare
  • health insurance
  • palliative care
  • papillary thyroid
  • social media
  • affordable care act
  • young adults
  • chronic pain