Impact of sample size on the stability of risk scores from clinical prediction models: a case study in cardiovascular disease.
Alexander PateRichard EmsleyMatthew SperrinGlen P MartinTjeerd van StaaPublished in: Diagnostic and prognostic research (2020)
Widely used cardiovascular disease risk prediction models suffer from high levels of instability induced by sampling variation. Many models will also suffer from overfitting (a closely linked concept), but at acceptable levels of overfitting, there may still be high levels of instability in individual risk. Stability of risk estimates should be a criterion when determining the minimum sample size to develop models.