Sequestration of imaging studies in MIDRC: stratified sampling to balance demographic characteristics of patients in a multi-institutional data commons.
Natalie BaughanHeather M WhitneyKenny H ChaBerkman SahinerTingting HuGrace Hyun KimMichael F McNitt-GrayKyle J MyersMaryellen L GigerPublished in: Journal of medical imaging (Bellingham, Wash.) (2023)
The developed multi-dimensional stratified sampling algorithm can partition a large dataset while maintaining balance across several variables, superior to the balance achieved from naïve randomization.