Reproducible biomedical benchmarking in the cloud: lessons from crowd-sourced data challenges.

Kyle Ellrott, Alex Buchanan, Allison Creason, Michael Mason, Thomas Schaffter, Bruce Hoff, James Eddy, John M Chilton, Thomas Yu, Joshua M Stuart, Julio Saez-Rodriguez, Gustavo Stolovitzky, Paul C Boutros, Justin Guinney
Published in: Genome Biology (2019)
Challenges are achieving broad acceptance for addressing many biomedical questions and enabling tool assessment. But ensuring that the methods evaluated are reproducible and reusable is complicated by the diversity of software architectures, input and output file formats, and computing environments. To mitigate these problems, some challenges have leveraged new virtualization and compute methods, requiring participants to submit cloud-ready software packages. We review recent data challenges with innovative approaches to model reproducibility and data sharing, and outline key lessons for improving quantitative biomedical data analysis through crowd-sourced benchmarking challenges.
Keyphrases
  • data analysis
  • electronic health record
  • big data
  • healthcare
  • high resolution
  • artificial intelligence