KMDATA: a curated database of reconstructed individual patient-level data from 153 oncology clinical trials.
Geoffrey G FellRobert A ReddAlyssa M VanderbeekRifaquat RahmanBill LouvJon McDunnAndrea ArfèBrian M AlexanderSteffen VentzLorenzo TrippaPublished in: Database : the journal of biological databases and curation (2021)
We created a database of reconstructed patient-level data from published clinical trials that includes multiple time-to-event outcomes such as overall survival and progression-free survival. Outcomes were extracted from Kaplan-Meier (KM) curves reported in 153 oncology Phase III clinical trial publications identified through a PubMed search of clinical trials in breast, lung, prostate and colorectal cancer, published between 2014 and 2016. For each trial that met our search criteria, we curated study-level information and digitized all reported KM curves with the software Digitizelt. We then used the digitized KM survival curves to estimate (possibly censored) patient-level time-to-event outcomes. Collections of time-to-event datasets from completed trials can be used to support the choice of appropriate trial designs for future clinical studies. Patient-level data allow investigators to tailor clinical trial designs to diseases and classes of treatments. Patient-level data also allow investigators to estimate the operating characteristics (e.g. power and type I error rate) of candidate statistical designs and methods. Database URL: https://10.6084/m9.figshare.14642247.v1.
Keyphrases
- clinical trial
- phase iii
- phase ii
- open label
- case report
- study protocol
- double blind
- electronic health record
- big data
- prostate cancer
- palliative care
- systematic review
- healthcare
- metabolic syndrome
- emergency department
- type diabetes
- deep learning
- machine learning
- randomized controlled trial
- insulin resistance
- health information
- current status
- glycemic control
- free survival
- benign prostatic hyperplasia