Login / Signup

Sparse Regression in Cancer Genomics: Comparing Variable Selection and Predictions in Real World Data.

Robert J O'SheaSophia TsokaGary J R CookVicky Goh
Published in: Cancer informatics (2021)
This analysis explores a novel approach for comparisons of model selection approaches in real genomic data from 5 cancers. Our benchmarking datasets have been made publicly available for use in future research. Our findings support the use of L 0 L 2 penalisation for structural selection and L 1 L 2 penalisation for coefficient recovery in genomic data. Evaluation of learning algorithms according to observed test performance in external genomic datasets yields valuable insights into actual test performance, providing a data-driven complement to internal cross-validation in genomic regression tasks.
Keyphrases