MFmap: A semi-supervised generative model matching cell lines to tumours and cancer subtypes.

Xiaoxiao Zhang, Maik Kschischo
Published in: PLOS ONE (2021)
Translating in vitro results from experiments with cancer cell lines to clinical applications requires the selection of appropriate cell line models. Here we present MFmap (model fidelity map), a machine learning model that simultaneously predicts the cancer subtype of a cell line and its similarity to an individual tumour sample. MFmap is a semi-supervised generative model that compresses high-dimensional gene expression, copy number variation and mutation data into low-dimensional latent representations informed by cancer subtype. The accuracy of the MFmap subtype prediction (test set F1 score >90%) is validated on ten different cancer datasets. We use breast cancer and glioblastoma cohorts as examples to show how subtype-specific drug sensitivity can be translated to individual tumour samples. The low-dimensional latent representations extracted by MFmap explain known and novel subtype-specific features and enable the analysis of cell-state transformations between different subtypes. From a methodological perspective, we report that MFmap is a semi-supervised method that simultaneously achieves good generative and predictive performance and thus opens opportunities in other areas of computational biology.
Keyphrases
  • papillary thyroid
  • machine learning
  • gene expression
  • squamous cell
  • copy number
  • DNA methylation
  • lymph node metastasis
  • mitochondrial DNA
  • artificial intelligence
  • genome wide
  • big data
  • RNA-seq