How to analyse the spatiotemporal tumour samples needed to investigate cancer evolution: A case study using paired primary and recurrent glioblastoma.
Alastair DroopAlexander BrunsGeorgette TannerNora RippausRuth MortonSally HarrisonHenry KingKatherine AshtonKhaja SyedMichael D JenkinsonAndrew BrodbeltAruna ChakrabartyAzzam IsmailSusan ShortLucy F SteadPublished in: International journal of cancer (2017)
Many traits of cancer progression (e.g., development of metastases or resistance to therapy) are facilitated by tumour evolution: Darwinian selection of subclones with distinct genotypes or phenotypes that enable such progression. Characterising these subclones provide an opportunity to develop drugs to better target their specific properties but requires the accurate identification of somatic mutations shared across multiple spatiotemporal tumours from the same patient. Current best practices for calling somatic mutations are optimised for single samples, and risk being too conservative to identify shared mutations with low prevalence in some samples. We reasoned that datasets from multiple matched tumours can be used for mutual validation and thus propose an adapted two-stage approach: (1) low-stringency mutation calling to identify mutations shared across samples irrespective of the weight of evidence in a single sample; (2) high-stringency mutation calling to further characterise mutations present in a single sample. We applied our approach to three-independent cohorts of paired primary and recurrent glioblastoma tumours, two of which have previously been analysed using existing approaches, and found that it significantly increased the amount of biologically relevant shared somatic mutations identified. We also found that duplicate removal was detrimental when identifying shared somatic mutations. Our approach is also applicable when multiple datasets e.g. DNA and RNA are available for the same tumour.