Kataegis in clinical and molecular subgroups of primary breast cancer.
Srinivas VeerlaJohan StaafPublished in: NPJ breast cancer (2024)
Kataegis is a hypermutation phenomenon characterized by localized clusters of single base pair substitution (SBS) reported in multiple cancer types. Despite a high frequency in breast cancer, large-scale analyses of kataegis patterns and associations with clinicopathological and molecular variables in established breast cancer subgroups are lacking. Therefore, WGS profiled primary breast cancers (n = 791) with associated clinical and molecular data layers, like RNA-sequencing data, were analyzed for kataegis frequency, recurrence, and associations with genomic contexts and functional elements, transcriptional patterns, driver alterations, homologous recombination deficiency (HRD), and prognosis in tumor subgroups defined by ER, PR, and HER2/ERBB2 status. Kataegis frequency was highest in the HER2-positive(p) subgroups, including both ER-negative(n)/positive(p) tumors (ERnHER2p/ERpHER2p). In TNBC, kataegis was neither associated with PAM50 nor TNBC mRNA subtypes nor with distant relapse in chemotherapy-treated patients. In ERpHER2n tumors, kataegis was associated with aggressive characteristics, including PR-negativity, molecular Luminal B subtype, higher mutational burden, higher grade, and expression of proliferation-associated genes. Recurrent kataegis loci frequently targeted regions commonly amplified in ER-positive tumors, while few recurrent loci were observed in TNBC. SBSs in kataegis loci appeared enriched in regions of open chromatin. Kataegis status was not associated with HRD in any subgroup or with distinct transcriptional patterns in unsupervised or supervised analysis. In summary, kataegis is a common hypermutation phenomenon in established breast cancer subgroups, particularly in HER2p subgroups, coinciding with an aggressive tumor phenotype in ERpHER2n disease. In TNBC, the molecular implications and associations of kataegis are less clear, including its prognostic value.
Keyphrases
- high frequency
- genome wide
- dna damage
- machine learning
- end stage renal disease
- gene expression
- transcription factor
- single molecule
- dna repair
- newly diagnosed
- chronic kidney disease
- endoplasmic reticulum
- estrogen receptor
- dna methylation
- electronic health record
- transcranial magnetic stimulation
- single cell
- oxidative stress
- ejection fraction
- signaling pathway
- minimally invasive
- binding protein
- peritoneal dialysis
- big data
- radiation therapy
- childhood cancer
- risk factors
- patient reported outcomes
- free survival
- locally advanced
- cancer therapy
- lymph node metastasis
- heat shock protein
- study protocol
- replacement therapy
- heat shock