Nucleotide Weight Matrices Reveal Ubiquitous Mutational Footprints of AID/APOBEC Deaminases in Human Cancer Genomes.
Igor B RogozinAbiel Roche-LimaArtem G LadaFrida BelinkyIvan A SidorenkoGalina V GlazkoVladimir N BabenkoDavid N CooperYouri I PavlovPublished in: Cancers (2019)
Cancer genomes accumulate nucleotide sequence variations that number in the tens of thousands per genome. A prominent fraction of these mutations is thought to arise as a consequence of the off-target activity of DNA/RNA editing cytosine deaminases. These enzymes, collectively called activation induced deaminase (AID)/APOBECs, deaminate cytosines located within defined DNA sequence contexts. The resulting changes of the original C:G pair in these contexts (mutational signatures) provide indirect evidence for the participation of specific cytosine deaminases in a given cancer type. The conventional method used for the analysis of mutable motifs is the consensus approach. Here, for the first time, we have adopted the frequently used weight matrix (sequence profile) approach for the analysis of mutagenesis and provide evidence for this method being a more precise descriptor of mutations than the sequence consensus approach. We confirm that while mutational footprints of APOBEC1, APOBEC3A, APOBEC3B, and APOBEC3G are prominent in many cancers, mutable motifs characteristic of the action of the humoral immune response somatic hypermutation enzyme, AID, are the most widespread feature of somatic mutation spectra attributable to deaminases in cancer genomes. Overall, the weight matrix approach reveals that somatic mutations are significantly associated with at least one AID/APOBEC mutable motif in all studied cancers.
Keyphrases
- papillary thyroid
- immune response
- squamous cell
- physical activity
- weight loss
- endothelial cells
- childhood cancer
- squamous cell carcinoma
- genome wide
- copy number
- machine learning
- lymph node metastasis
- weight gain
- single molecule
- gene expression
- deep learning
- nucleic acid
- young adults
- single cell
- inflammatory response
- clinical practice
- molecular dynamics
- amino acid
- drug induced
- diabetic rats
- stress induced
- pluripotent stem cells