Short sequence motif dynamics in the SARS-CoV-2 genome suggest a role for cytosine deamination in CpG reduction.
Mukhtar SadykovTobias MourierQingtian GuanArnab PainPublished in: bioRxiv : the preprint server for biology (2021)
RNA viruses use CpG reduction to evade the host cell defense, but the driving mechanisms are still largely unknown. In an attempt to address this we used a rapidly growing genomic dataset of SARS-CoV-2 with relevant metadata information. Remarkably, by simply ordering SARS-CoV-2 genomes by their date of collection, we find a progressive increase of C-to-U substitutions resulting in 5'-UCG-3' motif reduction that in turn have reduced the CpG frequency over just a few months of observation. This is consistent with APOBEC-mediated RNA editing resulting in CpG reduction, thus allowing the virus to escape ZAP-mediated RNA degradation. Our results thus link the dynamics of target sequences in the viral genome for two known host molecular defense mechanisms, mediated by the APOBEC and ZAP proteins.