Comprehensive survey of conserved RNA secondary structures in full-genome alignment of Hepatitis C virus.
Sandra TriebelKevin LamkiewiczNancy OntiverosBlake SweeneyPeter F StadlerAnton I PetrovMichael NiepmannManja MarzPublished in: Scientific reports (2024)
Hepatitis C virus (HCV) is a plus-stranded RNA virus that often chronically infects liver hepatocytes and causes liver cirrhosis and cancer. These viruses replicate their genomes employing error-prone replicases. Thereby, they routinely generate a large 'cloud' of RNA genomes (quasispecies) which-by trial and error-comprehensively explore the sequence space available for functional RNA genomes that maintain the ability for efficient replication and immune escape. In this context, it is important to identify which RNA secondary structures in the sequence space of the HCV genome are conserved, likely due to functional requirements. Here, we provide the first genome-wide multiple sequence alignment (MSA) with the prediction of RNA secondary structures throughout all representative full-length HCV genomes. We selected 57 representative genomes by clustering all complete HCV genomes from the BV-BRC database based on k-mer distributions and dimension reduction and adding RefSeq sequences. We include annotations of previously recognized features for easy comparison to other studies. Our results indicate that mainly the core coding region, the C-terminal NS5A region, and the NS5B region contain secondary structure elements that are conserved beyond coding sequence requirements, indicating functionality on the RNA level. In contrast, the genome regions in between contain less highly conserved structures. The results provide a complete description of all conserved RNA secondary structures and make clear that functionally important RNA secondary structures are present in certain HCV genome regions but are largely absent from other regions. Full-genome alignments of all branches of Hepacivirus C are provided in the supplement.
Keyphrases
- hepatitis c virus
- genome wide
- human immunodeficiency virus
- nucleic acid
- high resolution
- transcription factor
- magnetic resonance imaging
- clinical trial
- squamous cell carcinoma
- magnetic resonance
- young adults
- randomized controlled trial
- computed tomography
- emergency department
- lipopolysaccharide induced
- dengue virus
- inflammatory response
- papillary thyroid
- lps induced
- rna seq
- lymph node metastasis
- copy number