Login / Signup

SegVir: Reconstruction of Complete Segmented RNA Viral Genomes from Metatranscriptomes.

Xubo TangJiayu ShangGuowei ChenKei-Hang Katie ChanMang ShiYanni Sun
Published in: Molecular biology and evolution (2024)
Segmented RNA viruses are a complex group of RNA viruses with multisegment genomes. Reconstructing complete segmented viruses is crucial for advancing our understanding of viral diversity, evolution, and public health impact. Using metatranscriptomic data to identify known and novel segmented viruses has sped up the survey of segmented viruses in various ecosystems. However, the high genetic diversity and the difficulty in binning complete segmented genomes present significant challenges in segmented virus reconstruction. Current virus detection tools are primarily used to identify nonsegmented viral genomes. This study presents SegVir, a novel tool designed to identify segmented RNA viruses and reconstruct their complete genomes from complex metatranscriptomes. SegVir leverages both close and remote homology searches to accurately detect conserved and divergent viral segments. Additionally, we introduce a new method that can evaluate the genome completeness and conservation based on gene content. Our evaluations on simulated datasets demonstrate SegVir's superior sensitivity and precision compared to existing tools. Moreover, in experiments using real data, we identified some virus segments missing in the NCBI database, underscoring SegVir's potential to enhance viral metagenome analysis. The source code and supporting data of SegVir are available via https://github.com/HubertTang/SegVir.
Keyphrases
  • genetic diversity
  • sars cov
  • public health
  • electronic health record
  • big data
  • nucleic acid
  • copy number
  • sensitive detection
  • deep learning
  • quantum dots