Low-coverage whole genome sequencing for a highly selective cohort of severe COVID-19 patients.
Renato SantosVíctor Moreno-TorresIlduara Pintos-PascualOctavio CorralCarmen de MendozaVicente SorianoManuel CorpasPublished in: GigaByte (Hong Kong, China) (2024)
Despite the advances in genetic marker identification associated with severe COVID-19, the full genetic characterisation of the disease remains elusive. This study explores imputation in low-coverage whole genome sequencing for a severe COVID-19 patient cohort. We generated a dataset of 79 imputed variant call format files using the GLIMPSE1 tool, each containing an average of 9.5 million single nucleotide variants. Validation revealed a high imputation accuracy (squared Pearson correlation ≍0.97) across sequencing platforms, showcasing GLIMPSE1's ability to confidently impute variants with minor allele frequencies as low as 2% in individuals with Spanish ancestry. We carried out a comprehensive analysis of the patient cohort, examining hospitalisation and intensive care utilisation, sex and age-based differences, and clinical phenotypes using a standardised set of medical terms developed to characterise severe COVID-19 symptoms. The methods and findings presented here can be leveraged for future genomic projects to gain vital insights into health challenges like COVID-19.
Keyphrases
- sars cov
- coronavirus disease
- copy number
- early onset
- healthcare
- genome wide
- case report
- respiratory syndrome coronavirus
- mental health
- single cell
- drug induced
- risk assessment
- physical activity
- depressive symptoms
- climate change
- human health
- affordable care act
- social media
- current status
- health insurance
- sleep quality