Whole-genome sequencing of SARS-CoV-2 reveals the detection of G614 variant in Pakistan.
Massab UmairAamer IkramMuhammad SalmanAdnan KhurshidMasroor AlamNazish BadarRana SulemanFaheem TahirSalmaan SharifJoel MontgomeryShannon WhitmerJohn KlenaPublished in: PloS one (2021)
Since its emergence in China, severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) has spread worldwide including Pakistan. During the pandemic, whole genome sequencing has played an important role in understanding the evolution and genomic diversity of SARS-CoV-2. Although an unprecedented number of SARS-CoV-2 full genomes have been submitted in GISAID and NCBI, data from Pakistan is scarce. We report the sequencing, genomic characterization, and phylogenetic analysis of five SARS-CoV-2 strains isolated from patients in Pakistan. The oropharyngeal swabs of patients that were confirmed positive for SARS-CoV-2 through real-time RT-PCR at National Institute of Health, Pakistan, were selected for whole-genome sequencing. Sequencing was performed using NEBNext Ultra II Directional RNA Library Prep kit for Illumina (NEW ENGLAND BioLabs Inc., MA, US) and Illumina iSeq 100 instrument (Illumina, San Diego, US). Based on whole-genome analysis, three Pakistani SARS-CoV-2 strains clustered into the 20A (GH) clade along with the strains from Oman, Slovakia, United States, and Pakistani strain EPI_ISL_513925. The two 19B (S)-clade strains were closely related to viruses from India and Oman. Overall, twenty-nine amino acid mutations were detected in the current study genome sequences, including fifteen missense and four novel mutations. Notably, we have found a D614G (aspartic acid to glycine) mutation in spike protein of the sequences from the GH clade. The G614 variant carrying the characteristic D614G mutation has been shown to be more infectious that lead to its rapid spread worldwide. This report highlights the detection of GH and S clade strains and G614 variant from Pakistan warranting large-scale whole-genome sequencing of strains prevalent in different regions to understand virus evolution and to explore their genetic diversity.
Keyphrases
- sars cov
- respiratory syndrome coronavirus
- escherichia coli
- genetic diversity
- tertiary care
- end stage renal disease
- ejection fraction
- newly diagnosed
- amino acid
- coronavirus disease
- patient reported outcomes
- loop mediated isothermal amplification
- public health
- prognostic factors
- machine learning
- high resolution
- climate change
- copy number
- big data
- patient reported
- quantum dots