Login / Signup

An overview of germline variations in genes of primary immunodeficiences through integrative analysis of ClinVar, HGMD® and dbSNP databases.

Lyubov E SalnikovaDmitry S KolobkovDaria A SviridovaSerikbai K Abilev
Published in: Human genetics (2021)
Primary immunodeficiencies (PID) are a diverse group of genetic disorders caused by inadequate development and function of immune system. Identifying genetic etiology is important for genetic counselling and treatment decisions. Clinical relevance of genetic variants is a complex problem depending on gene-specific and variant specific genotype-phenotype interactions. To address this challenge, we aimed to characterize the pathogenic landscape of PID genes by combining the analysis of germline variations reported in ClinVar and HGMD® and identification of damaging variations available in dbSNP. We generated a joint ClinVar/HGMD database, which included 111,940 variants, among them 32,452 were classified as pathogenic/likely pathogenic. From a total of 5,415,794 bi- or multiallelic variants in PID genes recorded in dbSNP, we retrieved 38,291 high impact (HI) biallelic variants with presumably disruptive impact in the protein, of them 25,500 variants were not present in ClinVar/HGMD. Using a functional prediction algorithm, we additionally identified 28,507 deleterious and 56,016 neutral missense variants among dbSNP variants and created a collection of damaging and neutral variations in PID genes, not currently present in ClinVar/HGMD, with their allele frequencies and mappings to protein domains. The distribution of pathogenic variants from ClinVar/HGMD, HI variants and deleterious missense variants from dbSNP was analyzed in the context of hereditary pattern and gene specific metrics, such as pLI and haploinsufficiency. Our report summarized data on complex gene-specific variability in PID genes and might be useful for the identification of the most promising variants and gene regions for further study.
Keyphrases