Association study of GBA1 variants with MSA based on comprehensive sequence analysis -Pitfalls in short-read sequence analysis depending on the human reference genome.
Kenta OrimoJun MitsuiTakashi MatsukawaMasaki TanakaJunko NomotoHiroyuki IshiuraYosuke OmaeYosuke KawaiKatsushi Tokunaganull nullTatsushi TodaShoji TsujiPublished in: Journal of human genetics (2024)
Multiple system atrophy (MSA) is a neurodegenerative disorder characterized by various combinations of autonomic failure, parkinsonism, and cerebellar ataxia. To elucidate variants associated with MSA, we have been conducting short-read-based whole-genome sequence analysis. In the process of the association studies, we initially focused on GBA1, a previously proposed susceptibility gene for MSA, to evaluate whether GBA1 variants can be efficiently identified despite its extraordinarily high homology with its pseudogene, GBA1LP. To accomplish this, we conducted a short-read whole-genome sequence analysis with alignment to GRCh38 as well as Sanger sequence analysis and compared the results. We identified five variants with inconsistencies between the two pipelines, of which three variants (p.L483P, p.A495P-p.V499V, p.L483_M489delinsW) were the results of misalignment due to minor alleles in GBA1P1 registered in GRCh38. The miscalling events in these variants were resolved by alignment to GRCh37 as the reference genome, where the major alleles are registered. In addition, a structural variant was not properly identified either by short-read or by Sanger sequence analyses. Having accomplished correct variant calling, we identified three variants pathogenic for Gaucher disease (p.S310G, p.L483P, and p.L483_M489delinsW). Of these variants, the allele frequency of p.L483P (0.003) in the MSA cases was higher than that (0.0011) in controls. The meta-analysis incorporating a previous report demonstrated a significant association of p.L483P with MSA with an odds ratio of 2.85 (95% CI; 1.05 - 7.76, p = 0.0400).