Benchmarking pharmacogenomics genotyping tools: Performance analysis on short-read sequencing samples and depth-dependent evaluation.
Andreas HalmanSebastian LunkeSimon SadedinClaire MooreRachel ConyersPublished in: Clinical and translational science (2024)
Pharmacogenomics (PGx) investigates the influence of genetics on drug responses, enabling tailored treatments for personalized healthcare. This study assessed the accuracy of genotyping six genes using whole genome sequencing with four different computational tools and various sequencing depths. The effects of using different reference genomes (GRCh38 and GRCh37) and sequence aligners (BWA-MEM and Bowtie2) were also explored. The results showed generally minor variations in tool performance across most genes; however, more notable discrepancies were observed in the analysis of the complex CYP2D6 gene. Cyrius, a CYP2D6-specific tool, demonstrated the most robust performance, achieving the highest concordance rates for CYP2D6 in all instances, comparable to the consensus approach in most cases. There were rather small differences between the samples with 20× coverage depth and those with higher depth, but the decreased performance was more evident at lower depths, particularly at 5×. Additionally, variations in CYP2D6 results were observed when samples were aligned to different reference genomes using the same method, or to the same genome using different aligners, which led to reporting incorrect rare star alleles in several cases. These findings inform the selection of optimal PGx tools and methodologies as well as suggest that employing a consensus approach with two or more tools might be preferable for certain genes and tool combinations, especially at lower sequencing depths, to ensure accurate results. Additionally, we show how the upstream alignment can affect the performance of tools, an important factor to take into account.
Keyphrases
- genome wide
- healthcare
- genome wide identification
- dna methylation
- single cell
- adverse drug
- optical coherence tomography
- copy number
- high throughput
- gene expression
- bioinformatics analysis
- emergency department
- genome wide analysis
- clinical decision support
- genetic diversity
- transcription factor
- affordable care act
- health insurance
- high density