Login / Signup

KAGE: fast alignment-free graph-based genotyping of SNPs and short indels.

Ivar GryttenKnut Dagestad RandGeir Kjetil Sandve
Published in: Genome biology (2022)
Genotyping is a core application of high-throughput sequencing. We present KAGE, a genotyper for SNPs and short indels that is inspired by recent developments within graph-based genome representations and alignment-free methods. KAGE uses a pan-genome representation of the population to efficiently and accurately predict genotypes. Two novel ideas improve both the speed and accuracy: a Bayesian model incorporates genotypes from thousands of individuals to improve prediction accuracy, and a computationally efficient method leverages correlation between variants. We show that the accuracy of KAGE is at par with the best existing alignment-free genotypers, while being an order of magnitude faster.
Keyphrases
  • genome wide
  • dna methylation
  • copy number
  • high throughput sequencing
  • high throughput
  • neural network
  • working memory
  • single cell
  • deep learning