Login / Signup

vi-HMM: a novel HMM-based method for sequence variant identification in short-read data.

Man TangMohammad Shabbir HasanHongxiao ZhuLiqing ZhangXiaowei Wu
Published in: Human genomics (2019)
We propose vi-HMM, a hidden Markov model (HMM)-based method for calling SNPs and INDELs in mapped short-read data. This method allows transitions between hidden states (defined as "SNP," "Ins," "Del," and "Match") of adjacent genomic bases and determines an optimal hidden state path by using the Viterbi algorithm. The inferred hidden state path provides a direct solution to the identification of SNPs and INDELs. Simulation studies show that, under various sequencing depths, vi-HMM outperforms commonly used variant calling methods in terms of sensitivity and F1 score. When applied to the real data, vi-HMM demonstrates higher accuracy in calling SNPs and INDELs.
Keyphrases
  • genome wide
  • electronic health record
  • big data
  • machine learning
  • single molecule
  • dna methylation
  • data analysis
  • copy number
  • gene expression
  • single cell
  • bioinformatics analysis
  • neural network