Haplotype threading: accurate polyploid phasing from long reads.

Sven D Schrinner, Rebecca Serra Mari, Jana Ebler, Mikko Rautiainen, Lancelot Seillier, Julia J Reimer, Björn Usadel, Tobias Marschall, Gunnar W Klau
Published in: Genome biology (2020)
Resolving genomes at haplotype level is crucial for understanding the evolutionary history of polyploid species and for designing advanced breeding strategies. Polyploid phasing still presents considerable challenges, especially in regions of collapsing haplotypes. We present WHATSHAP POLYPHASE, a novel two-stage approach that addresses these challenges by (i) clustering reads and (ii) threading the haplotypes through the clusters. Our method outperforms the state of the art in terms of phasing quality. Using a real tetraploid potato dataset, we demonstrate how to assemble local genomic regions of interest at the haplotype level. Our algorithm is implemented as part of the widely used open-source tool WhatsHap.
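To make the two-stage idea concrete, the following is a minimal toy sketch in Python. It is not the WhatsHap implementation and does not reproduce the published algorithm: the greedy clustering rule, the coverage-proportional slot allocation, and all names (`cluster_reads`, `thread`, `min_agreement`) are assumptions introduced here for illustration only. Reads are modelled as dictionaries mapping a variant position to an allele (0 or 1), and every position passed to `thread` is assumed to be covered by at least one read.

```python
from collections import Counter
from itertools import permutations


def agreement(read, cluster):
    """Fraction of shared positions where the read matches the cluster's majority allele."""
    shared = [p for p in read if p in cluster]
    if not shared:
        return 0.0
    same = sum(read[p] == cluster[p].most_common(1)[0][0] for p in shared)
    return same / len(shared)


def cluster_reads(reads, min_agreement=0.9):
    """Stage (i), toy version: greedily group reads that agree on their alleles.

    Each cluster maps position -> Counter of observed alleles.
    """
    clusters = []
    for read in reads:
        best = max(clusters, key=lambda c: agreement(read, c), default=None)
        if best is None or agreement(read, best) < min_agreement:
            best = {}  # no sufficiently similar cluster: open a new one
            clusters.append(best)
        for pos, allele in read.items():
            best.setdefault(pos, Counter())[allele] += 1
    return clusters


def thread(clusters, positions, ploidy):
    """Stage (ii), toy version: route `ploidy` haplotypes through the clusters.

    At each position, clusters receive haplotype slots in proportion to their
    read coverage, so a high-coverage cluster can carry several collapsed
    haplotypes. The slot order is chosen to minimise switches between clusters.
    """
    haplotypes = [[] for _ in range(ploidy)]
    prev = None
    for pos in positions:
        cov = {i: sum(c[pos].values()) for i, c in enumerate(clusters) if pos in c}
        total = sum(cov.values())
        # Largest-remainder allocation of ploidy slots by coverage share.
        quota = {i: ploidy * n / total for i, n in cov.items()}
        slots = {i: int(q) for i, q in quota.items()}
        by_remainder = sorted(cov, key=lambda i: quota[i] - slots[i], reverse=True)
        for i in by_remainder[: ploidy - sum(slots.values())]:
            slots[i] += 1
        chosen = [i for i, n in slots.items() for _ in range(n)]
        if prev is not None:
            # Keep each haplotype on its previous cluster where possible.
            chosen = min(
                permutations(chosen),
                key=lambda t: sum(a != b for a, b in zip(t, prev)),
            )
        prev = tuple(chosen)
        for hap, i in zip(haplotypes, prev):
            hap.append(clusters[i][pos].most_common(1)[0][0])
    return haplotypes


# Tetraploid toy input: three reads support one allele pattern, one read the other.
reads = [
    {0: 0, 1: 0, 2: 0},
    {0: 1, 1: 1, 2: 1},
    {0: 0, 1: 0, 2: 0},
    {0: 0, 1: 0, 2: 0},
]
clusters = cluster_reads(reads)
haplotypes = thread(clusters, positions=[0, 1, 2], ploidy=4)
print(haplotypes)
```

On this toy input the reads fall into two clusters with a 3:1 coverage ratio, so three of the four haplotypes are threaded through the first cluster and one through the second. This mirrors, in a very simplified way, how identical (collapsed) haplotypes can share a single cluster.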
Keyphrases
  • machine learning
  • deep learning
  • single cell
  • genome wide
  • rna seq
  • quality improvement
  • gene expression
  • mass spectrometry
  • neural network