Login / Signup

Bayesian localization of CNV candidates in WGS data within minutes.

John WiedenhöftAlex CaganRimma KozhemyakinaRimma GulevichAlexander Schliep
Published in: Algorithms for molecular biology : AMB (2019)
Using this approach, we discover several CNV candidates in two rat populations divergently selected for tame and aggressive behavior, consistent with earlier results concerning the domestication syndrome as well as experimental observations. Computationally, we observe a 29.5-fold decrease in memory, an average 5.8-fold speedup, as well as a 191-fold decrease in minor page faults. We also observe that metrics varied greatly in the old implementation, but not the new one. We conjecture that this is due to the better compression scheme. The fully Bayesian segmentation of the entire WGS data set required 3.5 min and 1.24 GB of memory, and can hence be performed on a commodity laptop.
Keyphrases
  • electronic health record
  • working memory
  • big data
  • primary care
  • healthcare
  • deep learning
  • oxidative stress
  • case report