Login / Signup

Genome maps across 26 human populations reveal population-specific patterns of structural variation.

Michal Levy-SakinSteven PastorYulia MostovoyLe LiAlden K Y LeungJennifer McCaffreyEleanor YoungErnest T LamAlex R HastieKaren H Y WongClaire Yik-Lok ChungWalfred MaJustin SibertRamakrishnan RajagopalanNana JinEugene Y C ChowCatherine ChuAnnie PoonChin LinAhmed NaguibWei-Ping WangHan CaoTing-Fung ChanKevin Y YipMing XiaoPui-Yan Kwok
Published in: Nature communications (2019)
Large structural variants (SVs) in the human genome are difficult to detect and study by conventional sequencing technologies. With long-range genome analysis platforms, such as optical mapping, one can identify large SVs (>2 kb) across the genome in one experiment. Analyzing optical genome maps of 154 individuals from the 26 populations sequenced in the 1000 Genomes Project, we find that phylogenetic population patterns of large SVs are similar to those of single nucleotide variations in 86% of the human genome, while ~2% of the genome has high structural complexity. We are able to characterize SVs in many intractable regions of the genome, including segmental duplications and subtelomeric, pericentromeric, and acrocentric areas. In addition, we discover ~60 Mb of non-redundant genome content missing in the reference genome sequence assembly. Our results highlight the need for a comprehensive set of alternate haplotypes from different populations to represent SV patterns in the genome.
Keyphrases
  • genome wide
  • endothelial cells
  • high resolution
  • dna methylation
  • single cell
  • induced pluripotent stem cells
  • mass spectrometry
  • pluripotent stem cells