Rapid and ongoing evolution of repetitive sequence structures in human centromeres.

Yuta Suzuki, Eugene W Myers, Shinichi Morishita
Published in: Science advances (2020)
Our understanding of centromere sequence variation across human populations is limited by its extremely long nested repeat structures called higher-order repeats that are challenging to sequence. Here, we analyzed chromosomes 11, 17, and X using long-read sequencing data for 36 individuals from diverse populations including a Han Chinese trio and 21 Japanese. We revealed substantial structural diversity with many previously unidentified variant higher-order repeats specific to individuals characterizing rapid, haplotype-specific evolution of human centromeric arrays, while frequent single-nucleotide variants are largely conserved. We found a characteristic pattern shared among prevalent variants in human and chimpanzee. Our findings pave the way for studying sequence evolution in human and primate centromeres.