Login / Signup

Comprehensive variation discovery and recovery of missing sequence in the pig genome using multiple de novo assemblies.

Mingzhou LiLei ChenShilin TianYu LinQianzi TangXuming ZhouDiyan LiCarol K L YeungTiandong CheLong JinYuhua FuJideng MaXun WangAnan JiangJing LanQi PanYingkai LiuZonggang LuoZongyi GuoHaifeng LiuLi ZhuSurong ShuaiGuoqing TangJiugang ZhaoYanzhi JiangLin BaiShunhua ZhangMiaomiao MaiChangchun LiDawei WangYiren GuGuosong WangHongfeng LuYan LiHaihao ZhuZongwen LiMing LiVadim N GladyshevZhi JiangShuhong ZhaoJinyong WangRuiqiang LiXuewei Li
Published in: Genome research (2016)
Uncovering genetic variation through resequencing is limited by the fact that only sequences with similarity to the reference genome are examined. Reference genomes are often incomplete and cannot represent the full range of genetic diversity as a result of geographical divergence and independent demographic events. To more comprehensively characterize genetic variation of pigs (Sus scrofa), we generated de novo assemblies of nine geographically and phenotypically representative pigs from Eurasia. By comparing them to the reference pig assembly, we uncovered a substantial number of novel SNPs and structural variants, as well as 137.02-Mb sequences harboring 1737 protein-coding genes that were absent in the reference assembly, revealing variants left by selection. Our results illustrate the power of whole-genome de novo sequencing relative to resequencing and provide valuable genetic resources that enable effective use of pigs in both agricultural production and biomedical research.
Keyphrases
  • genetic diversity
  • genome wide
  • copy number
  • dna methylation
  • risk assessment
  • heavy metals
  • gene expression
  • single cell
  • high throughput
  • binding protein
  • protein protein
  • genome wide identification
  • genome wide analysis