Login / Signup

A database of 5305 healthy Korean individuals reveals genetic and clinical implications for an East Asian population.

Jeongeun LeeJean LeeSungwon JeonJeongha LeeInsu JangJin Ok YangSoojin ParkByungwook LeeJinwook ChoiByung-Ok ChoiHeon Yung GeeJaeseong OhIn-Jin JangSanghyuk LeeDaehyun BaekYoungil KohSung Soo YoonYoung-Joon KimJong-Hee ChaeWoong-Yang ParkJong Hwa BhakMurim Choi
Published in: Experimental & molecular medicine (2022)
Despite substantial advances in disease genetics, studies to date have largely focused on individuals of European descent. This limits further discoveries of novel functional genetic variants in other ethnic groups. To alleviate the paucity of East Asian population genome resources, we established the Korean Variant Archive 2 (KOVA 2), which is composed of 1896 whole-genome sequences and 3409 whole-exome sequences from healthy individuals of Korean ethnicity. This is the largest genome database from the ethnic Korean population to date, surpassing the 1909 Korean individuals deposited in gnomAD. The variants in KOVA 2 displayed all the known genetic features of those from previous genome databases, and we compiled data from Korean-specific runs of homozygosity, positively selected intervals, and structural variants. In doing so, we found loci, such as the loci of ADH1A/1B and UHRF1BP1, that are strongly selected in the Korean population relative to other East Asian populations. Our analysis of allele ages revealed a correlation between variant functionality and evolutionary age. The data can be browsed and downloaded from a public website ( https://www.kobic.re.kr/kova/ ). We anticipate that KOVA 2 will serve as a valuable resource for genetic studies involving East Asian populations.
Keyphrases
  • genome wide
  • copy number
  • dna methylation
  • healthcare
  • big data
  • electronic health record
  • emergency department
  • mental health
  • genetic diversity