Leveraging base-pair mammalian constraint to understand genetic variation and human disease.

Patrick F Sullivan, Jennifer R S Meadows, Steven Gazal, BaDoi N Phan, Xue Li, Diane P Genereux, Michael X Dong, Matteo Bianchi, Gregory Andrews, Sharadha Sakthikumar, Jessika Nordin, Ananya Roy, Matthew J Christmas, Voichita D Marinescu, Chao Wang, Ola Wallerman, James R Xue, Shuyang Yao, Quan Sun, Jin P Szatkiewicz, Jia Wen, Laura M Huckins, Alyssa J Lawler, Kathleen C Keough, Zhili Zheng, Jian Zeng, Naomi R Wray, Yun Li, Jessica S Johnson, Jiawen Chen, Benedict Paten, Steven K Reilly, Graham M Hughes, Nishigandha Phalke, Katherine S Pollard, Andreas R Pfenning, Karin Forsberg-Nilsson, Elinor K Karlsson, Kerstin Lindblad-Toh
Published in: Science (New York, N.Y.) (2023)
Thousands of genomic regions have been associated with heritable human diseases, but attempts to elucidate biological mechanisms are impeded by an inability to discern which genomic positions are functionally important. Evolutionary constraint is a powerful predictor of function, agnostic to cell type or disease mechanism. Single-base phyloP scores from 240 mammals identified 3.3% of the human genome as significantly constrained and likely functional. We compared phyloP scores to genome annotation, association studies, copy-number variation, clinical genetics findings, and cancer data. Constrained positions are enriched for variants that explain common disease heritability more than other functional annotations. Our results improve variant annotation but also highlight that the regulatory landscape of the human genome still needs to be further explored and linked to disease.
Keyphrases
  • copy number
  • endothelial cells
  • genome wide
  • mitochondrial dna
  • induced pluripotent stem cells
  • pluripotent stem cells
  • squamous cell carcinoma
  • deep learning
  • gene expression
  • lymph node metastasis