Login / Signup

Contiguously hydrophobic sequences are functionally significant throughout the human exome.

Ruchi LohiaMatthew E B HansenGrace Brannigan
Published in: Proceedings of the National Academy of Sciences of the United States of America (2022)
Hydrophobic interactions have long been established as essential for stabilizing struc-tured proteins as well as drivers of aggregation, but the impact of hydrophobicity on thefunctional significance of sequence variants has rarely been considered in a genome-wide context. Here we test the role of hydrophobicity on functional impact across70,000 disease- and non–disease-associated single-nucleotide polymorphisms (SNPs),using enrichment of disease association as an indicator of functionality. We find thatfunctional impact is uncorrelated with hydrophobicity of the SNP itself and only weaklycorrelated with the average local hydrophobicity, but is strongly correlated with boththe size and minimum hydrophobicity of the contiguously hydrophobic sequence (or“blob”) that contains the SNP. Disease association is found to vary by more than sixfoldas a function of contiguous hydrophobicity parameters, suggesting utility as a prior foridentifying causal variation. We further find signatures of differential selective constrainton hydrophobic blobs and that SNPs splitting a long hydrophobic blob or joiningtwo short hydrophobic blobs are particularly likely to be disease associated. Trends arepreserved for both aggregating and nonaggregating proteins, indicating that the role ofcontiguous hydrophobicity extends well beyond aggregation risk.
Keyphrases
  • genome wide
  • ionic liquid
  • dna methylation
  • copy number
  • aqueous solution
  • endothelial cells
  • gene expression
  • breast cancer risk