Prediction of disease-associated functional variants in noncoding regions through a comprehensive analysis by integrating datasets and features.

Yu Lu, Yiming Wu, Yuan Liu, Yizhou Li, Runyu Jing, Menglong Li
Published in: Human Mutation (2021)
One of the greatest challenges in human genetics is deciphering the link between functional variants in noncoding sequences and the pathophysiology of complex diseases. To address this issue, many methods have been developed to distinguish functional single-nucleotide variants (SNVs) from neutral SNVs in noncoding regions. In this study, we integrated well-established features and commonly used datasets, merged them into large-scale datasets, and built a random forest model on them; the model yielded promising performance and outperformed some cutting-edge approaches. Our analyses of feature importance and data coverage also provide clues for future research on improving the prediction of functional noncoding SNVs.
Keyphrases
  • copy number
  • endothelial cells
  • machine learning
  • healthcare
  • climate change
  • electronic health record
  • genome wide
  • current status
  • induced pluripotent stem cells
  • affordable care act