Login / Signup

A comparison on predicting functional impact of genomic variants.

Dong WangJie LiYadong WangEdwin Wang
Published in: NAR genomics and bioinformatics (2022)
Single-nucleotide polymorphism (SNPs) may cause the diverse functional impact on RNA or protein changing genotype and phenotype, which may lead to common or complex diseases like cancers. Accurate prediction of the functional impact of SNPs is crucial to discover the 'influential' (deleterious, pathogenic, disease-causing, and predisposing) variants from massive background polymorphisms in the human genome. Increasing computational methods have been developed to predict the functional impact of variants. However, predictive performances of these computational methods on massive genomic variants are still unclear. In this regard, we systematically evaluated 14 important computational methods including specific methods for one type of variant and general methods for multiple types of variants from several aspects; none of these methods achieved excellent (AUC ≥ 0.9) performance in both data sets. CADD and REVEL achieved excellent performance on multiple types of variants and missense variants, respectively. This comparison aims to assist researchers and clinicians to select appropriate methods or develop better predictive methods.
Keyphrases
  • copy number
  • genome wide
  • endothelial cells
  • machine learning
  • artificial intelligence
  • young adults
  • deep learning
  • small molecule
  • intellectual disability
  • electronic health record