Login / Signup

SIFT missense predictions for genomes.

Robert VaserSwarnaseetha AdusumalliSim Ngak LengMile ŠikićPauline C Ng
Published in: Nature protocols (2015)
The SIFT (sorting intolerant from tolerant) algorithm helps bridge the gap between mutations and phenotypic variations by predicting whether an amino acid substitution is deleterious. SIFT has been used in disease, mutation and genetic studies, and a protocol for its use has been previously published with Nature Protocols. This updated protocol describes SIFT 4G (SIFT for genomes), which is a faster version of SIFT that enables practical computations on reference genomes. Users can get predictions for single-nucleotide variants from their organism of interest using the SIFT 4G annotator with SIFT 4G's precomputed databases. The scope of genomic predictions is expanded, with predictions available for more than 200 organisms. Users can also run the SIFT 4G algorithm themselves. SIFT predictions can be retrieved for 6.7 million variants in 4 min once the database has been downloaded. If precomputed predictions are not available, the SIFT 4G algorithm can compute predictions at a rate of 2.6 s per protein sequence. SIFT 4G is available from http://sift-dna.org/sift4g.
Keyphrases
  • machine learning
  • amino acid
  • deep learning
  • autism spectrum disorder
  • small molecule
  • dna methylation
  • genome wide
  • multidrug resistant
  • single molecule
  • cell free
  • circulating tumor
  • electronic health record