KinFams: De-Novo Classification of Protein Kinases Using CATH Functional Units.
Tolulope AdeyeluNicola BordinVaishali P WamanMarta SadlejIan SillitoeAurelio A Moya-GarcíaChristine A OrengoPublished in: Biomolecules (2023)
Protein kinases are important targets for treating human disorders, and they are the second most targeted families after G-protein coupled receptors. Several resources provide classification of kinases into evolutionary families (based on sequence homology); however, very few systematically classify functional families (FunFams) comprising evolutionary relatives that share similar functional properties. We have developed the FunFam-MARC (Multidomain ARchitecture-based Clustering) protocol, which uses multi-domain architectures of protein kinases and specificity-determining residues for functional family classification. FunFam-MARC predicts 2210 kinase functional families (KinFams), which have increased functional coherence, in terms of EC annotations, compared to the widely used KinBase classification. Our protocol provides a comprehensive classification for kinase sequences from >10,000 organisms. We associate human KinFams with diseases and drugs and identify 28 druggable human KinFams, i.e., enriched in clinically approved drugs. Since relatives in the same druggable KinFam tend to be structurally conserved, including the drug-binding site, these KinFams may be valuable for shortlisting therapeutic targets. Information on the human KinFams and associated 3D structures from AlphaFold2 are provided via our CATH FTP website and Zenodo. This gives the domain structure representative of each KinFam together with information on any drug compounds available. For 32% of the KinFams, we provide information on highly conserved residue sites that may be associated with specificity.
Keyphrases
- endothelial cells
- machine learning
- deep learning
- induced pluripotent stem cells
- pluripotent stem cells
- randomized controlled trial
- amino acid
- transcription factor
- emergency department
- health information
- high resolution
- mass spectrometry
- gene expression
- protein protein
- dna methylation
- drug induced
- adverse drug
- social media