AttABseq: an attention-based deep learning prediction method for antigen-antibody binding affinity changes based on protein sequences.
Ruofan JinQing YeJike WangZheng CaoDejun JiangTianyue WangYu KangWanting XuChang-Yu HsiehTing-Jun HouPublished in: Briefings in bioinformatics (2024)
The optimization of therapeutic antibodies through traditional techniques, such as candidate screening via hybridoma or phage display, is resource-intensive and time-consuming. In recent years, computational and artificial intelligence-based methods have been actively developed to accelerate and improve the development of therapeutic antibodies. In this study, we developed an end-to-end sequence-based deep learning model, termed AttABseq, for the predictions of the antigen-antibody binding affinity changes connected with antibody mutations. AttABseq is a highly efficient and generic attention-based model by utilizing diverse antigen-antibody complex sequences as the input to predict the binding affinity changes of residue mutations. The assessment on the three benchmark datasets illustrates that AttABseq is 120% more accurate than other sequence-based models in terms of the Pearson correlation coefficient between the predicted and experimental binding affinity changes. Moreover, AttABseq also either outperforms or competes favorably with the structure-based approaches. Furthermore, AttABseq consistently demonstrates robust predictive capabilities across a diverse array of conditions, underscoring its remarkable capacity for generalization across a wide spectrum of antigen-antibody complexes. It imposes no constraints on the quantity of altered residues, rendering it particularly applicable in scenarios where crystallographic structures remain unavailable. The attention-based interpretability analysis indicates that the causal effects of point mutations on antibody-antigen binding affinity changes can be visualized at the residue level, which might assist automated antibody sequence optimization. We believe that AttABseq provides a fiercely competitive answer to therapeutic antibody optimization.
Keyphrases
- deep learning
- artificial intelligence
- machine learning
- highly efficient
- binding protein
- dna binding
- working memory
- amino acid
- magnetic resonance
- magnetic resonance imaging
- high throughput
- transcription factor
- cystic fibrosis
- capillary electrophoresis
- pseudomonas aeruginosa
- small molecule
- diffusion weighted imaging
- genetic diversity