Cmai: Predicting Antigen-Antibody Interactions from Massive Sequencing Data.
Bing SongKaiwen WangSaiyang NaJia YaoFarjana J FattahMitchell S von ItzsteinDonghan M YangJialiang LiuYaming XueChaoying LiangYuzhi GuoIndu RamanChengsong ZhuJonathan E DowellJade HomsiSawsan RashdanShengjie YangMary E GwinDavid HsiehchenYvonne Gloria-McCutchenPrithvi RajXiaochen BaiJun WangJose Conejo-GarciaYang XieDavid E GerberJunzhou HuangTao WangPublished in: bioRxiv : the preprint server for biology (2024)
The interaction between antigens and antibodies (B cell receptors, BCRs) is the key step underlying the function of the humoral immune system in various biological contexts. The capability to profile the landscape of antigen-binding affinity of a vast number of BCRs will provide a powerful tool to reveal novel insights at unprecedented levels and will yield powerful tools for translational development. However, current experimental approaches for profiling antibody-antigen interactions are costly and time-consuming, and can only achieve low-to-mid throughput. On the other hand, bioinformatics tools in the field of antibody informatics mostly focus on optimization of antibodies given known binding antigens, which is a very different research question and of limited scope. In this work, we developed an innovative Artificial Intelligence tool, Cmai, to address the prediction of the binding between antibodies and antigens that can be scaled to high-throughput sequencing data. Cmai achieved an AUROC of 0.91 in our validation cohort. We devised a biomarker metric based on the output from Cmai applied to high-throughput BCR sequencing data. We found that, during immune-related adverse events (irAEs) caused by immune-checkpoint inhibitor (ICI) treatment, the humoral immunity is preferentially responsive to intracellular antigens from the organs affected by the irAEs. In contrast, extracellular antigens on malignant tumor cells are inducing B cell infiltrations, and the infiltrating B cells have a greater tendency to co-localize with tumor cells expressing these antigens. We further found that the abundance of tumor antigen-targeting antibodies is predictive of ICI treatment response. Overall, Cmai and our biomarker approach filled in a gap that is not addressed by current antibody optimization works nor works such as AlphaFold3 that predict the structures of complexes of proteins that are known to bind.
Keyphrases
- big data
- artificial intelligence
- single cell
- dendritic cells
- electronic health record
- high throughput
- immune response
- machine learning
- deep learning
- high throughput sequencing
- magnetic resonance
- acute lymphoblastic leukemia
- computed tomography
- tyrosine kinase
- magnetic resonance imaging
- genome wide
- binding protein
- contrast enhanced
- antibiotic resistance genes