AutoEpiCollect, a Novel Machine Learning-Based GUI Software for Vaccine Design: Application to Pan-Cancer Vaccine Design Targeting PIK3CA Neoantigens.
Madhav SamudralaSindhusri DhavejiKush SavsaniSivanesan DakshanamurthyPublished in: Bioengineering (Basel, Switzerland) (2024)
Previous epitope-based cancer vaccines have focused on analyzing a limited number of mutated epitopes and clinical variables preliminarily to experimental trials. As a result, relatively few positive clinical outcomes have been observed in epitope-based cancer vaccines. Further efforts are required to diversify the selection of mutated epitopes tailored to cancers with different genetic signatures. To address this, we developed the first version of AutoEpiCollect, a user-friendly GUI software, capable of generating safe and immunogenic epitopes from missense mutations in any oncogene of interest. This software incorporates a novel, machine learning-driven epitope ranking method, leveraging a probabilistic logistic regression model that is trained on experimental T-cell assay data. Users can freely download AutoEpiCollectGUI with its user guide for installing and running the software on GitHub. We used AutoEpiCollect to design a pan-cancer vaccine targeting missense mutations found in the proto-oncogene PIK3CA, which encodes the p110ɑ catalytic subunit of the PI3K kinase protein. We selected PIK3CA as our gene target due to its widespread prevalence as an oncokinase across various cancer types and its lack of presence as a gene target in clinical trials. After entering 49 distinct point mutations into AutoEpiCollect, we acquired 361 MHC Class I epitope/HLA pairs and 219 MHC Class II epitope/HLA pairs. From the 49 input point mutations, we identified MHC Class I epitopes targeting 34 of these mutations and MHC Class II epitopes targeting 11 mutations. Furthermore, to assess the potential impact of our pan-cancer vaccine, we employed PCOptim and PCOptim-CD to streamline our epitope list and attain optimized vaccine population coverage. We achieved a world population coverage of 98.09% for MHC Class I data and 81.81% for MHC Class II data. We used three of our predicted immunogenic epitopes to further construct 3D models of peptide-HLA and peptide-HLA-TCR complexes to analyze the epitope binding potential and TCR interactions. Future studies could aim to validate AutoEpiCollect's vaccine design in murine models affected by PIK3CA-mutated or other mutated tumor cells located in various tissue types. AutoEpiCollect streamlines the preclinical vaccine development process, saving time for thorough testing of vaccinations in experimental trials.
Keyphrases
- papillary thyroid
- machine learning
- squamous cell
- clinical trial
- electronic health record
- lymph node metastasis
- genome wide
- childhood cancer
- gene expression
- protein kinase
- monoclonal antibody
- copy number
- regulatory t cells
- healthcare
- risk factors
- data analysis
- high throughput
- cancer therapy
- stem cells
- autism spectrum disorder
- drug delivery
- dna methylation
- risk assessment
- young adults
- transcription factor
- amino acid
- tyrosine kinase
- body composition
- dendritic cells
- intellectual disability
- human health