Using Proteomics Data to Identify Personalized Treatments in Multiple Myeloma: A Machine Learning Approach.
Angeliki KatsenouRoisin O'FarrellPaul DowlingCaroline A HeckmanPeter O'GormanDespina BazouPublished in: International journal of molecular sciences (2023)
This paper describes a machine learning (ML) decision support system to provide a list of chemotherapeutics that individual multiple myeloma (MM) patients are sensitive/resistant to, based on their proteomic profile. The methodology used in this study involved understanding the parameter space and selecting the dominant features (proteomics data), identifying patterns of proteomic profiles and their association to the recommended treatments, and defining the decision support system of personalized treatment as a classification problem. During the data analysis, we compared several ML algorithms, such as linear regression, Random Forest, and support vector machines, to classify patients as sensitive/resistant to therapeutics. A further analysis examined data-balancing techniques that emerged due to the small cohort size. The results suggest that utilizing proteomics data is a promising approach for identifying effective treatment options for patients with MM (reaching on average an accuracy of 81%). Although this pilot study was limited by the small patient cohort (39 patients), which restricted the training and validation of the explored ML solutions to identify complex associations between proteins, it holds great promise for developing personalized anti-MM treatments using ML approaches.
Keyphrases
- machine learning
- end stage renal disease
- data analysis
- big data
- chronic kidney disease
- ejection fraction
- newly diagnosed
- mass spectrometry
- multiple myeloma
- prognostic factors
- electronic health record
- peritoneal dialysis
- deep learning
- artificial intelligence
- patient reported outcomes
- smoking cessation
- replacement therapy