Transparent Exploration of Machine Learning for Biomarker Discovery from Proteomics and Omics Data.
Furkan M TorunSebastian Virreira WinterSophia DollFelix M RieseArtem VorobyevJohannes B Mueller-ReifPhilipp E GeyerMaximillian T StraussPublished in: Journal of proteome research (2022)
Biomarkers are of central importance for assessing the health state and to guide medical interventions and their efficacy; still, they are lacking for most diseases. Mass spectrometry (MS)-based proteomics is a powerful technology for biomarker discovery but requires sophisticated bioinformatics to identify robust patterns. Machine learning (ML) has become a promising tool for this purpose. However, it is sometimes applied in an opaque manner and generally requires specialized knowledge. To enable easy access to ML for biomarker discovery without any programming or bioinformatics skills, we developed "OmicLearn" (http://OmicLearn.org), an open-source browser-based ML tool using the latest advances in the Python ML ecosystem. Data matrices from omics experiments are easily uploaded to an online or a locally installed web server. OmicLearn enables rapid exploration of the suitability of various ML algorithms for the experimental data sets. It fosters open science via transparent assessment of state-of-the-art algorithms in a standardized format for proteomics and other omics sciences.
Keyphrases
- machine learning
- mass spectrometry
- big data
- healthcare
- small molecule
- electronic health record
- artificial intelligence
- liquid chromatography
- single cell
- high throughput
- public health
- deep learning
- gas chromatography
- capillary electrophoresis
- high performance liquid chromatography
- climate change
- minimally invasive
- label free
- physical activity
- ms ms
- data analysis
- tandem mass spectrometry
- simultaneous determination
- clinical evaluation