Pathway-based subnetworks enable cross-disease biomarker discovery.
Syed HaiderCindy Q YaoVicky S SabineMichal GrzadkowskiVincent StimperMaud H W StarmansJianxin WangFrancis NguyenNathalie C MoonXihui LinCamilla DrakeCheryl A CrozierCassandra L BrookesCornelis J H van de VeldeAnnette HasenburgDirk G KiebackChristos J MarkopoulosLuc Y DirixCaroline SeynaeveDaniel W ReaArek KasprzykPhilippe LambinPietro Lio'John M S BartlettPaul C BoutrosPublished in: Nature communications (2018)
Biomarkers lie at the heart of precision medicine. Surprisingly, while rapid genomic profiling is becoming ubiquitous, the development of biomarkers usually involves the application of bespoke techniques that cannot be directly applied to other datasets. There is an urgent need for a systematic methodology to create biologically-interpretable molecular models that robustly predict key phenotypes. Here we present SIMMS (Subnetwork Integration for Multi-Modal Signatures): an algorithm that fragments pathways into functional modules and uses these to predict phenotypes. We apply SIMMS to multiple data types across five diseases, and in each it reproducibly identifies known and novel subtypes, and makes superior predictions to the best bespoke approaches. To demonstrate its ability on a new dataset, we profile 33 genes/nodes of the PI3K pathway in 1734 FFPE breast tumors and create a four-subnetwork prediction model. This model out-performs a clinically-validated molecular test in an independent cohort of 1742 patients. SIMMS is generic and enables systematic data integration for robust biomarker discovery.
Keyphrases
- genome wide
- small molecule
- end stage renal disease
- electronic health record
- ejection fraction
- newly diagnosed
- big data
- machine learning
- heart failure
- high throughput
- peritoneal dialysis
- single molecule
- single cell
- deep learning
- dna methylation
- gene expression
- copy number
- atrial fibrillation
- radiation therapy
- sentinel lymph node
- bioinformatics analysis
- data analysis