Low Entropy Sub-Networks Prevent the Integration of Metabolomic and Transcriptomic Data.
Krzysztof GogolewskiMarcin KosteckiAnna GambinPublished in: Entropy (Basel, Switzerland) (2020)
The constantly and rapidly increasing amount of the biological data gained from many different high-throughput experiments opens up new possibilities for data- and model-driven inference. Yet, alongside, emerges a problem of risks related to data integration techniques. The latter are not so widely taken account of. Especially, the approaches based on the flux balance analysis (FBA) are sensitive to the structure of a metabolic network for which the low-entropy clusters can prevent the inference from the activity of the metabolic reactions. In the following article, we set forth problems that may arise during the integration of metabolomic data with gene expression datasets. We analyze common pitfalls, provide their possible solutions, and exemplify them by a case study of the renal cell carcinoma (RCC). Using the proposed approach we provide a metabolic description of the known morphological RCC subtypes and suggest a possible existence of the poor-prognosis cluster of patients, which are commonly characterized by the low activity of the drug transporting enzymes crucial in the chemotherapy. This discovery suits and extends the already known poor-prognosis characteristics of RCC. Finally, the goal of this work is also to point out the problem that arises from the integration of high-throughput data with the inherently nonuniform, manually curated low-throughput data. In such cases, the over-represented information may potentially overshadow the non-trivial discoveries.
Keyphrases
- poor prognosis
- electronic health record
- high throughput
- renal cell carcinoma
- gene expression
- big data
- long non coding rna
- single cell
- end stage renal disease
- mental health
- chronic kidney disease
- emergency department
- dna methylation
- artificial intelligence
- squamous cell carcinoma
- rna seq
- newly diagnosed
- machine learning
- data analysis
- climate change
- peritoneal dialysis
- rectal cancer