Semantic Multi-Classifier Systems Identify Predictive Processes in Heart Failure Models across Species.
Ludwig LausserLea SiegleWolfgang RottbauerDerk FrankSteffen JustHans Armin KestlerPublished in: Biomolecules (2018)
Genetic model organisms have the potential of removing blind spots from the underlying gene regulatory networks of human diseases. Allowing analyses under experimental conditions they complement the insights gained from observational data. An inevitable requirement for a successful trans-species transfer is an abstract but precise high-level characterization of experimental findings. In this work, we provide a large-scale analysis of seven weak contractility/heart failure genotypes of the model organism zebrafish which all share a weak contractility phenotype. In supervised classification experiments, we screen for discriminative patterns that distinguish between observable phenotypes (homozygous mutant individuals) as well as wild-type (homozygous wild-types) and carriers (heterozygous individuals). As the method of choice we use semantic multi-classifier systems, a knowledge-based approach which constructs hypotheses from a predefined vocabulary of high-level terms (e.g., Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways or Gene Ontology (GO) terms). Evaluating these models leads to a compact description of the underlying processes and guides the screening for new molecular markers of heart failure. Furthermore, we were able to independently corroborate the identified processes in Wistar rats.
Keyphrases
- heart failure
- wild type
- genome wide
- machine learning
- left ventricular
- endothelial cells
- cardiac resynchronization therapy
- atrial fibrillation
- healthcare
- acute heart failure
- copy number
- smooth muscle
- genetic diversity
- dna methylation
- high throughput
- big data
- electronic health record
- gene expression
- human health
- single cell
- artificial intelligence
- induced pluripotent stem cells
- network analysis