Login / Signup

Clustering rare diseases within an ontology-enriched knowledge graph.

Jaleal S SanjakQian ZhuEwy A Math
Published in: bioRxiv : the preprint server for biology (2023)
Our study lays out a method for clustering rare diseases using the graph node embeddings. We develop an easy to maintain pipeline that can be updated when new data on rare diseases emerges. The embeddings themselves can be paired with other representation learning methods for other data types, such as drugs, to address other predictive modeling problems. Detailed subnetwork analysis and in-depth review of individual clusters may lead to translatable findings. Future work will focus on incorporation of additional data sources, with a particular focus on common disease data.
Keyphrases
  • electronic health record
  • big data
  • single cell
  • lymph node
  • rna seq
  • data analysis
  • drinking water
  • machine learning
  • artificial intelligence
  • deep learning