Clustering rare diseases within an ontology-enriched knowledge graph.
Jaleal S SanjakQian ZhuEwy A MathPublished in: bioRxiv : the preprint server for biology (2023)
Our study lays out a method for clustering rare diseases using the graph node embeddings. We develop an easy to maintain pipeline that can be updated when new data on rare diseases emerges. The embeddings themselves can be paired with other representation learning methods for other data types, such as drugs, to address other predictive modeling problems. Detailed subnetwork analysis and in-depth review of individual clusters may lead to translatable findings. Future work will focus on incorporation of additional data sources, with a particular focus on common disease data.