Replacing non-biomedical concepts improves embedding of biomedical concepts.
Enock Niyonkuru, Mauricio Soto Gomez, Elena Casiraghi, Stephan Antogiovanni, Hannah Blau, Justin T Reese, Giorgio Valentini, Peter Nick Robinson
Published in: bioRxiv : the preprint server for biology (2024)
This pilot study shows that replacing non-biomedical words with synonyms tends to improve the quality of biomedical concept embeddings produced by the Word2Vec algorithm. We have implemented our approach in a freely available Python package (https://github.com/TheJacksonLaboratory/wn2vec).
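
The general idea of synonym replacement as a preprocessing step can be illustrated with a minimal sketch. This is not the wn2vec implementation or its API: the function names, the toy synonym sets, the "protected" list of biomedical terms, and the rule of picking the most frequent synonym as the representative are all assumptions made for illustration only.

```python
from collections import Counter


def build_canonical_map(synonym_sets, corpus_tokens):
    """Map each word in a synonym set to one representative word.

    Assumption for this sketch: the representative is the member that
    occurs most often in the corpus (ties broken alphabetically).
    """
    counts = Counter(corpus_tokens)
    canonical = {}
    for syn_set in synonym_sets:
        representative = min(syn_set, key=lambda w: (-counts[w], w))
        for word in syn_set:
            canonical[word] = representative
    return canonical


def replace_synonyms(tokens, canonical, protected):
    """Replace non-protected tokens with their representative synonym."""
    return [t if t in protected else canonical.get(t, t) for t in tokens]


# Hypothetical toy data: general-language synonym sets and a set of
# biomedical concepts that must never be rewritten.
synonym_sets = [{"show", "demonstrate", "exhibit"}, {"big", "large"}]
protected = {"fibrosis", "collagen"}
corpus = [
    "patients exhibit large fibrosis".split(),
    "biopsies show large collagen deposits".split(),
    "samples show big fibrosis".split(),
]

all_tokens = [token for sentence in corpus for token in sentence]
canonical = build_canonical_map(synonym_sets, all_tokens)
normalized = [replace_synonyms(s, canonical, protected) for s in corpus]

for sentence in normalized:
    print(" ".join(sentence))
```

After this step the general-language vocabulary is smaller, so each remaining word appears in more contexts, while the biomedical terms are left untouched. The normalized sentences could then be passed to any Word2Vec implementation in place of the raw text.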