Adapting Bidirectional Encoder Representations from Transformers (BERT) to Assess Clinical Semantic Textual Similarity: Algorithm Development and Validation Study.
Klaus Kades, Jan Sellner, Gregor Koehler, Peter M Full, T Y Emmy Lai, Jens Kleesiek, Klaus H Maier-Hein
Published in: JMIR Medical Informatics (2021)
We found that a graph-based similarity approach has the potential to extrapolate domain-specific knowledge to unseen sentences. We also observed that it is easy to obtain misleading results on the test dataset, especially when the distribution of the data samples differs between the training and test datasets.