A comparative study of pre-trained language models for named entity recognition in clinical trial eligibility criteria from multiple corpora.

Jianfu LiQiang WeiOmid GhiasvandMiao ChenVictor LobanovChunhua WengHua Xu

Published in: BMC medical informatics and decision making (2022)

Findings from this study not only demonstrate the importance of contextual embeddings trained from domain-specific corpora, but also shed lights on the benefits of leveraging multiple data sources for the challenging NER task in clinical trial eligibility criteria text.

Keyphrases

clinical trial
resistance training
phase ii
double blind
open label
study protocol
autism spectrum disorder
big data