Cross-lingual Natural Language Processing on Limited Annotated Case/Radiology Reports in English and Japanese: Insights from the Real-MedNLP Workshop.

Shuntaro YadaYuta NakamuraShoko WakamiyaEiji Aramaki

Published in: Methods of information in medicine (2024)

Most systems adopt medical-domain-specific pre-trained language models using data augmentation methods. Despite the challenge of limited corpus size in Tasks 1 and 2, recent approaches are promising because the partial match scores reached approximately 0.8-0.9 F1-scores. Task 3 applications revealed that the different availabilities of external language resources affected the performance per language.

Keyphrases

autism spectrum disorder
healthcare
working memory
emergency department
artificial intelligence
electronic health record
big data
soft tissue
adverse drug
drug induced