Evaluating and Enhancing Large Language Models' Performance in Domain-Specific Medicine: Development and Usability Study With DocOA.
Xi ChenLi WangMingke YouWeiZhi LiuYu FuJie XuShaoting ZhangGang ChenKang LiJian LiPublished in: Journal of medical Internet research (2024)
This study introduces a novel benchmark framework that assesses the domain-specific abilities of LLMs in multiple aspects, highlights the limitations of generalized LLMs in clinical contexts, and demonstrates the potential of tailored approaches for developing domain-specific medical LLMs.