Taiyi: a bilingual fine-tuned large language model for diverse biomedical tasks.
Ling Luo, Jinzhong Ning, Yingwen Zhao, Zhijun Wang, Zeyuan Ding, Peng Chen, Weiru Fu, Qinyu Han, Guangtao Xu, Yunzhi Qiu, Dinghao Pan, Jiru Li, Hao Li, Wenduo Feng, Senbo Tu, Yuqi Liu, Zhihao Yang, Jian Wang, Yuanyuan Sun, Hongfei Lin
Published in: Journal of the American Medical Informatics Association : JAMIA (2024)
Leveraging rich, high-quality biomedical corpora and developing effective fine-tuning strategies can significantly improve the performance of LLMs within the biomedical domain. Taiyi demonstrates bilingual multitask capability through supervised fine-tuning. However, tasks that are not generative in nature, such as information extraction, remain challenging for LLM-based generative approaches, which still underperform conventional discriminative approaches built on smaller language models.