TBGA: a large-scale Gene-Disease Association dataset for Biomedical Relation Extraction.
Stefano MarchesinGianmaria SilvelloPublished in: BMC bioinformatics (2022)
TBGA is amongst the largest datasets for GDA extraction. We have evaluated state-of-the-art models for GDA extraction on TBGA, showing that it is a challenging and well-suited dataset for the task. We made the dataset publicly available to foster the development of state-of-the-art BioRE models for GDA extraction.