Bc8

Bc8BioRED

Dataset Information
Languages
English
Introduced
2025
License
Unknown
Homepage

Overview

Bc8BioRED is built upon BioRED 2022 with the addition of directionality annotations. The training and development sets from the original 2022 BioRED corpus were combined and reused as the training set, while the test set was used as the development set. Furthermore, the 400 test abstracts from the BioCreative VIII were utilized for evaluation. Bc8BioRED encompasses seven types of entities and eight types of relationships. Each relationship annotation in the Bc8BioRED corpus is categorized by novelty to indicate whether the relationship represents a significant finding or previously known background knowledge. Initially, the BioRED 2022 corpus comprised 600 abstracts for RE system development, with an additional 400 abstracts annotated to enhance coverage of emerging topics. The dataset encompasses directionality annotations (subject/object roles) for each relation pair, resulting in 10,864 directionality annotations.

Variants: Bc8

Associated Benchmarks

This dataset is used in 1 benchmark:

Recent Benchmark Submissions

Task Model Paper Date
Document-level Relation Extraction BioRex+Directionality Enhancing Biomedical Relation Extraction with … 2025-01-23

Research Papers

Recent papers with results on this dataset: