SciERC

Dataset Information
Modalities
Texts
Languages
English
Introduced
2018
License
Unknown
Homepage

Overview

SciERC dataset is a collection of 500 scientific abstract annotated with scientific entities, their relations, and coreference clusters. The abstracts are taken from 12 AI conference/workshop proceedings in four AI communities, from the Semantic Scholar Corpus. SciERC extends previous datasets in scientific articles SemEval 2017 Task 10 and SemEval 2018 Task 7 by extending entity types, relation types, relation coverage, and adding cross-sentence relations using coreference links.

Source: http://nlp.cs.washington.edu/sciIE/
Image Source: http://nlp.cs.washington.edu/sciIE/

Variants: SciERC, sciERC-sent

Associated Benchmarks

This dataset is used in 3 benchmarks:

Recent Benchmark Submissions

Task Model Paper Date
Continual Pretraining DAS Continual Pre-training of Language Models 2023-02-07
Relation Extraction PFN A Partition Filter Network for … 2021-08-27
Named Entity Recognition (NER) RDANER A Robust and Domain-Adaptive Approach … 2021-01-02
Named Entity Recognition (NER) Ours: cross-sentence A Frustratingly Easy Approach for … 2020-10-24
Named Entity Recognition (NER) SpERT Span-based Joint Entity and Relation … 2019-09-17
Relation Extraction SciBERT (SciVocab) SciBERT: A Pretrained Language Model … 2019-03-26
Named Entity Recognition (NER) SciBERT (SciVocab) SciBERT: A Pretrained Language Model … 2019-03-26
Named Entity Recognition (NER) SciBERT (Base Vocab) SciBERT: A Pretrained Language Model … 2019-03-26
Relation Extraction SciBERT (Base Vocab) SciBERT: A Pretrained Language Model … 2019-03-26
Named Entity Recognition (NER) SCIIE Multi-Task Identification of Entities, Relations, … 2018-08-29

Research Papers

Recent papers with results on this dataset: