PreCo

Dataset Information
Modalities
Texts
Languages
English
Homepage

Overview

A large-scale English dataset for coreference resolution. The dataset is designed to embody the core challenges in coreference, such as entity representation, by alleviating the challenge of low overlap between training and test sets and enabling separated analysis of mention detection and mention clustering.

Source: PreCo: A Large-scale Dataset in Preschool Vocabulary for Coreference Resolution

Variants: PreCo

Associated Benchmarks

This dataset is used in 1 benchmark:

Recent Benchmark Submissions

Task Model Paper Date
Coreference Resolution Maverick_incr Maverick: Efficient and Accurate Coreference … 2024-07-31
Coreference Resolution longdoc S (OntoNotes + PreCo + LitBank) On Generalization in Coreference Resolution 2021-09-20

Research Papers

Recent papers with results on this dataset: