A large-scale English dataset for coreference resolution. The dataset is designed to embody the core challenges in coreference, such as entity representation, by alleviating the challenge of low overlap between training and test sets and enabling separated analysis of mention detection and mention clustering.
Source: PreCo: A Large-scale Dataset in Preschool Vocabulary for Coreference Resolution
Variants: PreCo
This dataset is used in 1 benchmark:
Task | Model | Paper | Date |
---|---|---|---|
Coreference Resolution | Maverick_incr | Maverick: Efficient and Accurate Coreference … | 2024-07-31 |
Coreference Resolution | longdoc S (OntoNotes + PreCo + LitBank) | On Generalization in Coreference Resolution | 2021-09-20 |
Recent papers with results on this dataset: