CoNLL-2009

Dataset Information
Introduced
2012
License
Unknown
Homepage

Overview

The task builds on the CoNLL-2008 task and extends it to multiple languages. The core of the task is to predict syntactic and semantic dependencies and their labeling. Data is provided for both statistical training and evaluation, which extract these labeled dependencies from manually annotated treebanks such as the Penn Treebank for English, the Prague Dependency Treebank for Czech and similar treebanks for Catalan, Chinese, German, Japanese and Spanish languages, enriched with semantic relations (such as those captured in the Prop/Nombank and similar resources). Great effort has been devoted to provide the participants with a common and relatively simple data representation for all the languages, similar to the last year's English data.

Variants: CoNLL-2009

Associated Benchmarks

This dataset is used in 2 benchmarks:

Recent Benchmark Submissions

Task Model Paper Date
Semantic Role Labeling Ours (High-Order model) End-to-end Semantic Role Labeling with … 2021-01-02
Dependency Parsing CRFPar Efficient Second-Order TreeCRF for Neural … 2020-05-03
Dependency Parsing Biaffine Parser Deep Biaffine Attention for Neural … 2016-11-06

Research Papers

Recent papers with results on this dataset: