LSOIE

Large-Scale dataset for Supervised Open Information Extraction

Dataset Information
Introduced
2021
License
Unknown
Homepage

Overview

LSOIE is a large-scale OpenIE data converted from QA-SRL 2.0 in two domains, i.e., Wikipedia and Science. It is 20 times larger than the next largest human-annotated OpenIE data, and thus is reliable for fair evaluation. LSOIE provides n-ary OpenIE annotations and gold tuples are in the 〈ARG0, Relation, ARG1, . . . , ARGn〉 format. The dataset has two subsets ... namely LSOIE-wiki and LSOIE-sci, for comprehensive evaluation. LSOIE-wiki has 24,251 sentences and LSOIE-sci has 47,919 sentences.

Source: https://arxiv.org/pdf/2212.02068v1.pdf (section 5)

Variants: LSOIE, LSOIE-wiki

Associated Benchmarks

This dataset is used in 1 benchmark:

Recent Benchmark Submissions

Task Model Paper Date
Open Information Extraction DetIELSOIE DetIE: Multilingual Open Information Extraction … 2022-06-24
Open Information Extraction CIGL-OIE DetIE: Multilingual Open Information Extraction … 2022-06-24
Open Information Extraction DetIELSOIE + IGL-CA DetIE: Multilingual Open Information Extraction … 2022-06-24
Open Information Extraction DetIEIMoJIE DetIE: Multilingual Open Information Extraction … 2022-06-24
Open Information Extraction OpenIE4 DetIE: Multilingual Open Information Extraction … 2022-06-24
Open Information Extraction OpenIE6 (CIGL-OIE + IGL-CA) DetIE: Multilingual Open Information Extraction … 2022-06-24
Open Information Extraction OpenIE5 DetIE: Multilingual Open Information Extraction … 2022-06-24
Open Information Extraction DetIEIMoJIE (ours) + IGL-CA DetIE: Multilingual Open Information Extraction … 2022-06-24
Open Information Extraction OllIE Mausam et al. (2012) DetIE: Multilingual Open Information Extraction … 2022-06-24

Research Papers

Recent papers with results on this dataset: