STREUSLE

Dataset Information
Modalities
Texts
Languages
English
Introduced
2015
License
CC-BY-SA 4.0
Homepage

Overview

STREUSLE stands for Supersense-Tagged Repository of English with a Unified Semantics for Lexical Expressions. The text is from the web reviews portion of the English Web Treebank [9]. STREUSLE incorporates comprehensive annotations of multiword expressions (MWEs) [1] and semantic supersenses for lexical expressions. The supersense labels apply to single- and multiword noun and verb expressions, as described in [2], and prepositional/possessive expressions, as described in [3, 4, 5, 6, 7, 8]. Lexical expressions also feature a lexical category label indicating its holistic grammatical status; for verbal multiword expressions, these labels incorporate categories from the PARSEME 1.1 guidelines [15]. For each token, these pieces of information are concatenated together into a lextag: a sentence's words and their lextags are sufficient to recover lexical categories, supersenses, and multiword expressions [8].

Variants: STREUSLE

Associated Benchmarks

This dataset is used in 1 benchmark:

Recent Benchmark Submissions

Task Model Paper Date
Natural Language Understanding BERT (pred POS/lemmas) Lexical Semantic Recognition 2020-04-30
Natural Language Understanding BERT (none) Lexical Semantic Recognition 2020-04-30
Natural Language Understanding BERT (gold POS/lemmas) Lexical Semantic Recognition 2020-04-30
Natural Language Understanding GloVe (gold POS/lemmas) Lexical Semantic Recognition 2020-04-30
Natural Language Understanding GloVe (none) Lexical Semantic Recognition 2020-04-30
Natural Language Understanding GloVe (pred POS/lemmas) Lexical Semantic Recognition 2020-04-30
Natural Language Understanding BiLSTM + MLP (gold syntax) Comprehensive Supersense Disambiguation of English … 2018-05-13
Natural Language Understanding SVM (feature-rich, gold syntax) Comprehensive Supersense Disambiguation of English … 2018-05-13
Natural Language Understanding SVM (feature-rich, auto syntax) Comprehensive Supersense Disambiguation of English … 2018-05-13
Natural Language Understanding BiLSTM + MLP (auto syntax) Comprehensive Supersense Disambiguation of English … 2018-05-13

Research Papers

Recent papers with results on this dataset: