WiC

Words in Context

Dataset Information
Modalities
Texts
Languages
English
License
Homepage

Overview

WiC is a benchmark for the evaluation of context-sensitive word embeddings. WiC is framed as a binary classification task. Each instance in WiC has a target word w, either a verb or a noun, for which two contexts are provided. Each of these contexts triggers a specific meaning of w. The task is to identify if the occurrences of w in the two contexts correspond to the same meaning or not. In fact, the dataset can also be viewed as an application of Word Sense Disambiguation in practise.

Source: WiC: the Word-in-Context Dataset for Evaluating Context-Sensitive Meaning Representations

Variants: WiC

Associated Benchmarks

This dataset is used in 2 benchmarks:

Recent Benchmark Submissions

Task Model Paper Date
Classification OPT-1.3B Achieving Dimension-Free Communication in Federated … 2024-05-24
Classification OPT-125M Achieving Dimension-Free Communication in Federated … 2024-05-24

Research Papers

Recent papers with results on this dataset: