XCOPA

Dataset Information
Modalities
Texts
License
Unknown
Homepage

Overview

The Cross-lingual Choice of Plausible Alternatives (XCOPA) dataset is a benchmark to evaluate the ability of machine learning models to transfer commonsense reasoning across languages. The dataset is the translation and reannotation of the English COPA (Roemmele et al. 2011) and covers 11 languages from 11 families and several areas around the globe. The dataset is challenging as it requires both the command of world knowledge and the ability to generalise to new languages.

Source: https://github.com/cambridgeltl/xcopa

Variants: XCOPA

Associated Benchmarks

This dataset is used in 1 benchmark:

Recent Benchmark Submissions

Task Model Paper Date
Cross-Lingual Transfer PaLM 2 (few-shot) PaLM 2 Technical Report 2023-05-17
Cross-Lingual Transfer mT0-13B Crosslingual Generalization through Multitask Finetuning 2022-11-03
Cross-Lingual Transfer BLOOMZ Crosslingual Generalization through Multitask Finetuning 2022-11-03
Cross-Lingual Transfer mGPT mGPT: Few-Shot Learners Go Multilingual 2022-04-15
Cross-Lingual Transfer RoBERTa Large (translate test) XCOPA: A Multilingual Dataset for … 2020-05-01
Cross-Lingual Transfer MAD-X Base MAD-X: An Adapter-Based Framework for … 2020-04-30

Research Papers

Recent papers with results on this dataset: