The Cross-lingual Choice of Plausible Alternatives (XCOPA) dataset is a benchmark to evaluate the ability of machine learning models to transfer commonsense reasoning across languages. The dataset is the translation and reannotation of the English COPA (Roemmele et al. 2011) and covers 11 languages from 11 families and several areas around the globe. The dataset is challenging as it requires both the command of world knowledge and the ability to generalise to new languages.
Variants: XCOPA
This dataset is used in 1 benchmark:
Task | Model | Paper | Date |
---|---|---|---|
Cross-Lingual Transfer | PaLM 2 (few-shot) | PaLM 2 Technical Report | 2023-05-17 |
Cross-Lingual Transfer | mT0-13B | Crosslingual Generalization through Multitask Finetuning | 2022-11-03 |
Cross-Lingual Transfer | BLOOMZ | Crosslingual Generalization through Multitask Finetuning | 2022-11-03 |
Cross-Lingual Transfer | mGPT | mGPT: Few-Shot Learners Go Multilingual | 2022-04-15 |
Cross-Lingual Transfer | RoBERTa Large (translate test) | XCOPA: A Multilingual Dataset for … | 2020-05-01 |
Cross-Lingual Transfer | MAD-X Base | MAD-X: An Adapter-Based Framework for … | 2020-04-30 |
Recent papers with results on this dataset: