CODAH

COmmonsense Dataset Adversarially-authored by Humans

Dataset Information
Modalities
Texts
License
Unknown
Homepage

Overview

The COmmonsense Dataset Adversarially-authored by Humans (CODAH) is an evaluation set for commonsense question-answering in the sentence completion style of SWAG. As opposed to other automatically generated NLI datasets, CODAH is adversarially constructed by humans who can view feedback from a pre-trained model and use this information to design challenging commonsense questions. It contains 2801 questions in total, and uses 5-fold cross validation for evaluation.

Source: CODAH Dataset
Image Source: https://www.aclweb.org/anthology/W19-2008.pdf

Variants: CODAH

Associated Benchmarks

This dataset is used in 2 benchmarks:

Recent Benchmark Submissions

Task Model Paper Date
Question Answering G-DAUG-Combo + RoBERTa-Large Generative Data Augmentation for Commonsense … 2020-04-24
Question Answering BERT Large CODAH: An Adversarially Authored Question-Answer … 2019-04-08
Common Sense Reasoning BERT Large CODAH: An Adversarially Authored Question-Answer … 2019-04-08

Research Papers

Recent papers with results on this dataset: