LogiQA consists of 8,678 QA instances, covering multiple types of deductive reasoning. Results show that state-of-the-art neural models perform by far worse than human ceiling. The dataset can also serve as a benchmark for reinvestigating logical AI under the deep learning NLP setting.
Source: LogiQA: A Challenge Dataset for Machine Reading Comprehension with Logical Reasoning
Image Source: https://arxiv.org/pdf/2007.08124v1.pdf
Variants: LogiQA
This dataset is used in 1 benchmark:
No recent benchmark submissions available for this dataset.
No papers with results on this dataset found.