WinoGrande

Dataset Information
Modalities
Texts
License
Homepage

Overview

WinoGrande is a large-scale dataset of 44k problems, inspired by the original WSC design, but adjusted to improve both the scale and the hardness of the dataset. The key steps of the dataset construction consist of (1) a carefully designed crowdsourcing procedure, followed by (2) systematic bias reduction using a novel AfLite algorithm that generalizes human-detectable word associations to machine-detectable embedding associations.

Source: WinoGrande: An Adversarial Winograd Schema Challenge at Scale
Image Source: https://winogrande.allenai.org/

Variants: WinoGrande, Winogrande, Winogrande (5-shot), Winogrande TR v0.2, Winogrande TR

Associated Benchmarks

This dataset is used in 4 benchmarks:

Recent Benchmark Submissions

Task Model Paper Date
parameter-efficient fine-tuning LLaMA2-7b GIFT-SW: Gaussian noise Injected Fine-Tuning … 2024-08-27
Common Sense Reasoning LLaMA3 8B+MoSLoRA Mixture-of-Subspaces in Low-Rank Adaptation 2024-06-16
Common Sense Reasoning LLaMA-3 8B + MixLoRA MixLoRA: Enhancing Large Language Models … 2024-04-22
Common Sense Reasoning LLaMA-2 13B + MixLoRA MixLoRA: Enhancing Large Language Models … 2024-04-22
Common Sense Reasoning LLaMA-2 7B + MixLoRA MixLoRA: Enhancing Large Language Models … 2024-04-22
Common Sense Reasoning Branch-Train-MiX 4x7B (sampling top-1 expert) Branch-Train-MiX: Mixing Expert LLMs into … 2024-03-12
parameter-efficient fine-tuning LLaMA2-7b DoRA: Weight-Decomposed Low-Rank Adaptation 2024-02-14
Common Sense Reasoning Mixtral 8x7B (0-shot) Mixtral of Experts 2024-01-08
Common Sense Reasoning Mistral 7B (0-shot) Mixtral of Experts 2024-01-08
Common Sense Reasoning Camelidae-8×34B Parameter-Efficient Sparsity Crafting from Dense … 2024-01-05
Common Sense Reasoning Mistral 7B (0-shot) Mistral 7B 2023-10-10
Common Sense Reasoning phi-1.5-web 1.3B (zero-shot) Textbooks Are All You Need … 2023-09-11
Common Sense Reasoning T0-3B (CoT fine-tuned) The CoT Collection: Improving Zero-shot … 2023-05-23
Common Sense Reasoning PaLM 2-L (1-shot) PaLM 2 Technical Report 2023-05-17
Common Sense Reasoning PaLM 2-S (1-shot) PaLM 2 Technical Report 2023-05-17
Common Sense Reasoning PaLM 2-M (1-shot) PaLM 2 Technical Report 2023-05-17
Common Sense Reasoning LaMini-GPT 1.5B LaMini-LM: A Diverse Herd of … 2023-04-27
Common Sense Reasoning GPT-2-XL 1.5B LaMini-LM: A Diverse Herd of … 2023-04-27
Common Sense Reasoning LaMini-F-T5 783M LaMini-LM: A Diverse Herd of … 2023-04-27
Common Sense Reasoning T5-Large 738M LaMini-LM: A Diverse Herd of … 2023-04-27

Research Papers

Recent papers with results on this dataset: