Winoground

Dataset Information
Modalities: Images, Texts
Languages: English
Introduced: 2022
License: Unknown
Homepage

Overview

Winoground is a dataset for evaluating the ability of vision and language models to perform visio-linguistic compositional reasoning. Given two images and two captions, the goal is to match each caption to its image; crucially, both captions contain exactly the same set of words, only in a different order. The dataset was hand-curated by expert annotators and is labeled with fine-grained tags to assist in analyzing model performance.
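The matching task can be made concrete with a small sketch. It assumes a model exposes a similarity score for each caption-image pair, arranged here as a 2x2 table `s[c][i]` (caption `c`, image `i`), and it follows the text, image, and group scores as described in the original Winoground paper. The function names, the example captions, and the numeric scores are hypothetical and are not taken from the dataset.

```python
from collections import Counter
from typing import Dict, List


def same_words(caption_0: str, caption_1: str) -> bool:
    """Return True if both captions use the same multiset of words.

    Assumption: whitespace tokenization and case-insensitive comparison.
    """
    return Counter(caption_0.lower().split()) == Counter(caption_1.lower().split())


def winoground_scores(s: List[List[float]]) -> Dict[str, bool]:
    """Score one example from a 2x2 similarity table.

    s[c][i] is the (hypothetical) model similarity between caption c and
    image i, where caption 0 belongs to image 0 and caption 1 to image 1.
    """
    # Text score: for each image, the correct caption must score higher.
    text_ok = s[0][0] > s[1][0] and s[1][1] > s[0][1]
    # Image score: for each caption, the correct image must score higher.
    image_ok = s[0][0] > s[0][1] and s[1][1] > s[1][0]
    # Group score: both conditions must hold for the same example.
    return {"text": text_ok, "image": image_ok, "group": text_ok and image_ok}


# Illustrative captions (not from the dataset): same words, different order.
print(same_words("a dog chases a cat", "a cat chases a dog"))

# Made-up similarities for one example.
print(winoground_scores([[0.9, 0.2], [0.3, 0.8]]))
```

Averaging each of the three booleans over all examples would give dataset-level accuracies; because the group score requires every comparison to succeed, it is the strictest of the three.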

Variants: Winoground

Associated Benchmarks

This dataset is used in 1 benchmark:

Recent Benchmark Submissions

| Task | Model | Paper | Date |
|---|---|---|---|
| Visual Reasoning | GPT-4o + CA | A Cognitive Paradigm Approach to … | 2025-01-23 |
| Visual Reasoning | KeyComp* (GPT-3.5) | Prompting Large Vision-Language Models for … | 2024-01-20 |
| Visual Reasoning | KeyComp* (GPT-4) | Prompting Large Vision-Language Models for … | 2024-01-20 |
| Visual Reasoning | KeyComp (GPT-3.5) | Prompting Large Vision-Language Models for … | 2024-01-20 |
| Visual Reasoning | OpenFlamingo + CoCoT | CoCoT: Contrastive Chain-of-Thought Prompting for … | 2024-01-05 |
| Visual Reasoning | OpenFlamingo | CoCoT: Contrastive Chain-of-Thought Prompting for … | 2024-01-05 |
| Visual Reasoning | Gemini + DDCoT | CoCoT: Contrastive Chain-of-Thought Prompting for … | 2024-01-05 |
| Visual Reasoning | GPT-4V + CoCoT | CoCoT: Contrastive Chain-of-Thought Prompting for … | 2024-01-05 |
| Visual Reasoning | Gemini + CCoT | CoCoT: Contrastive Chain-of-Thought Prompting for … | 2024-01-05 |
| Visual Reasoning | OpenFlamingo + DDCoT | CoCoT: Contrastive Chain-of-Thought Prompting for … | 2024-01-05 |
| Visual Reasoning | MMICL + CoCoT | CoCoT: Contrastive Chain-of-Thought Prompting for … | 2024-01-05 |
| Visual Reasoning | MMICL + DDCoT | CoCoT: Contrastive Chain-of-Thought Prompting for … | 2024-01-05 |
| Visual Reasoning | GPT-4V | CoCoT: Contrastive Chain-of-Thought Prompting for … | 2024-01-05 |
| Visual Reasoning | OpenFlamingo + CCoT | CoCoT: Contrastive Chain-of-Thought Prompting for … | 2024-01-05 |
| Visual Reasoning | Gemini + CoCoT | CoCoT: Contrastive Chain-of-Thought Prompting for … | 2024-01-05 |
| Visual Reasoning | Gemini | CoCoT: Contrastive Chain-of-Thought Prompting for … | 2024-01-05 |
| Visual Reasoning | MMICL + CCoT | CoCoT: Contrastive Chain-of-Thought Prompting for … | 2024-01-05 |
| Visual Reasoning | LLaVA-1.5 | Compositional Chain-of-Thought Prompting for Large … | 2023-11-27 |
| Visual Reasoning | LLaVA-1.5-CCoT | Compositional Chain-of-Thought Prompting for Large … | 2023-11-27 |
| Visual Reasoning | LLaVA-1.5-ZS-CoT | Compositional Chain-of-Thought Prompting for Large … | 2023-11-27 |

Research Papers

Recent papers with results on this dataset: