A-OKVQA

Dataset Information
Modalities: Images, Texts
Introduced: 2022
License: Unknown

Overview

A-OKVQA is a crowdsourced visual question answering dataset of about 25K diverse questions that require a broad base of commonsense and world knowledge to answer.

Variants: A-OKVQA

Associated Benchmarks

This dataset is used in 1 benchmark:

Recent Benchmark Submissions

Task | Model | Paper | Date
Visual Question Answering (VQA) | HYDRA | HYDRA: A Hyper Agent for … | 2024-03-19
Visual Question Answering (VQA) | PaLI-X-VPD | Visual Program Distillation: Distilling Tools … | 2023-12-05
Visual Question Answering (VQA) | SMoLA-PaLI-X Specialist Model | Omni-SMoLA: Boosting Generalist Multimodal Models … | 2023-12-01
Visual Question Answering (VQA) | MC-CoT | Boosting the Power of Small … | 2023-11-23
Visual Question Answering (VQA) | A Simple Baseline for KB-VQA | A Simple Baseline for Knowledge-Based … | 2023-10-20
Visual Question Answering (VQA) | Prophet | Prophet: Prompting Large Language Models … | 2023-03-03
Visual Question Answering (VQA) | PromptCap | PromptCap: Prompt-Guided Task-Aware Image Captioning | 2022-11-15
Visual Question Answering (VQA) | VLC-BERT | VLC-BERT: Visual Question Answering with … | 2022-10-24
Visual Question Answering (VQA) | GPV-2 | Webly Supervised Concept Expansion for … | 2022-02-04
Visual Question Answering (VQA) | KRISP | KRISP: Integrating Implicit and Symbolic … | 2020-12-20
Visual Question Answering (VQA) | LXMERT | LXMERT: Learning Cross-Modality Encoder Representations … | 2019-08-20
Visual Question Answering (VQA) | ViLBERT | ViLBERT: Pretraining Task-Agnostic Visiolinguistic Representations … | 2019-08-06
Visual Question Answering (VQA) | ViLBERT - VQA | ViLBERT: Pretraining Task-Agnostic Visiolinguistic Representations … | 2019-08-06
Visual Question Answering (VQA) | ViLBERT - OK-VQA | ViLBERT: Pretraining Task-Agnostic Visiolinguistic Representations … | 2019-08-06
Visual Question Answering (VQA) | Pythia | Pythia v0.1: the Winning Entry … | 2018-07-26

Research Papers

Recent papers with results on this dataset: