DocVQA

Dataset Information
Modalities
Images, Texts
License
Unknown
Homepage

Overview

DocVQA consists of 50,000 questions defined on 12,000+ document images.

Source: DocVQA: A Dataset for VQA on Document Images

Variants: DocVQA val, DocVQA test, DocVQA

Associated Benchmarks

This dataset is used in 1 benchmark:

Recent Benchmark Submissions

Task Model Paper Date
Visual Question Answering (VQA) ChatGPT 3.5 with LAPDoc Prompt (SpatialFormat) LAPDoc: Layout-Aware Prompting for Documents 2024-02-15

Research Papers

Recent papers with results on this dataset: