VisualMRC

Name: VisualMRC
Published: 2021-01-27
License: Unknown

VisualMRC: Machine Reading Comprehension on Document Images

Dataset Information

Modalities

Images, Texts

Languages

English

Introduced

2021

License

Unknown

Overview

VisualMRC is a visual machine reading comprehension dataset. It defines the following task: given a question and a document image, a model must generate an abstractive answer.

More details, analyses, and baseline results are available in the paper "VisualMRC: Machine Reading Comprehension on Document Images" (AAAI 2021).

Statistics:
10,197 images
30,562 QA pairs
10.53 average question tokens (tokenized with the NLTK tokenizer)
9.53 average answer tokens (tokenized with the NLTK tokenizer)
151.46 average OCR tokens (tokenized with the NLTK tokenizer)
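
As a rough illustration of how figures like these are derived, the sketch below computes the ratio of QA pairs to images from the counts above, and an average token count over a list of strings. The tokenizer here is a simple regex stand-in, not the NLTK tokenizer used for the reported numbers, so its counts would not match them exactly. The function names and the sample questions are hypothetical and not part of the dataset.

```python
import re
from typing import List

# Counts quoted in the statistics above.
NUM_IMAGES = 10_197
NUM_QA_PAIRS = 30_562


def qa_pairs_per_image(num_qa_pairs: int, num_images: int) -> float:
    """Average number of QA pairs attached to each document image."""
    return num_qa_pairs / num_images


def simple_tokenize(text: str) -> List[str]:
    """Stand-in tokenizer (assumption): word runs and single punctuation marks.

    The reported statistics use the NLTK tokenizer instead.
    """
    return re.findall(r"\w+|[^\w\s]", text)


def average_token_count(texts: List[str]) -> float:
    """Mean number of tokens per string; 0.0 for an empty list."""
    if not texts:
        return 0.0
    return sum(len(simple_tokenize(t)) for t in texts) / len(texts)


if __name__ == "__main__":
    # Roughly three questions per image.
    print(round(qa_pairs_per_image(NUM_QA_PAIRS, NUM_IMAGES), 2))

    # Hypothetical questions, for illustration only.
    sample_questions = [
        "What is the title of the page?",
        "Who wrote the article?",
    ]
    print(average_token_count(sample_questions))
```

To approximate the reported averages more closely, simple_tokenize could be swapped for an NLTK tokenizer, assuming NLTK and its tokenizer data are installed.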

Variants: VisualMRC

Associated Benchmarks

This dataset is used in 1 benchmark:

Visual Question Answering - Metrics: CIDEr

Recent Benchmark Submissions

Task	Model	Paper	Date
Visual Question Answering	LayoutT5 (Large)	VisualMRC: Machine Reading Comprehension on …	2021-01-27

Research Papers

Recent papers with results on this dataset:

VisualMRC: Machine Reading Comprehension on Document Images (2021)
