NLVR

Natural Language Visual Reasoningnatural language for visual reasoning

Dataset Information
Modalities
Images, Texts
Languages
English
Introduced
2017
License
Unknown
Homepage

Overview

NLVR contains 92,244 pairs of human-written English sentences grounded in synthetic images. Because the images are synthetically generated, this dataset can be used for semantic parsing.

Source: http://lil.nlp.cornell.edu/nlvr/
Image Source: http://lil.nlp.cornell.edu/nlvr/

Variants: NLVR, NLVR2 Dev, NLVR2 Test

Associated Benchmarks

This dataset is used in 1 benchmark:

Recent Benchmark Submissions

Task Model Paper Date
Visual Reasoning VisualBERT VisualBERT: A Simple and Performant … 2019-08-09

Research Papers

Recent papers with results on this dataset: