Natural Language Visual Reasoningnatural language for visual reasoning
NLVR contains 92,244 pairs of human-written English sentences grounded in synthetic images. Because the images are synthetically generated, this dataset can be used for semantic parsing.
Source: http://lil.nlp.cornell.edu/nlvr/
Image Source: http://lil.nlp.cornell.edu/nlvr/
Variants: NLVR, NLVR2 Dev, NLVR2 Test
This dataset is used in 1 benchmark:
Task | Model | Paper | Date |
---|---|---|---|
Visual Reasoning | VisualBERT | VisualBERT: A Simple and Performant … | 2019-08-09 |
Recent papers with results on this dataset: