Image-Chat

Dataset Information
Modalities
Images, Texts
Languages
English
Introduced
2020
Homepage

Overview

The IMAGE-CHAT dataset is a large collection of (image, style trait for speaker A, style trait for speaker B, dialogue between A & B) tuples that we collected using crowd-workers, Each dialogue consists of consecutive turns by speaker A and B. No particular constraints are placed on the kinds of utterance, only that we ask the speakers to both use the provided style trait, and to respond to the given image and dialogue history in an engaging way. The goal is not just to build a diagnostic dataset but a basis for training models that humans actually want to engage with.

Variants: Image-Chat

Associated Benchmarks

This dataset is used in 1 benchmark:

Recent Benchmark Submissions

Task Model Paper Date
Visual Dialog Multi-Modal BlenderBot Multi-Modal Open-Domain Dialogue 2020-10-02

Research Papers

Recent papers with results on this dataset: