MMConv

Dataset Information
Modalities: Images, Texts
Languages: English
Introduced: 2021

Overview

The main goal of the data collection is to acquire highly natural conversations that cover a wide variety of styles and scenarios. The corpus covers five domains: Food, Hotel, Nightlife, Shopping mall and Sightseeing. Depending on the task setting, each collected dialogue covers between one and four domains, so the dialogues vary greatly in length and complexity. There are 808 single-task dialogues, each containing a single venue target, and 4,298 multi-task dialogues, each containing two to four venue targets. In multi-task dialogues, the venues usually belong to different domains.
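As a quick sanity check on the figures above, the following minimal sketch adds up the two dialogue counts and computes the share of each kind. The function and variable names are illustrative assumptions and are not part of the dataset or its tooling; only the two counts come from the overview.

```python
# Dialogue counts as stated in the overview above.
SINGLE_TASK_DIALOGUES = 808    # one venue target per dialogue
MULTI_TASK_DIALOGUES = 4298    # two to four venue targets per dialogue


def split_summary(single: int, multi: int) -> dict:
    """Return the total dialogue count and the share of each dialogue kind.

    Hypothetical helper for illustration only.
    """
    total = single + multi
    return {
        "total": total,
        "single_share": single / total,
        "multi_share": multi / total,
    }


summary = split_summary(SINGLE_TASK_DIALOGUES, MULTI_TASK_DIALOGUES)
print(f"total dialogues: {summary['total']}")
print(f"single-task share: {summary['single_share']:.1%}")
print(f"multi-task share: {summary['multi_share']:.1%}")
```

Under these counts the corpus holds 5,106 dialogues in total, of which roughly one in six is single-task.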

Variants: MMConv

Associated Benchmarks

This dataset is used in 1 benchmark:

Recent Benchmark Submissions

Task | Model | Paper | Date
Response Generation | PaCE | PaCE: Unified Multi-modal Dialogue Pre-training … | 2023-05-24
Response Generation | SimpleTOD | A Simple Language Model for … | 2020-05-02

Research Papers

Recent papers with results on this dataset: