EmpatheticDialogues

Dataset Information
Modalities
Texts, Dialog
Introduced
2019
Homepage

Overview

The EmpatheticDialogues dataset is a large-scale multi-turn empathetic dialogue dataset collected on the Amazon Mechanical Turk, containing 24,850 one-to-one open-domain conversations. Each conversation was obtained by pairing two crowd-workers: a speaker and a listener. The speaker is asked to talk about the personal emotional feelings. The listener infers the underlying emotion through what the speaker says and responds empathetically. The dataset provides 32 evenly distributed emotion labels.

Source: Empathetic Dialogue Generation viaKnowledge Enhancing and Emotion Dependency Modeling
Image Source: Towards Empathetic Open-domain Conversation Models: A New Benchmark and Dataset

Variants: EmpatheticDialogues

Associated Benchmarks

This dataset is used in 1 benchmark:

Recent Benchmark Submissions

Task Model Paper Date
Visual Dialog Multi-Modal BlenderBot Multi-Modal Open-Domain Dialogue 2020-10-02

Research Papers

Recent papers with results on this dataset: