EmoCause

Dataset Information
Modalities
Texts
Languages
English
Introduced
2021
License
Homepage

Overview

EmoCause is a dataset of annotated emotion cause words in emotional situations from the EmpatheticDialogues valid and test set. The goal is to recognize emotion cause words in sentences by training only on sentence-level emotion labels without word-level labels (i.e., weakly-supervised emotion cause recognition).

EmoCause is based on the fact that humans do not recognize the cause of emotions with supervised learning on word-level cause labels. Thus, we do not provide a training set.

  • Number of emotion categories: 32
  • Average number of cause words per utterance: 2.3
  • Total number of utterances: 4.6K (valid: 3.8K / test: 0.8K)

Variants: EmoCause

Associated Benchmarks

This dataset is used in 1 benchmark:

Recent Benchmark Submissions

Task Model Paper Date
Recognizing Emotion Cause in Conversations Human Perspective-taking and Pragmatics for Generating … 2021-09-18
Recognizing Emotion Cause in Conversations GEE Perspective-taking and Pragmatics for Generating … 2021-09-18
Recognizing Emotion Cause in Conversations BERT-Attention Perspective-taking and Pragmatics for Generating … 2021-09-18
Recognizing Emotion Cause in Conversations EmpDG Perspective-taking and Pragmatics for Generating … 2021-09-18
Recognizing Emotion Cause in Conversations RAKE Perspective-taking and Pragmatics for Generating … 2021-09-18
Recognizing Emotion Cause in Conversations Random Perspective-taking and Pragmatics for Generating … 2021-09-18

Research Papers

Recent papers with results on this dataset: