RESD

Russian Emotional Speech Dialogs with annotated text

Dataset Information
Modalities
Texts, Audio, Speech
Languages
Russian
Introduced
2024
License
MIT
Homepage

Overview

Russian dataset of emotional speech dialogues. This dataset was assembled from ~3.5 hours of live speech by actors who voiced pre-distributed emotions in the dialogue for ~3 minutes each.

Each sample of dataset contains name of part from the original dataset studio source, speech file (16000 or 44100Hz) of human voice, 1 of 7 labeled emotions and the speech-to-texted part of voice speech.

Emotions are represented in 7 states: anger, disgust, fear, enthusiasm, happiness, neutral and sadness.

This dataset was created by Artem Amentes, Nikita Davidchuk and Ilya Lubenets

@misc{Aniemore,
  author = {Артем Аментес, Илья Лубенец, Никита Давидчук},
  title = {Открытая библиотека искусственного интеллекта для анализа и выявления эмоциональных оттенков речи человека},
  year = {2022},
  publisher = {Hugging Face},
  journal = {Hugging Face Hub},
  howpublished = {\url{https://huggingface.com/aniemore/Aniemore}},
  email = {[email protected]}
}

Variants: RESD

Associated Benchmarks

This dataset is used in 1 benchmark:

Recent Benchmark Submissions

Task Model Paper Date
Speech Emotion Recognition emotion2vec+base emotion2vec: Self-Supervised Pre-Training for Speech … 2023-12-23
Speech Emotion Recognition emotion2vec+large emotion2vec: Self-Supervised Pre-Training for Speech … 2023-12-23
Speech Emotion Recognition emotion2vec emotion2vec: Self-Supervised Pre-Training for Speech … 2023-12-23

Research Papers

Recent papers with results on this dataset: