Russian Emotional Speech Dialogs with annotated text
Russian dataset of emotional speech dialogues. This dataset was assembled from ~3.5 hours of live speech by actors who voiced pre-distributed emotions in the dialogue for ~3 minutes each.
Each sample of dataset contains name of part from the original dataset studio source, speech file (16000 or 44100Hz) of human voice, 1 of 7 labeled emotions and the speech-to-texted part of voice speech.
Emotions are represented in 7 states: anger, disgust, fear, enthusiasm, happiness, neutral and sadness.
This dataset was created by Artem Amentes, Nikita Davidchuk and Ilya Lubenets
@misc{Aniemore,
author = {Артем Аментес, Илья Лубенец, Никита Давидчук},
title = {Открытая библиотека искусственного интеллекта для анализа и выявления эмоциональных оттенков речи человека},
year = {2022},
publisher = {Hugging Face},
journal = {Hugging Face Hub},
howpublished = {\url{https://huggingface.com/aniemore/Aniemore}},
email = {[email protected]}
}
Variants: RESD
This dataset is used in 1 benchmark:
Task | Model | Paper | Date |
---|---|---|---|
Speech Emotion Recognition | emotion2vec+base | emotion2vec: Self-Supervised Pre-Training for Speech … | 2023-12-23 |
Speech Emotion Recognition | emotion2vec+large | emotion2vec: Self-Supervised Pre-Training for Speech … | 2023-12-23 |
Speech Emotion Recognition | emotion2vec | emotion2vec: Self-Supervised Pre-Training for Speech … | 2023-12-23 |
Recent papers with results on this dataset: