The dataset consists of audio segments collected from various Spanish-language YouTube channels. Each segment is labeled with the emotion it conveys, using five of Ekman's six basic emotions (anger, disgust, fear, joy, and sadness) plus a neutral class.
Language(s)
Spanish
Dataset description link
Year
2024
Domain
Diverse
Annotations
Each instance is labeled with one of six classes: the five emotions anger, disgust, fear, joy, and sadness, or neutral.
Format
csv
Publication
Pan et al. (2024). Overview of EmoSPeech at IberLEF 2024: Multimodal Speech-text Emotion Recognition in Spanish. Procesamiento del Lenguaje Natural, 73, 359-368.
NLP Topic
Number of units
3750
Type of units
Samples of texts
Training set size
3000
Test set size
750
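A minimal sketch of how the label set and split sizes above could be sanity-checked after download. The card only states that the format is csv, so the column names (`id`, `transcription`, `label`), the lowercase label spellings, and the sample rows are all assumptions made for illustration, not the dataset's documented schema.

```python
import csv
import io
from collections import Counter

# The six classes listed under Annotations (spelling and case are assumed).
LABELS = {"anger", "disgust", "fear", "joy", "sadness", "neutral"}

# Split sizes stated in the card.
EXPECTED_SIZES = {"train": 3000, "test": 750}


def count_labels(csv_file, label_column="label"):
    """Count labels in an open CSV file, rejecting values outside LABELS.

    `label_column` is a hypothetical column name; adjust it to the real header.
    """
    counts = Counter()
    for row_number, row in enumerate(csv.DictReader(csv_file), start=1):
        label = row[label_column].strip().lower()
        if label not in LABELS:
            raise ValueError(f"row {row_number}: unexpected label {label!r}")
        counts[label] += 1
    return counts


# Tiny in-memory stand-in for a real file; these rows are made up.
sample = io.StringIO(
    "id,transcription,label\n"
    "a1,hola,joy\n"
    "a2,no me gusta,disgust\n"
    "a3,vale,neutral\n"
    "a4,genial,joy\n"
)
counts = count_labels(sample)
print(counts["joy"], sum(counts.values()))

# The stated split sizes add up to the stated number of units (an 80/20 split).
total = sum(EXPECTED_SIZES.values())
print(total, EXPECTED_SIZES["train"] / total)
```

With a real file, `count_labels(open(path, newline="", encoding="utf-8"))` would be the equivalent call, and the row count of each split could then be compared against `EXPECTED_SIZES`.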