The DETESTS dataset consists of 5,629 sentences, with an average of 24% of them containing stereotypes. It contains comments published in response to different articles extracted from Spanish online newspapers (ABC, elDiario.es, El Mundo, NIUS, etc.) and discussion forums.
Language(s)
Spanish
Dataset description link
Year
2022
Domain
News
Text types
News comments
Annotations
CategorÃas de estereotipos.
Data access
Public
NLP Topic
Number of units
5629
Type of units
Sentence
Training set size
3817 sentences
Test set size
1812 sentences