hate detection

DETEST-Dis

The DETESTS dataset is designed for the detection of stereotypes in texts, specifically focusing on racism and prejudices in social media and news article comments. It contains texts in Spanish, including tweets related to immigration hoaxes and news comments, which have been manually annotated to identify the presence of both explicit and implicit stereotypes. The dataset is structured to address binary classification tasks, where texts must be classified as either containing stereotypes or not, as well as to detect whether these stereotypes are explicit or implicit.

DETESTS

The DETESTS dataset consists of 5,629 sentences, with an average of 24% of them containing stereotypes. It contains comments published in response to different articles extracted from Spanish online newspapers (ABC, elDiario.es, El Mundo, NIUS, etc.) and discussion forums.