hate detection

EXIST-2024-ES

The Spanish corpus EXIST 2024 is a collection of tweets and memes labeled with information related to sexism: whether the tweet/meme is sexist, the type of intent shown by the author, and the type of sexism.

DETEST-Dis

The DETESTS dataset is designed for the detection of stereotypes in texts, specifically focusing on racism and prejudices in social media and news article comments. It contains texts in Spanish, including tweets related to immigration hoaxes and news comments, which have been manually annotated to identify the presence of both explicit and implicit stereotypes. The dataset is structured to address binary classification tasks, where texts must be classified as either containing stereotypes or not, as well as to detect whether these stereotypes are explicit or implicit.

DETESTS

The DETESTS dataset consists of 5,629 sentences, with an average of 24% of them containing stereotypes. It contains comments published in response to different articles extracted from Spanish online newspapers (ABC, elDiario.es, El Mundo, NIUS, etc.) and discussion forums.