DETESTS

The DETESTS dataset consists of 5,629 sentences, with an average of 24% of them containing stereotypes. It contains comments published in response to different articles extracted from Spanish online newspapers (ABC, elDiario.es, El Mundo, NIUS, etc.) and discussion forums.

Language(s)

Spanish

Dataset description link

http://journal.sepln.org/sepln/ojs/ojs/index.php/pln/article/view/6442

Year

2022

Domain

News

Text types

News comments

Annotations

Categorías de estereotipos.

Data access

Public

Data link

https://detestsiberlef.wixsite.com/detests/corpus

NLP Topic

hate detection

Number of units

5629

Type of units

Sentence

Training set size

3817 sentences

Test set size

1812 sentences

Log in or register to post comments

If you have published a result better than those on the list, send a message to odesia-comunicacion@lsi.uned.es indicating the result and the DOI of the article, along with a copy of it if it is not published openly.