The Spanish EXIST 2024 dataset is a collection of tweets and memes annotated for sexism: whether the tweet or meme is sexist, the intention of the author, and the type of sexism expressed.
Language(s)
Spanish
Dataset description link
Year
2024
Domain
Social
Annotations
Binary label indicating whether a meme expresses sexism; multiclass labels for the type of sexism and the intention of the author
Format
json
Annotation guide link
Data access
Register form
Data link
Publication
Plaza, L. et al. (2024). EXIST 2024: sEXism Identification in Social neTworks and Memes. Experimental IR Meets Multilinguality, Multimodality, and Interaction. CLEF 2024. Lecture Notes in Computer Science, volume 14612
Publication link
License
CC-BY-4.0
NLP Topic
Number of units
2573
Size
2573.00 MB
Training set size
2034
Test set size
540

