Datasets
Below is information about Spanish textual data sets created with the goal of solving NLP tasks. In this case, these are collections of texts, generally enriched with annotations.
-
Rest-Mex 2023 Clustering
NewsSpanish , Spanish (Mexico)Published in 2023114,550Newstopic modeling -
FinancES 2023
FinanceSpanishPublished in 20237,980News headlinessentiment analysis -
GUA-SPA: Guarani Spanish corpus
NewsSpanish , Spanish (Paraguay) , GuaraniPublished in 20231,500Newscode switching detection -
MINT ES
GeneralSpanish , EnglishPublished in 20231,991Tweetssentiment analysis -
OpeNER-ES-2022
TourismSpanishPublished in 20222,057Reviewssentiment analysis -
EmoEventEs
SpanishPublished in 20218,409Tweetssentiment analysis -
VaxxStance-ES
HealthSpanishPublished in 20212,697Tweetssentiment analysis -
InterTASS 2020
Spanish , Spanish (Chile) , Spanish (Costa Rica) , Spanish (Mexico) , Spanish (Peru) , Spanish (Uruguay)Published in 2020Tweetssentiment analysis -
EmoEVENT
Spanish , Spanish (Chile) , Spanish (Costa Rica) , Spanish (Mexico) , Spanish (Peru) , Spanish (Uruguay)Published in 20208,409Tweetssentiment analysis -
SentiMix-Spanglish
SpanishPublished in 202018,789Tweetssentiment analysis -
NEGES
SpanishPublished in 2019400Reviewssentiment analysis, processing negation -
InterTASS-SP
SpanishPublished in 20193,401Tweetssentiment analysis -
InterTASS-MEX
SpanishPublished in 20193,000Tweetssentiment analysis -
InterTASS-CR
SpanishPublished in 20192,363Tweetssentiment analysis -
InterTASS-PE
SpanishPublished in 20193,005Tweetssentiment analysis
Pagination
If you have published a result better than those on the list, send a message to odesia-comunicacion@lsi.uned.es indicating the result and the DOI of the article, along with a copy of it if it is not published openly.