Portal ODESIA
State of the Art of Natural Language Processing
in Spanish

State of the Art in numbers

342

tasks

Scientific activity proposed with the aim of solving a specific NLP problem.

170

datasets

A collection of annotated texts and/or images that is used to specify a task.

130

competitions

Scientific event in which one or more NLP tasks are proposed.

8

forums

Evaluation initiative that acts as framework to organise competitions.

datasets

  • PolyHope-2025 V2

    Social
    Published in 2025
    Spanish , English
    29957
    29957.00MB
    Tweets

  • XC-Translate-2025-en-es

    Diverse
    Published in 2025
    Spanish , English , Arabic , Deuch , French , Italian , Korean , Chinese
    6148
    Pairs of sentence

  • Spa-DataBench

    Diverse
    Published in 2025
    Spanish
    300
    300.00MB

  • Mu-SHROOM-2025-es

    Diverse
    Published in 2025
    Spanish , English , Arabic , Deuch , Farsi , French , Hindi , Italian , Swedish , Chinese
    200
    Wikipedia

Forums

CLEF

Conference and Labs of the Evaluation Forum

https://www.clef-initiative.eu/

IBERLEF

Evaluation campaign for Natural Language Processing (NLP) systems in Spanish and other Iberian languages

https://sites.google.com/view/iberlef-2023/home

IberEval

Workshop on Evaluation of Human Language Technologies for Iberian Languages

SEMEVAL

International Workshop on Semantic Evaluation

https://semeval.github.io/

CoNLL

International Workshop on Semantic Evaluation

https://www.signll.org/conll

PAN

Series of scientific events and shared tasks on digital text forensics and stylometry

https://pan.webis.de/data.html

If you use this portal, please cite:

A Web Portal about the State of the Art of NLP Tasks in Spanish
Enrique Amigó, Jorge Carrillo-de-Albornoz, Andrés Fernández, Julio Gonzalo, Guillermo Marco, Roser Morante, Laura Plaza, Jacobo Pedrosa
Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)

@inproceedings{webportal2024,
  title={A Web Portal about the State of the Art of NLP Tasks in Spanish},
  author={Amigó, Enrique and Carrillo-de-Albornoz, Jorge and Fernández, Andrés and Gonzalo, Julio and Marco, Guillermo and Morante, Roser and Plaza, Laura and Pedrosa, Jacobo},
  booktitle={Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)},
  url={https://aclanthology.org/2024.lrec-main.183/},
  year={2024}
}