GenoVarDis 2024: NER in Genomic Variants and Related Diseases

This task addresses the lack of Spanish-language resources for named entity recognition (NER) of genomic variants, and it is the first of its kind. It is based on an expert-curated corpus that covers mutations and variant-related entities (genes, diseases, and symptoms). The task aims to improve the training of NER models in a low-resource domain and to overcome the limitations of current tools based on regular expressions. Because NER datasets for variants are scarce even in English, this work is an important step for the field. Inspired by precision medicine and biocuration, it promotes Spanish-language NLP research.

Publication
Agüero-Torales et al. (2024). Overview of GenoVarDis at IberLEF 2024: NER of Genomic Variants and Related Diseases in Spanish. Procesamiento del Lenguaje Natural, 73: 421-434.

Task results

If you have published a result that improves on those listed, please send a message to odesia-comunicacion@lsi.uned.es with the result and the DOI of the article, and attach a copy of the article if it is not openly available.