Binary word identification

The goal of the CWI shared task of 2018 is to predict which words challenge non-native speakers based on the annotations collected from both native and non-native speakers. A labeled training set is provided where words in context were annotated regarding their complexity. Training and test data belong to the same language. The task consists in labeling the target test words in context as complex (1) or simple (0).

Publication

Seid Muhie Yimam, Chris Biemann, Shervin Malmasi, Gustavo Paetzold, Lucia Specia, Sanja Štajner, Anaïs Tack, Marcos Zampieri (2018) A Report on the Complex Word Identification Shared Task 2018. Proceedings of the Thirteenth Workshop on Innovative Use of NLP for Building Educational Applications, pages 66-78 New Orleans, Louisiana, June 5, 2018.

Competition

Second Complex Word Identification Shared Task

Language

Spanish

NLP topic

morphology

Abstract task

Classification

Dataset

CWIG3G2-ES

Year

2018

Publication link

https://aclanthology.org/W18-…

Task results

System	MacroF1 Sort ascending
TMU	0.7699
NLP-CIC	0.7672
ITEC	0.7637
NLP-CIC	0.7468
CoastalCPH	0.7458