SemEval-2023 Task 9: Multilingual Tweet Intimacy Analysis

Multilingual Tweet Intimacy Analysis is a task to predict the intimacy of tweets in different languages. The task focuses on on perceived intimacy by asking annotators to give their subjective judgment of tweet intimacy.  Each annotator is asked to answer “How intimate do you think the given tweet is?” using a 1-5 likert scale. Tweets are annotated in 10 languages. The training data contains labeled intimacy for six languages: English, French, Spanish, Italian, ortuguese, and Chinese. To encourage new studies on understanding intimacy in language four other languages are included without training data (Dutch, Hindi, Korean, and Arabic). The participants are asked to build models that can predict tweet intimacy from 1 (not intimate at all) to 5 (very intimate).

Publication
Jiaxin Pei, Vítor Silva, Maarten Bos, Yozen Liu, Leonardo Neves, David Jurgens, and Francesco Barbieri. 2023. SemEval-2023 Task 9: Multilingual Tweet Intimacy Analysis. In Proceedings of the 17th International Workshop on Semantic Evaluation (SemEval-2023), pages 2235–2246, Toronto, Canada. Association for Computational Linguistics.
Language
Spanish
English
Abstract task
Dataset
Year
2023
Ranking metric
Pearson correlation

Task results

System Pearson correlation
UZH_Clyp 0.7400
tmn 0.7400
king001 0.7840
arizonans 0.7350
opi 0.7750
Zhegu 0.7290
lazybob 0.7700
lottery 0.7500
ODA_SRIB 0.7470
OPD 0.7460

If you have published a result better than those on the list, send a message to odesia-comunicacion@lsi.uned.es indicating the result and the DOI of the article, along with a copy of it if it is not published openly.