This task aims to boost research on the detection of automatically generated text. Participants must develop models that exploit clues about linguistic form and meaning to distinguish machine-generated text (MGT) from human-written text. The subtask is framed as binary classification of human text (Hum) versus MGT (Gen): the training set covers texts from three domains, and submissions are evaluated on two unseen domains.
Publication
Areg Mikael Sarvazyan, José Ángel González, Marc Franco-Salvador, Francisco Rangel, Berta Chulvi, Paolo Rosso (2023) Overview of AuTexTification at IberLEF 2023: Detection and Attribution of Machine-Generated Text in Multiple Domains. Procesamiento del Lenguaje Natural, Revista nº 71, septiembre de 2023, pp. 275-288.
Competition
Language
Spanish
NLP topic
Abstract task
Dataset
Year
2023
Publication link
Ranking metric
Macro F1
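Macro F1 averages the per-class F1 scores with equal weight per class, so performance on the minority class counts as much as on the majority class. A minimal sketch of the metric (the gold/predicted label lists are illustrative, not taken from the task data):

```python
def macro_f1(y_true, y_pred):
    """Macro-averaged F1: compute F1 per class, then take the unweighted mean."""
    classes = sorted(set(y_true) | set(y_pred))
    f1s = []
    for c in classes:
        tp = sum(t == c and p == c for t, p in zip(y_true, y_pred))
        fp = sum(t != c and p == c for t, p in zip(y_true, y_pred))
        fn = sum(t == c and p != c for t, p in zip(y_true, y_pred))
        precision = tp / (tp + fp) if tp + fp else 0.0
        recall = tp / (tp + fn) if tp + fn else 0.0
        f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
        f1s.append(f1)
    return sum(f1s) / len(f1s)

# Toy gold and predicted labels for the binary Hum/Gen task (hypothetical data).
gold = ["Hum", "Gen", "Gen", "Hum", "Gen", "Hum"]
pred = ["Hum", "Gen", "Hum", "Hum", "Gen", "Gen"]
print(round(macro_f1(gold, pred), 4))  # → 0.6667
```

This matches what libraries such as scikit-learn compute with `f1_score(..., average="macro")`.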
Task results
System | Macro F1
---|---
TALN-UPF | 0.7077
Ling UCM | 0.7060
Drocks | 0.6537
GLPSI | 0.6390
turing_testers | 0.6277
bucharest | 0.5649
ANLP | 0.5138
UAEMex | 0.3517
LKE_BUAP | 0.3160