AuTexTification 2023 | Portal ODESIA

The AuTexTification dataset consists of texts written by humans and LLMs in five domains: tweets, reviews, how-to articles, news and legal documents.

Language(s)

Spanish

English

Dataset description link

https://zenodo.org/record/7956207

Year

2023

Domain

General

Legal

News

Data access

Registration

Data link

https://zenodo.org/record/7956207

Publication

Areg Mikael Sarvazyan, José Ángel González, Marc Franco-Salvador, Francisco Rangel, Berta Chulvi, Paolo Rosso (2023) Overview of AuTexTification at IberLEF 2023: Detection and Attribution of Machine-Generated Text in Multiple Domains. Procesamiento del Lenguaje Natural, Revista nº 71, septiembre de 2023, pp. 275-288.

Publication link

http://journal.sepln.org/sepln/ojs/ojs/index.php/pln/article/view/6559

NLP Topic

text generation

Number of units

52191

Type of units

Samples of texts

Size - additional information

model generated or not, attributed model

Log in or register to post comments