Cross-Domain Authorship Attribution

Author identification attempts to reveal the authors behind texts. It is an area of research with applications in literary research, cyber-security, forensics, and social media analysis. This edition of PAN studies two tasks: cross-domain authorship attribution, where the texts of known and unknown authorship belong to different domains, and style change detection, where single-author and multi-author texts are to be distinguished. The style change detection task asks one question: given a document, are there any style changes or not? For the attribution task, fanfiction texts are used. Fanfiction makes up a large part of contemporary fiction and is written by non-professional authors who are inspired by specific well-known works, which makes it possible to control the domain of texts for the first time. The corpus of fanfiction texts covers five languages (English, French, Italian, Polish, and Spanish).