CT–CWT–23-ES

The dataset focused on three topics: COVID-19, climate change and technology. The Spanish dataset is a combination of CT-CWT-21, CT-CWT-22 and newly collected content. It is composed of tweets collected from Twitter accounts and transcriptions from Spanish politicians, which are manually annotated by professional journalists who are experts in fact-checking. Each tweet was labeled using both the image and the text.

Language(s)
Spanish
Year
2023
Domain
News
Text types
Tweets
Annotations
binary label indicating whether the message is worth fact-checking
Data access
Registration

Publication
Barrón-Cedeño, A. et al. (2023). Overview of the CLEF–2023 CheckThat! Lab on Checkworthiness, Subjectivity, Political Bias, Factuality, and Authority of News Articles and Their Source. In: Arampatzis, A., et al. Experimental IR Meets Multilinguality, Multimodality, and Interaction. CLEF 2023. Lecture Notes in Computer Science, vol 14163. Springer, Cham. https://doi.org/10.1007/978-3-031-42448-9_20
Number of units
29984
Training set size
17487
Test set size
5000
Development set size
7497

If you have published a result better than those on the list, send a message to odesia-comunicacion@lsi.uned.es indicating the result and the DOI of the article, along with a copy of it if it is not published openly.