Question Answering

SQUAD-SQAC 2024 ES

SQUAD/SQAC 2024 extends two datasets: SQuAD v1.1 (Stanford Question Answering Dataset) (Rajpurkar et al., 2016) for English and SQAC (Spanish Question Answering Corpus) (Gutiérrez-Fandiño et al., 2021) for Spanish. It contains academic news from CSIC (Consejo Superior de Investigaciones Científicas) for Spanish and from Cambridge University for English, with questions and extractive answers.

SQAC

The Spanish Question Answering Corpus (SQAC) is an extractive QA dataset with no unanswerable questions. Its contexts are drawn from the Spanish Wikipedia, encyclopedic articles, newswire articles from Wikinews, and the Spanish section of the AnCora corpus, which mixes newswire and literature sources. The corpus was built by commissioning 18,817 questions, each annotated with its answer span, over 6,247 textual contexts.
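
To make "extractive" concrete, the sketch below shows what an answer-span annotation could look like and how it might be checked. It assumes a SQuAD-style layout in which each answer is stored as its text plus a character offset into the context. The `QAExample` class, the `span_is_valid` helper, and the sample sentence are hypothetical illustrations, not taken from SQAC or from its loader.

```python
from dataclasses import dataclass


@dataclass
class QAExample:
    """Hypothetical SQuAD-style record: the answer is a span of the context."""
    context: str
    question: str
    answer_text: str
    answer_start: int  # character offset of the answer inside the context


def span_is_valid(ex: QAExample) -> bool:
    """Return True if the annotated offset really points at the answer text."""
    if ex.answer_start < 0:
        return False
    end = ex.answer_start + len(ex.answer_text)
    return ex.context[ex.answer_start:end] == ex.answer_text


# Invented example sentence (not from the corpus).
context = "El CSIC tiene su sede central en Madrid."
example = QAExample(
    context=context,
    question="¿Dónde tiene su sede central el CSIC?",
    answer_text="Madrid",
    answer_start=context.index("Madrid"),  # offset derived from the context
)

print(span_is_valid(example))
```

Because the dataset has no unanswerable questions, a check of this kind would be expected to pass for every record; a failure would point to an offset or encoding problem rather than to a deliberately empty answer.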