ClinAIS 2023

The ClinAIS corpus is a randomly-selected subset of the background CodiEsp corpus, consisting of 1038 distinct clinical notes annotated with seven types of medical sections of the notes.

Language(s)
Spanish
Year
2023
Domain
Health
Text types
Clinical notes
Data access
Registration

Publication
I. de la Iglesia, M. Vivó, P. Chocrón, G. de Maeztu, K. Gojenola, A. Atutxa, An Open Source Corpus and Automatic Tool for Section Identification in Spanish Health Records, Journal of Biomedical Informatics 145 (2023) 104461
Number of units
1038
Type of units
Documents
Documents
1038
Training set size
781
Test set size
130
Development set size
127
Size - additional information

sections of clinical notes

If you have published a result better than those on the list, send a message to odesia-comunicacion@lsi.uned.es indicating the result and the DOI of the article, along with a copy of it if it is not published openly.