The MEDDOPLACE Gold Standard corpus is a collection of 1,000 clinical case reports in Spanish from various medical specialties such as psychiatry, neurology, travel medicine, infectious diseases, cardiology, occupational medicine and oncology. The corpus is annotated on the one hand with places and locations and on the other hand location classes of clinical relevance: (a) birthplace, (b) residence, (c) movement, and (d) healthcare attention.
Language(s)
Spanish
Dataset description link
Year
2023
Domain
Health
Text types
Clinical notes
Annotation guide link
Data access
Public
Data link
Publication
Salvador Lima-López,, Eulàlia Farré-Maduell, Vicent Briva-Iglesias, Luis Gasco-Sanchez, Martin Krallinger (2023) MEDDOPLACE Shared Task overview: recognition, normalization and classification of locations and patient movement in clinical texts. Procesamiento del Lenguaje Natural, Revista nº 71, septiembre de 2023, pp. 301-311.
NLP Topic
Number of units
1000
Type of units
Documents
Documents
1000
Size - additional information
location named entities, classes of clinical locations