Language(s)
Spanish
Dataset description link
Year
2022
Domain
Health
Text types
Clinical case reports
Annotations
Species, infectious diseases, species classes according to NCBI Taxonomy.
Format
utf-8 encoded text tab separated
Annotation guide link
Data access
Public
NLP Topic
Number of units
1850
Type of units
Documents
Tokens
1234579
Sentences
65373
Documents
1985
Training set size
1000 docs
Test set size
485 docs
Development set size
500 docs