The goal of SocialDisNER is the automatic recognition of disease mentions in tweets.
Language(s)
Spanish
Year
2022
Domain
Health
Text types
Tweets
Annotations
disease mentions, SNOMED CT terms
Format
txt
Annotation guide link
Data access
Public
Data link
NLP Topic
Number of units
9500
Type of units
Tweets
Tokens
366277
Training set size
5000 tweets
Test set size
2000 tweets
Development set size
2500 tweets
Size - additional information
85000