Language(s)
Spanish
English
Dataset description link
Year
2017
Text types
Web pages
Annotations
disambiguated person names
Format
html
pdf
xml
Annotation guide link
Data access
Public
NLP Topic
Number of units
10420
Type of units
Web pages
Training set size
6793 web pages
Test set size
3627 web pages