A collection of historical and named-entity lexica for Bulgarian, Czech, Dutch, English, French, German, Polish, Slovene, Spanish and Latin.
REID2017
Example and evaluation dataset used for the ICDAR2017 Competition on Recognition of Early Indian printed Documents.
HBR2013
Example and evaluation dataset used for the ICDAR2013 Competition on Historical Book Recognition.
BVMC Linked Open Data
The catalogue of the Biblioteca Virtual Miguel de Cervantes contains about 200,000 records, which were originally created in compliance with the MARC21 standard.