A collection of historical and named-entity lexica for Bulgarian, Czech, Dutch, English, French, German, Polish, Slovene, Spanish and Latin.
RASM2018
Example and evaluation dataset used for the ICFHR2018 Competition on Recognition of Historical Arabic Scientific Manuscripts.
REID2017
Example and evaluation dataset used for the ICDAR2017 Competition on Recognition of Early Indian printed Documents
HBR2013
Example and evaluation dataset used for the ICDAR2013 Competition on Historical Book Recognition.
HDLAC2011
Example and evaluation dataset used for the ICDAR2011 Historical Document Layout Analysis Competition.
BVMC Linked Open Data
The catalogue of the Biblioteca Virtual Miguel de Cervantes contains about 200,000 records which were originally created in compliance with the MARC21 standard.