More than half a million representative text-based images compiled by a number of major European libraries.
Layout Analysis Dataset
This dataset was created primarily for evaluating physical and logical layout analysis methods.
Census 1961 Project Dataset
Images containing tables from the 1961 Census for England and Wales.
RDCL2017
Example and evaluation dataset used for the ICDAR2017 Competition on Recognition of Documents with Complex Layouts.
REID2017
Example and evaluation dataset used for the ICDAR2017 Competition on Recognition of Early Indian printed Documents.
HNLA2013
Example and evaluation dataset used for the ICDAR2013 Competition on Historical Newspaper Layout Analysis.
HDLAC2011
Example and evaluation dataset used for the ICDAR2011 Historical Document Layout Analysis Competition.