Datasets - IMPACT Centre of Competence

IMPACT Language Resources

Impact Centre of Competence 14 September, 2023

A collection of historical and named-entity lexica for Bulgarian, Czech, Dutch, English, French, German, Polish, Slovene, Spanish and Latin.

Impact Centre of Competence 13 September, 2023

The corpus accounts for 22M OCRed characters along with the corresponding Gold Standard (GS).