In linguistics, the lexicon of a language is its vocabulary, expressed as units – “lexemes” – that correspond to the forms of particular words. These forms may be linked semantically (as in the relationship of meaning between the words “have”, “has” and “had”), or grammatically (that is, as
belonging to a particular word category or part of speech), or historically (as in cases of
spelling variation across time or between countries). Within IMPACT, lexica have been built for several languages to both aid general linguistic research and to support the OCR process by
increasing the number of words in an OCR engine’s dictionary.
Lexicon building
« Back to Glossary Index