Senie

Corpus of Early Written Latvian Texts

A specialized corpus based on early written Latvian sources (both printed texts and manuscripts) of the XVI-XVIII cc. The corpus provides word indices, thus facilitating the study of the lexis, morphology and syntax of the early texts, and serves as the basis of the "Historical dictionary of Latvian (XVI-XVII cc.)".

Publication to be cited:
E. Andronova
The Corpus of Early Written Latvian: Current state and future tasks
University of Birmingham, UK, 2007
Corpus size: 2M words (2.7M tokens)
Development period: 2002–..
Developers: Latvian Language Institute UL, Institute of Mathematics and Computer Science UL, Faculty of Humanities UL
Funding: State Culture Capital Foundation
Homepage: http://senie.korpuss.lv/
CLARIN: http://hdl.handle.net/20.500.12574/12
Other publications:
E. Andronova
Short Texts in the Corpus of Early Written Latvian
2020
E. Andronova, R. Silina-Pinke, A. Trumpa, P. Vanags
The Electronic Historical Latvian Dictionary based on the Corpus of Early Written Latvian Texts
Acta Baltico-Slavica, 40, 2016