Corpus of Contemporary Latgalian Texts
The corpus consists of certain proportions of various Latgalian published texts (1988–2012) with accompanying metadata about the author, as well as place and time of publication.
|Corpus size||1M words (1.3M tokens)|
|Developers||Institute of Mathematics and Computer Science UL, Rezekne Academy of Technologies|
|Funding||Latvian-Lithuanian Cross Border Cooperation program, “Development of Research Infrastructure for Education in the Humanities in Eastern Latvia and Lithuania” (HipiLatLit)|