LiLa  Search Word Frequency List

Lithuanian-Latvian-Lithuanian Parallel Text Corpus

The sentence-level parallel text corpus (approximately eight million running words), consists of modern (1990s and onwards) translations of different genres from Latvian into Lithuanian and vice versa. The corpus enables research in various fields and allows the creation of dictionaries, language acquisition materials and other language resources and tools.

Publication to be cited:
A. Utka, K. Levane-Petrova, A. Bielinskiene, J. Kovalevskaite, E. Rimkute, D. Vevere
Lithuanian-Latvian-Lithuanian parallel corpus
IOS Press, 2012
PDF DOI
Corpus size 8M words
Development period 2011–2013
Developers Institute of Mathematics and Computer Science UL, Vytautas Magnus University
Funding Latvian-Lithuanian Cross Border Cooperation program, “Development of Research Infrastructure for Education in the Humanities in Eastern Latvia and Lithuania” (HipiLatLit)
Homepage http://hipilatlit.ru.lv/lv/products/lila_info.html
CLARIN http://hdl.handle.net/20.500.12574/6
Other publications
E. Rimkutė, A. Utka, K. Levane-Petrova
Lietuvių–latvių ir latvių–lietuvių kalbų lygiagretusis tekstynas LILA
Studies about Languages (Lithuanian-Latvian, Latvian-Lithuanian Parallel Corpus (LILA)), 70-77, 2013
PDF