
Latvian Blog Corpus 2015

An automatically harvested corpus of Latvian blog texts.

Publication to be cited:
M. Laizans
Latviešu valodas korpusa izveide no emuāru tekstiem [Creation of a Latvian language corpus from blog texts]
Latvijas Universitāte, 2015
Corpus size: 6.6M words (8M tokens)
Development period: 2014–2015
Developers: Institute of Mathematics and Computer Science, UL