VVPP 

Corpus of the Tests of the State Language Proficiency Testing

The Corpus includes a collection of 900 Latvian language proficiency tests: 150 tests per each proficiency level (A1, A2, B1, B2, C1, C2). Error annotation has been perfomed in all texts.

Publication to be cited:
I. Auzina, G. Klava, A. Lazareva, K. Levane-Petrova, B. Murniece-Buleva, S. Pavulena, A. Semjonova
Latviešu valodas prasmes kvalitāte: valsts valodas prasmes pārbaudes kārtotāju rezultāti
Latviešu valodas aģentūra, 2019
PDF
Corpus size 150k tokens
Development period 2017–2018
Developers Institute of Mathematics and Computer Science UL
Funding Latvian Language Agency, "Quality of the Latvian language: results of the state language proficiency test"
CLARIN http://hdl.handle.net/20.500.12574/49
Other publications
R. Dargis, I. Auzina, K. Levane-Petrova
The Use of Text Alignment in Semi-Automatic Error Analysis: Use Case in the Development of the Corpus of the Latvian Language Learners
2018
PDF