|Title||Kansainvälinen oppijansuomen korpus (ICLFI): typologia, taustamuuttujat ja annotointi|
|Publication Type||Journal Article|
|Year of Publication||2011|
This overview introduces the International Corpus of Learner Finnish (ICLFI). The ICLFI is one of the major corpora representing Baltic-Finnic learner language, the others being the Estonian Interlanguage Corpus (EIC) and the Corpus of National Certificate Tests (CEFLING project). The compilation of the ICLFI started in 2007 in the project Corpus study on language-specific and universal features in learner language. The data is being compiled with the help of Finnish language teachers working at foreign universities. The corpus consists of the texts written by learners of Finnish, produced spontaneously in the language learning situations. So far (September 2011), the ICLFI contains circa one million words and texts from speakers of 22 different mother tongues. The text types vary from fiction to non-fiction.