Differences
This shows you the differences between two versions of the page.
| en:cnk:nkjp [2018/11/06 10:33] – [Corpus NKJP_1M] numbers adrianzasina | en:cnk:nkjp [2018/11/12 16:09] (current) – [Corpus NKJP_1M] michalkren |
|---|
| ~~NOTOC~~ | ~~NOTOC~~ |
| ====== Corpus NKJP_1M ====== | ====== The NKJP_1M corpus ====== |
| |
| The NKJP_1M corpus is a manually annotated one-million-word subcorpus of the [[http://nkjp.pl| National Corpus of Polish]] (NKJP – //Narodowy Korpus Języka Polskiego//), composed of various text samples (see below). It is a corpus of contemporary Polish containing texts published after 1945; it covers written, spoken and web communication. The corpus features lemmatisation, morphological annotation, and representative coverage of text categories. | The NKJP_1M corpus is a manually annotated one-million-word subcorpus of the [[http://nkjp.pl| National Corpus of Polish]] (NKJP – //Narodowy Korpus Języka Polskiego//), composed of various text samples (see below). It is a corpus of contemporary Polish containing texts published after 1945; it covers written, spoken and web communication. The corpus features lemmatisation, morphological annotation, and representative coverage of text categories. |