Differences
This shows you the differences between two versions of the page.
Both sides previous revisionPrevious revision | |
en:cnk:nkjp [2018/11/06 10:33] – [Corpus NKJP_1M] numbers adrianzasina | en:cnk:nkjp [2018/11/12 16:09] (current) – [Corpus NKJP_1M] michalkren |
---|
~~NOTOC~~ | ~~NOTOC~~ |
====== Corpus NKJP_1M ====== | ====== The NKJP_1M corpus ====== |
| |
The NKJP_1M corpus is a manually annotated one million word subcorpus of the [[http://nkjp.pl| National Corpus of Polish]] (NKJP – //Narodowy Korpus Języka Polskiego//), composed of various text samples (see below). It is a corpus of contemporary Polish with texts published after the year 1945; it contains written, spoken and web communication. The corpus features lemmatisation, morphological annotation, and representative coverage of text categories. | The NKJP_1M corpus is a manually annotated one million word subcorpus of the [[http://nkjp.pl| National Corpus of Polish]] (NKJP – //Narodowy Korpus Języka Polskiego//), composed of various text samples (see below). It is a corpus of contemporary Polish with texts published after the year 1945; it contains written, spoken and web communication. The corpus features lemmatisation, morphological annotation, and representative coverage of text categories. |