
Differences

This shows you the differences between two versions of the page.

en:cnk:nkjp [2018/11/06 10:33]
Adrian Zasina [Corpus NKJP_1M] numbers
en:cnk:nkjp [2018/11/12 16:09] (current)
Michal Křen [Corpus NKJP_1M]
Line 1: Line 1:
 ~~NOTOC~~
-====== Corpus NKJP_1M ======
+====== The NKJP_1M corpus ======
  
 The NKJP_1M corpus is a manually annotated, one-million-word subcorpus of the [[http://nkjp.pl|National Corpus of Polish]] (NKJP – //Narodowy Korpus Języka Polskiego//), composed of various text samples (see below). It is a corpus of contemporary Polish containing texts published after 1945, and it covers written, spoken and web communication. The corpus features lemmatisation, morphological annotation, and representative coverage of text categories.