Differences

This shows you the differences between two versions of the page.

en:cnk:nkjp [2018/11/06 10:29] – [Text classification] numbers adrianzasina
en:cnk:nkjp [2018/11/06 10:33] – [Corpus NKJP_1M] numbers adrianzasina
Line 6:
 <WRAP right 35%>
 ^ <fs medium>Name</fs> ^^ <fs medium>NKJP_1M</fs> ^
-^ Positions ^ Number of positions (tokens) |  1 215 513 |
-^ ::: ^ Number of positions (excl. punctuation) |  992 014 |
-^ ::: ^ Number of word forms |  143 477 |
-^ ::: ^ Number of lemmas |  54 174 |
-^ Structures ^ Number of documents <doc> |  3 889 |
-^ ::: ^ Number of paragraphs <p> |  18 484 |
-^ ::: ^ Number of sentences <s> |  85 663 |
+^ Positions ^ Number of positions (tokens) |  1,215,513 |
+^ ::: ^ Number of positions (excl. punctuation) |  992,014 |
+^ ::: ^ Number of word forms |  143,477 |
+^ ::: ^ Number of lemmas |  54,174 |
+^ Structures ^ Number of documents <doc> |  3,889 |
+^ ::: ^ Number of paragraphs <p> |  18,484 |
+^ ::: ^ Number of sentences <s> |  85,663 |
 ^ Further information ^ Reference corpus |  YES |
 ^ ::: ^ Representative corpus |  YES |