
Differences

This shows you the differences between two versions of the page.

en:cnk:intercorp:verze14 [2022/01/14 17:07] – [Morphosyntactic annotation] alexandrrosen
en:cnk:intercorp:verze14 [2022/04/01 15:50] – [Corpus size in thousands of words] michalskrabal
Line 1: Line 1:
 ====== InterCorp Release 14 ======
- 
-numbers: TODO! 
  
 ^ Name ^^ Czech -- core ^ Czech -- collections ^ other -- core ^ other -- collections ^
-^ Positions ^ Number of tokens |  141,032,521 |  116,673,043 |  394,042,551 |  1,550,071,364
+^ Positions ^ Number of tokens |  145,640,866 |  116,673,038 |  418,967,492 |  1,548,425,287
-^ ::: ^ Number of word forms |  113,838,505 |  89,819,773 |   327,968,369 |  1,223,270,610
+^ ::: ^ Number of word forms |  117,606,467 |  89,819,772 |   348,771,933 |  1,223,221,264
-^ Structural attributes ^ Number of documents |  1,657 |  30 |  3,993 |   282 |
+^ Structural attributes ^ Number of documents |  1,708 |  30 |  4,220 |   282 |
-^ ::: ^ Number of texts |  1,657 |  111,951 |  3,993 |  1,843,528 |
+^ ::: ^ Number of texts |  1,708 |  111,951 |  4,220 |  1,843,528 |
-^ ::: ^ Number of sentences |  9,782,001 |  13,606,183 |  24,305,621 |  143,195,566 |
+^ ::: ^ Number of sentences |  10,095,074 |  136,606,183 |  25,872,393 |  143,195,566 |
 ^ Further information ^ reference |  YES   ^^^^
 ^ ::: ^ representative |  NO  ^^^^
Line 65: Line 63:
 ^  hi  ^ Hindi |  409 |  0 |  0 |  0 |  0 |  0 |  0 |  409 |
 ^  hr  ^ Croatian |  22 736 |  0 |  0 |  0 |  0 |  19 048 |  571 |  42 356 |
-^  hu  Hungarian |  110 |  0 |  0 |  0 |  0 |  0 |  0 |  110 |
+^  hs  Upper Sorbian |  110 |  0 |  0 |  0 |  0 |  0 |  0 |  110 |
-^  hs  Upper Sorbian |  6 444 |  0 |  0 |  17 852 |  12 198 |  21 115 |  0 |  57 609 |
+^  hu  Hungarian |  6 444 |  0 |  0 |  17 852 |  12 198 |  21 115 |  0 |  57 609 |
 ^  is  ^ Icelandic|  0 |  0 |  0 |  0 |  0 |  1 581 |  0 |  1 581 |
 ^  it  ^ Italian |  15 741 |  1 252 |  2 747 |  23 771 |  15 494 |  14 700 |  684 |  74 389 |
Line 235: Line 233:
 When citing a specific part of InterCorp please use the reference displayed in KonText in the corpus description, e.g. as:
  
-Rosen, A., Vavřín, M., Zasina, A. J. (2022). //The InterCorp Corpus – Czech((Insert languages actually used.)), version 14 of 17 January 2022//. Institute of the Czech National Corpus, Charles University, Prague 2020. Available on-line: https://kontext.korpus.cz/
+Rosen, A., Vavřín, M., Zasina, A. J. (2022). //The InterCorp Corpus – Czech((Insert languages actually used.)), version 14 of 31 January 2022//. Institute of the Czech National Corpus, Charles University, Prague 2022. Available on-line: https://kontext.korpus.cz/
  
 </WRAP>