AplikaceAplikace
Nastavení

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revisionPrevious revision
Last revisionBoth sides next revision
en:cnk:intercorp:historie [2022/01/24 20:55] – [Release 14] alexandrrosenen:cnk:intercorp:historie [2022/11/23 14:31] – [Release 14] alexandrrosen
Line 1: Line 1:
  
 ====== InterCorp: Version history ====== ====== InterCorp: Version history ======
 +
 +===== Release 15 =====
 +
 +Published 11 November 2022
 +
 +== Data: ==
 +
 +  * Total number of word forms in foreign language texts: 1 588 mil., including 362 mil. core and 1 226 mil. collections
 +  * Total number of word forms in Czech texts: 210 mil., including 120 mil. core and 90 mil. collections
 +  * The Project Syndicate collection was extended by texts published in 2019–2021; Arabic and Chinese texts were included for the first time
 +  * Instead of a national tagger for Norwegian, the UDPipe tagger is used starting this release, including tokenization and tagset according to the Universal Dependencies standard (same as for Belarusian and Ukrainian)  
 +  * [[en:cnk:intercorp:verze15|Information about the corpus]]
 +
  
 ===== Release 14 ===== ===== Release 14 =====