Differences
This shows you the differences between two versions of the page.
Both sides previous revisionPrevious revisionNext revision | Previous revisionNext revisionBoth sides next revision | ||
en:cnk:intercorp:historie [2020/10/25 20:29] – [Release 9] alexandrrosen | en:cnk:intercorp:historie [2022/01/24 20:55] – [Release 14] alexandrrosen | ||
---|---|---|---|
Line 1: | Line 1: | ||
====== InterCorp: Version history ====== | ====== InterCorp: Version history ====== | ||
+ | |||
+ | ===== Release 14 ===== | ||
+ | |||
+ | Published 31 January 2022 | ||
+ | |||
+ | == Data: == | ||
+ | |||
+ | * Total number of word forms in foreign language texts: 1 572 mil., including 349 mil. core and 1 223 mil. collections | ||
+ | * Total number of word forms in Czech texts: 207 mil., including 118 mil. core and 90 mil. collections | ||
+ | * Upper Sorbian (abbreviated as hs) was added as a new language. | ||
+ | * [[en: | ||
+ | |||
+ | ===== Release 13ud ===== | ||
+ | |||
+ | Published 22 December 2021 | ||
+ | |||
+ | [[https:// | ||
+ | |||
===== Release 13 ===== | ===== Release 13 ===== | ||
Line 11: | Line 29: | ||
* Total number of word forms in Czech texts: 203 mil., including 113 mil. core and 90 mil. collections | * Total number of word forms in Czech texts: 203 mil., including 113 mil. core and 90 mil. collections | ||
* Chinese is now represented also in the Core part | * Chinese is now represented also in the Core part | ||
+ | * The ReLDI tagger is now used also for tagging Slovene | ||
* [[en: | * [[en: | ||
Line 34: | Line 53: | ||
* Total number of word forms in foreign language texts: 1,508 mil., including 283 mil. core and 1,225 mil. collections | * Total number of word forms in foreign language texts: 1,508 mil., including 283 mil. core and 1,225 mil. collections | ||
- | * Total number of tokens | + | * Total number of word forms in Czech texts: 196 mil., including 107 mil. core and 89 mil. collections |
* Japanese is now represented also in the Core | * Japanese is now represented also in the Core | ||
* Newly tagged and lemmatized languages: Belarusian, Japanese, Ukrainian | * Newly tagged and lemmatized languages: Belarusian, Japanese, Ukrainian | ||
Line 49: | Line 68: | ||
* Total number of word forms in foreign language texts: 1,483 mil., including 258 mil. core and 1,225 mil. collections | * Total number of word forms in foreign language texts: 1,483 mil., including 258 mil. core and 1,225 mil. collections | ||
- | * Total number of tokens | + | * Total number of word forms in Czech texts: 192 mil., including 102 mil. core and 89 mil. collections |
* A new collection: translations of the Bible (Old and New Testament) in 18 languages | * A new collection: translations of the Bible (Old and New Testament) in 18 languages | ||
* Update of the //Project Syndicate// collection by new texts published in the previous two years | * Update of the //Project Syndicate// collection by new texts published in the previous two years | ||
Line 258: | Line 277: | ||
* first stable version | * first stable version | ||
- | Last update: //8 June 2015// | + | Last update: //14 January 2022// |