Differences
This shows you the differences between two versions of the page.
Next revision | Previous revision | ||
en:cnk:online [2020/10/13 17:57] – created vaclavcvrcek | en:cnk:online [2022/12/22 15:25] (current) – [Generations of ONLINE corpora] vaclavcvrcek | ||
---|---|---|---|
Line 1: | Line 1: | ||
+ | ~~NOTOC~~ | ||
====== ONLINE corpora ====== | ====== ONLINE corpora ====== | ||
+ | ONLINE corpora together create a monitor corpus of the dynamic content of the Czech web, i.e. predominantly internet journalism, to some extent also discussions, | ||
+ | The key feature of the ONLINE corpora are regular updates. This means that their contents **change continually**, | ||
+ | |||
+ | The corpus is annotated using standard tools for the [[en: | ||
+ | |||
+ | |||
+ | ===== Generations of ONLINE corpora ===== | ||
+ | |||
+ | There are two generations of ONLINE corpora: | ||
+ | |||
+ | ^ Generation ^ Corpus name ^ Period covered ^ Composition ^ Year of publication ^ | ||
+ | | 1. | [[en: | ||
+ | | 2. | [[en: | ||
+ | |||
+ | |||
+ | The ONLINE corpora are disjunctive, | ||
+ | |||
+ | <WRAP round info 80%> | ||
+ | **Note on backwards compatibility: | ||
+ | |||
+ | Saved queries on the 1st generation ONLINE corpora (i.e. ONLINE_NOW and ONLINE_ARCHIVE) may not work after the 2nd generation is published (among other things due to change of corpus name). However, the ONLINE1 corpus contains all the texts of this previous generation and by replicating queries on it, it should be possible to arrive at the same results. | ||
+ | </ |