AplikaceAplikace
Nastavení

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revisionPrevious revision
Next revision
Previous revision
en:cnk:online [2022/12/22 12:35] vaclavcvrceken:cnk:online [2022/12/22 15:25] (current) – [Generations of ONLINE corpora] vaclavcvrcek
Line 13: Line 13:
 There are two generations of ONLINE corpora: There are two generations of ONLINE corpora:
  
-^ Generation ^ Corpus name ^ Time span ^ Composition ^ Year of publication ^+^ Generation ^ Corpus name ^ Period covered ^ Composition ^ Year of publication ^
 |  1.      | [[en:cnk:online:gen1|ONLINE1]] | January 2017 – March 2021 | online journalism, social media, discussions, forums |  2020 | |  1.      | [[en:cnk:online:gen1|ONLINE1]] | January 2017 – March 2021 | online journalism, social media, discussions, forums |  2020 |
 |  2.      | [[en:cnk:online:gen2|ONLINE2_NOW, ONLINE2_ARCHIVE]] | April 2021 – present | online journalism |  2022 | |  2.      | [[en:cnk:online:gen2|ONLINE2_NOW, ONLINE2_ARCHIVE]] | April 2021 – present | online journalism |  2022 |
Line 20: Line 20:
 The ONLINE corpora are disjunctive, i.e. there is no intersection. Therefore, for searching in the whole time period since 2017, the results of queries on both corpora can simply be joined together, no manual corrections are needed. As both corpora are identical in their structure and annotation, the following description does not distinguish between them. The ONLINE corpora are disjunctive, i.e. there is no intersection. Therefore, for searching in the whole time period since 2017, the results of queries on both corpora can simply be joined together, no manual corrections are needed. As both corpora are identical in their structure and annotation, the following description does not distinguish between them.
  
 +<WRAP round info 80%>
 +**Note on backwards compatibility:**
  
 +Saved queries on the 1st generation ONLINE corpora (i.e. ONLINE_NOW and ONLINE_ARCHIVE) may not work after the 2nd generation is published (among other things due to change of corpus name). However, the ONLINE1 corpus contains all the texts of this previous generation and by replicating queries on it, it should be possible to arrive at the same results. 
 +</WRAP>