Differences
This shows you the differences between two versions of the page.
Both sides previous revisionPrevious revisionNext revision | Previous revision | ||
en:cnk:net [2019/12/20 14:42] – [NET Corpus] michalkren | en:cnk:net [2021/02/15 09:58] (current) – Fix publication year of NET v1 jeziorsky | ||
---|---|---|---|
Line 3: | Line 3: | ||
====== NET Corpus ====== | ====== NET Corpus ====== | ||
- | NET corpus is the first version of a synchronic corpus of Czech semi-official internet communication. | + | <WRAP right 45%> |
+ | ^ <fs medium> | ||
+ | ^ [[en: | ||
+ | ^ ::: ^ Number of [[en: | ||
+ | ^ ::: ^ Number of [[en: | ||
+ | ^ [[en: | ||
+ | ^ ::: ^ Number of [[en: | ||
+ | ^ ::: ^ Number of paragraphs <p> | 267 026 | 1 817 088 | | ||
+ | ^ ::: ^ Number of sentences <s> | 2 622 636 | 8 905 016 | | ||
+ | ^ Further Information ^ [[en: | ||
+ | ^ ::: ^ [[en: | ||
+ | ^ ::: ^ Year of publication | 2019 | 2021 | | ||
+ | </ | ||
+ | |||
+ | NET corpus is the first version of a synchronic corpus of Czech semi-official internet communication. | ||
==== Discussion forums ==== | ==== Discussion forums ==== | ||
- | This part of the corpus is concentrated on discussion forums run on the phpBB platform. For the time being, there are neither | + | This part of the corpus is concentrated on discussion forums run on the phpBB platform. For the time being, there are neither |
==== Personal blogs ==== | ==== Personal blogs ==== | ||
- | Jedná se většinou o vedlejší součást zpravodajských serverů nebo internetových magazínů | + | Personal blogs have been downloaded mostly from news servers and web magazines where they often form a supplementary part of the main web. There are no corporate or other formal blogs included in the NET corpus. |
+ | |||
+ | ===== Version 2 (2021) ===== | ||
+ | |||
+ | In 2021, version 2 of the NET corpus was published. The covered domains have been updated with fresh data from 2020, and at the same time, the number of blogs and forums has been significantly increased (currently more than 120 domains). This has also increased the overall size and coverage of the NET corpus. | ||
+ | |||
+ | ===== How to cite ===== | ||
+ | |||
+ | <WRAP round tip 70%> | ||
+ | Jeziorský, T.: //NET v1: korpus | ||
+ | |||
+ | Jeziorský, T.: //NET v2: korpus polooficiální internetové komunikace// | ||
+ | |||
+ | </ | ||