Differences
This shows you the differences between two versions of the page.
Both sides previous revisionPrevious revision | |
en:cnk:klaus [2024/11/21 14:08] – michalkren | en:cnk:klaus [2024/11/21 14:12] (current) – michalkren |
---|
====== Václav Klaus Corpus ====== | ====== Václav Klaus Corpus ====== |
| |
**Václav Klaus Corpus** ('VK') is an author corpus of texts by [[https://en.wikipedia.org/wiki/V%C3%A1clav_Klaus|Václav Klaus]] which was created as a data basis for [[https://dspace.cuni.cz/handle/20.500.11956/191695?locale-attribute=en|the thesis Václav Klaus’ Idiolect: A Corpus-based Analysis]]. The data used for the creation of the corpus were sourced from [[https://www.klaus.cz/|Klaus’ official website]], which contains texts intended primarily for this website, as well as texts originally published elsewhere (e.g. newspaper articles or magazine interviews) or created for specific events (e.g., presidential speeches or lectures at conferences). | **Václav Klaus Corpus** ('VK') is an author corpus of texts by [[https://en.wikipedia.org/wiki/V%C3%A1clav_Klaus|Václav Klaus]] which was created as a data basis for [[https://dspace.cuni.cz/handle/20.500.11956/191695?locale-attribute=en|the thesis Václav Klaus’ Idiolect: A Corpus-based Analysis]]. The data used for the creation of the corpus were sourced from [[https://www.klaus.cz/|his official website]], which contains texts intended primarily for this website, as well as texts originally published elsewhere (e.g. newspaper articles or magazine interviews) or created for specific events (e.g., presidential speeches or lectures at conferences). |
| |
In addition to Klaus’ texts, the website also contains texts for which Klaus is only a co-author (e.g. joint statements) or for which he is not an author (e.g. communications from the press department of the presidential office). However, the 'VK' corpus is an author corpus in the narrower sense and, therefore, does not include these texts. For many texts, especially for a considerable portion of interviews, the mode (written or spoken) cannot be reliably determined. In the case of the spoken texts (the debates and some interviews), the situation is complicated by the apparent editorial modifications of Klaus’ speeches, the extent and nature of which vary considerably from text to text. To preserve the authenticity of the linguistic material, the corpus does not contain texts whose mode could not be clearly identified, nor does it include ‘purely’ spoken texts. The following four conditions can define the texts selected for the corpus: | In addition to Klaus’ texts, the website also contains texts for which Václav Klaus is only a co-author (e.g. joint statements) or for which he is not an author (e.g. communications from the press department of the presidential office). However, the 'VK' corpus is an author corpus in the narrower sense and, therefore, does not include these texts. For many texts, especially for a considerable portion of interviews, the mode (written or spoken) cannot be reliably determined. In the case of the spoken texts (the debates and some interviews), the situation is complicated by the apparent editorial modifications of Klaus’ speeches, the extent and nature of which vary considerably from text to text. To preserve the authenticity of the linguistic material, the corpus does not contain texts whose mode could not be clearly identified, nor does it include ‘purely’ spoken texts. The following four conditions can define the texts selected for the corpus: |
| |
- only texts published on the website www.klaus.cz; | - only texts published on the website www.klaus.cz; |