Differences

This shows you the differences between two versions of the page.

--- en:manualy:kwords [2023/04/05 17:32] – old revision restored (2021/03/09 15:39) michalkren
+++ en:manualy:kwords [2023/11/13 09:51] – [KWords] vaclavcvrcek
@@ Line 1: / Line 1: @@
 ====== KWords ======
-{{ :manualy:k-words_logo.png?nolink&200|}}
+{{ :manualy:kwords_logo_v2.png?nolink&|}}
 The KWords application is used for the analysis of texts based on their comparison with the general usage ([[en:pojmy:referencni|reference]] corpus). Its aim is to identify so-called [[en:pojmy:keyword|keywords]], which are [[en:pojmy:word|word forms]] appearing in the inspected text with a significantly higher frequency than in the reference corpus which should reflect the common usage. These key words serve as a basis for textual analysis and interpretation.
@@ Line 7: / Line 7: @@
 KWords is an online application (the only thing we need to use it is a web browser) and it is accessible without  [[en:kurz:zaciname|registration]] to all users at  **[[http://kwords.korpus.cz|kwords.korpus.cz]]**.
-The KWords applcation was originally created for the purpose of analyzing political speeches, and is being developed further in cooperation with [[http://www.brown.edu|Brown University]]. It is currently implemented for the analysis of Czech and English texts of up to approx. 20 thousand words.
+The first version of KWords was developed for the purpose of analyzing political speeches in collaboration with [[http://www.brown.edu|Brown University]]. The second version was developed as part of the [[https://threat-defuser.org|Threat-defuser project]]. This version supports more than 30 languages and allows keyword analysis as well as keymorph analysis.((see Fidler, M. - Cvrček, V.: [[https://doi.org/10.1515/cllt-2016-0073|Keymorph analysis, or how morphosyntax informs discourse]]. Corpus Linguistics and Linguistic Theory. 15/1, p. 39–70.))
 ===== Prominent units =====
@@ Line 37: / Line 37: @@
 ==== Thematic concentration ====
-Words which are highlighted in <html><span style="background-color: yellow">yellow</span></html> in the analyzed text are those which bear thematic concentration (TC words). They are not identified through comparison with a reference corpus, but only by their placement in the frequency distribution of the units in the analyzed text: when we arrange all the words in the text from those which are most frequent and down to words which appear only once, we get a so-called [[en:pojmy:zipf|Zipf]] distribution. In this distribution we are looking for a so-called //h// point, for which we can say that rank = frequency (e.g. 32nd most frequent word has a frequency of 32 occurrences). All autosemantic words (bearing meaning independent of context) above this point (i.e. in our case with a frequency higher than 32) we label thematic concentration. More details and a specific application of this approach to literary texts can be found for example in the article of [[http://www.cechradek.cz/publ/2013_Davidova_Cech_Tematicka_koncentrace_Jehlicka_NR.pdf|R. Čech]] (2013).
+Words which are highlighted in yellow in the analyzed text are those which bear thematic concentration (TC words). They are not identified through comparison with a reference corpus, but only by their placement in the frequency distribution of the units in the analyzed text: when we arrange all the words in the text from those which are most frequent and down to words which appear only once, we get a so-called [[en:pojmy:zipf|Zipf]] distribution. In this distribution we are looking for a so-called //h// point, for which we can say that rank = frequency (e.g. 32nd most frequent word has a frequency of 32 occurrences). All autosemantic words (bearing meaning independent of context) above this point (i.e. in our case with a frequency higher than 32) we label thematic concentration. More details and a specific application of this approach to literary texts can be found for example in the article of [[http://www.cechradek.cz/publ/2013_Davidova_Cech_Tematicka_koncentrace_Jehlicka_NR.pdf|R. Čech]] (2013).
 ===== How it works =====
@@ Line 57: / Line 57: @@
 ===== Application images =====
-[{{:kurz:kwords-vstup.png?direct&300|Inputting text into KWords}}]
+{{:manualy:kwords2.png?direct&400 |}}
-[{{:kurz:kwords-vystup.png?direct&300|Analyzed text with highlighted keywords}}]
+{{:manualy:kwords2_nastaveni.png?direct&400 |}}
-[{{:kurz:kwords-tab.png?direct&300|List of keywords}}]
+{{:manualy:kwords2_klicova_slova.png?direct&400|}}
-[{{:kurz:kwords-distrib.png?direct&300|Distribution of keywords throughout the analyzed text}}]
+{{:manualy:kwords2_graf.png?direct&400 |}}
-[{{:kurz:kwords-links.png?direct&300|Mutual relations between keywords (keyword links)}}]
+{{:manualy:kwords2_distribuce.png?direct&400 |}}
-[{{:kurz:kwords-comp.png?direct&300|Comparison of several speeches -- multi-analysis}}]
+{{:manualy:kwords2_konkordance.png?direct&400 |}}
+{{:manualy:kwords2_links.png?direct&400|}}
+===== Application images (previous version)=====
+[{{:kurz:kwords-vstup.png?direct&400 |Inputting text into KWords}}]
+[{{:kurz:kwords-vystup.png?direct&400 |Analyzed text with highlighted keywords}}]
+[{{:kurz:kwords-tab.png?direct&400|List of keywords}}]
+[{{:kurz:kwords-distrib.png?direct&400 |Distribution of keywords throughout the analyzed text}}]
+[{{:kurz:kwords-links.png?direct&400 |Mutual relations between keywords (keyword links)}}]
+[{{:kurz:kwords-comp.png?direct&400|Comparison of several speeches -- multi-analysis}}]
 ==== Related links  ====

Trace:

Differences

Search

Navigation

Print/export

Tools

Languages

Licence