AplikaceAplikace
Nastavení

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revisionPrevious revision
Last revisionBoth sides next revision
en:manualy:kontext:kolokace [2016/09/12 16:35] – [Collocation list] Jan Koceken:manualy:kontext:kolokace [2016/11/08 17:07] Veronika Pojarová
Line 3: Line 3:
 [{{ :en:manualy:kontext:kolokace-form.png?direct&300|Form for specification of analysis of collocation candidates }}] [{{ :en:manualy:kontext:kolokace-form.png?direct&300|Form for specification of analysis of collocation candidates }}]
  
-One of the principal properties of [[en:manualy:kontext:index|interface KonText]] is the possibility to use statistical methods to identify [[en:pojmy:kolokace|collocations]] of a wanted word. By collocation, we understand a meaningful, fixed, syntagmatic sequence of two (or more) words in the immediate proximity. A collocation consists of a key word (**node** which  usually is also [[en:pojmy:kwic|KWIC]]) and a contextual word (**collocate**). The  list of collocation candidates with which a wanted word or a phrase collocates forms the basis for corpus analysis, as it enables us to determine what kind of context is typical for a wanted phenomenon.+One of the principal properties of [[en:manualy:kontext:index|interface KonText]] is the possibility to use statistical methods to identify [[wp>Collocation|collocations]] of a wanted word. By collocation, we understand a meaningful, fixed, syntagmatic sequence of two (or more) words in the immediate proximity. A collocation consists of a key word (**node** which  usually is also [[en:pojmy:kwic|KWIC]]) and a contextual word (**collocate**). The  list of collocation candidates with which a wanted word or a phrase collocates forms the basis for corpus analysis, as it enables us to determine what kind of context is typical for a wanted phenomenon.
  
  
Line 13: Line 13:
   - ** In the range from - to**: specification of the contextual span (in the proximity of [[en:pojmy:kwic|KWIC]]) where the collocates will be searched for (negative numbers indicate the positions preceding KWIC, while the positive ones follow KWIC, cf. [[en:manualy:kontext:frekvencni_distribuce#frekvencni_distribuce_podle_pozicnich_atributu|frequency distribution]]))   - ** In the range from - to**: specification of the contextual span (in the proximity of [[en:pojmy:kwic|KWIC]]) where the collocates will be searched for (negative numbers indicate the positions preceding KWIC, while the positive ones follow KWIC, cf. [[en:manualy:kontext:frekvencni_distribuce#frekvencni_distribuce_podle_pozicnich_atributu|frequency distribution]]))
   - **Minimum frequency in corpus**: establishes minimum overall frequency of a unit in order to be included in the collocate list (provided that the minimum frequency is set on 5, the collocate of lemma //dřevo// cannot be those items that occur in the whole corpus less than 5 times)   - **Minimum frequency in corpus**: establishes minimum overall frequency of a unit in order to be included in the collocate list (provided that the minimum frequency is set on 5, the collocate of lemma //dřevo// cannot be those items that occur in the whole corpus less than 5 times)
-  - **Minimum frequency in given range**: provided that we specified the context span for collocate search from -3 to 3, then the minimum frequency in given range optiom determines how frequently should an item co-occur with KWIC to be included in the collocate list (when calculating the association measures only those items will be taken into consideration which occur at least 3 times in the proximity of KWIC, lemma //dřevo// in our examle)+  - **Minimum frequency in given range**: provided that we specified the context span for collocate search from -3 to 3, then the minimum frequency in given range option determines how frequently should an item co-occur with KWIC to be included in the collocate list (when calculating the association measures only those items will be taken into consideration which occur at least 3 times in the proximity of KWIC, lemma //dřevo// in our example)
   - **Show functions**: which association measures will be calculated and listed for each of the collocates that  the conditions specified above are met   - **Show functions**: which association measures will be calculated and listed for each of the collocates that  the conditions specified above are met
   - **Sort by**: according to which of the association measures will the list be sorted (especially useful for the long lists)   - **Sort by**: according to which of the association measures will the list be sorted (especially useful for the long lists)