Differences
This shows you the differences between two versions of the page.
Next revision | Previous revision | ||
en:obc:collocations [2020/02/14 16:42] – created michalskrabal | en:obc:collocations [2020/02/19 14:03] (current) – michalskrabal | ||
---|---|---|---|
Line 1: | Line 1: | ||
===== Lesson 8: Collocations ===== | ===== Lesson 8: Collocations ===== | ||
- | blabla | + | In this lesson, we will focus on [[https:// |
+ | |||
+ | To identify a collocation, | ||
+ | |||
+ | **Searching the corpus** | ||
+ | |||
+ | To search for collocations, | ||
+ | |||
+ | Let’s create a concordance of the word //boy//. Select the OBC from the corpora list and set the query type to //Basic//. Search for the form //boy// and, when the concordance appears, click on // | ||
+ | |||
+ | - **Attribute: | ||
+ | - **Collocation window span:** Specifies the proximity to the key word. The default value is -3 to 3, which means that all words occurring in the first, second and third positions to the left and to the right of the key word will be considered. | ||
+ | - **Minimum collocate frequency in the corpus:** Determines the minimum number of occurrences in the corpus for the word/tag to be included in the collocations list. The default minimum frequency is 3, which means that forms with fewer occurrences will not be included in the list of collocates. | ||
+ | - **Minimum collocate frequency in the span:** Determines how frequently an item should co-occur with the key word for it to be included on the list. | ||
+ | - **Collocation measures:** Here, you can select which association measures will be calculated and employed in the search for collocations and according to which the list should be sorted. | ||
+ | |||
+ | {{: | ||
+ | |||
+ | Once you are satisfied with your selection, you can click on the //Make candidate list// button. It should be noted here that the interface does not provide you with a list of collocations, |
+ | |||
+ | {{: | ||
+ | |||
+ | Try rearranging the list by sorting according to different association measures. We have selected //logDice// for our first sorting value. This measure is based only on the frequency of the node (key word) and the collocate and the frequency of the whole collocation; | ||
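
As a rough illustration of how logDice combines these three frequencies, here is a short Python sketch using the commonly cited definition, logDice = 14 + log2(2·f(node, collocate) / (f(node) + f(collocate))). The function name `log_dice` and the sample frequencies are made up for this example, and the interface may differ in details such as rounding.

```python
import math


def log_dice(f_node, f_collocate, f_pair):
    """logDice score from the node, collocate and co-occurrence frequencies.

    Hypothetical helper; follows the commonly cited formula
    14 + log2(2 * f_pair / (f_node + f_collocate)).
    """
    return 14 + math.log2(2 * f_pair / (f_node + f_collocate))


# Made-up frequencies: node occurs 1000 times, collocate 200 times,
# and the two co-occur 50 times within the chosen window.
score = log_dice(1000, 200, 50)
```

The theoretical maximum of 14 is reached only when the two words always occur together; in this formulation the score does not use the total corpus size.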
+ | |||
+ | <WRAP round help 40%> | ||
+ | **Task:** | ||
+ | |||
+ | Find out which words frequently follow the adjectives //modest// and // | ||
+ | |||
+ | * Make sure you have selected the OBC as your corpus. | ||
+ | * You can use the basic, word form or CQL query types. | ||
+ | * Set the range to 0 to 1 – this way you are looking only for the words which directly follow the node (key word). | ||
+ | * Sort by logDice. | ||
+ | </ | ||
+ | |||
+ | You can find the solution [[en: | ||
+ | |||
+ | And that's it. [[en: |