Differences
This shows you the differences between two versions of the page.
| Both sides previous revisionPrevious revisionNext revision | Previous revision | ||
| en:eebo:collocations [2016/11/10 23:02] – [Finding collocations of a word] kristinavalentinyova | en:eebo:collocations [2018/07/30 14:49] (current) – vaclavcvrcek | ||
|---|---|---|---|
| Line 8: | Line 8: | ||
| Thanks to the corpus linguistics, | Thanks to the corpus linguistics, | ||
| - | Let's try finding collocations of a word such as //bread//. We select EEBO as the corpus we wish to work with and then use a basic query type. After clicking on the search button, concordance lines will appear. //Bread//, a key word, is always located in the middle of the line (higlighted in pink). We then click on the **collocations** button located in the upper menu and select **custom** from the dropdown menu. | + | Let's try finding collocations of a word such as //bread//. We select |
| [{{eebo-9.png? | [{{eebo-9.png? | ||
| Line 40: | Line 40: | ||
| ======= Association measures ======= | ======= Association measures ======= | ||
| - | Association measures are used to identify a collocation. | + | Association measures are used to identify a collocation. |
| ^ Collocate ^ Frequency | ^ Collocate ^ Frequency | ||
| Line 50: | Line 50: | ||
| How can we interpret these results? | How can we interpret these results? | ||
| - | * **MI** prefers words with lower frequency and therefore the results | + | * **MI** prefers words with lower frequency and therefore the results |
| * **T-score** is based on the co-occurrence frequency and therefore the results of T-score and frequency almost coincide. This association measure prefers words with a high frequency and therefore there are mostly grammatical words and punctuation marks in the first positions. Established collocations may be found in the lower positions of the list. | * **T-score** is based on the co-occurrence frequency and therefore the results of T-score and frequency almost coincide. This association measure prefers words with a high frequency and therefore there are mostly grammatical words and punctuation marks in the first positions. Established collocations may be found in the lower positions of the list. | ||
| Line 58: | Line 58: | ||
| * The negative numbers indicate the positions preceding the key word, while the positive ones refer to the right positions. | * The negative numbers indicate the positions preceding the key word, while the positive ones refer to the right positions. | ||
| * Minimum frequency in corpus: establishes minimum overall frequency of a unit in order to be included in the collocate list | * Minimum frequency in corpus: establishes minimum overall frequency of a unit in order to be included in the collocate list | ||
| - | * Minimum frequency in given range: provided that we specified the context span for collocate search from -3 to 3, then the minimum frequency in given range optiom | + | * Minimum frequency in given range: provided that we specified the context span for collocate search from -3 to 3, then the minimum frequency in given range option |
| </ | </ | ||
| <WRAP round help 40%> | <WRAP round help 40%> | ||
| - | Look at the lists of words below. Using the EEBO corpus, find out which collocate with the following three near synonyms: //godly, divine or sacred//? | + | Look at the lists of words below. Using the EEBO corpus, find out which words collocate with the following three near synonyms: //godly, divine or sacred//? |
| </ | </ | ||
| Line 76: | Line 76: | ||
| ^5th collocate |men|Person|humane| | ^5th collocate |men|Person|humane| | ||
| + | ---- | ||
| + | **If you are ready, you can continue to [[en: | ||
| + | |||
| + | ---- | ||