Differences
This shows you the differences between two versions of the page.
Both sides previous revisionPrevious revisionNext revision | Previous revision | ||
en:eebo:collocations [2016/11/10 23:02] – [Finding collocations of a word] kristinavalentinyova | en:eebo:collocations [2018/07/30 14:49] (current) – vaclavcvrcek | ||
---|---|---|---|
Line 8: | Line 8: | ||
Thanks to the corpus linguistics, | Thanks to the corpus linguistics, | ||
- | Let's try finding collocations of a word such as //bread//. We select EEBO as the corpus we wish to work with and then use a basic query type. After clicking on the search button, concordance lines will appear. //Bread//, a key word, is always located in the middle of the line (higlighted in pink). We then click on the **collocations** button located in the upper menu and select **custom** from the dropdown menu. | + | Let's try finding collocations of a word such as //bread//. We select |
[{{eebo-9.png? | [{{eebo-9.png? | ||
Line 40: | Line 40: | ||
======= Association measures ======= | ======= Association measures ======= | ||
- | Association measures are used to identify a collocation. | + | Association measures are used to identify a collocation. |
^ Collocate ^ Frequency | ^ Collocate ^ Frequency | ||
Line 50: | Line 50: | ||
How can we interpret these results? | How can we interpret these results? | ||
- | * **MI** prefers words with lower frequency and therefore the results | + | * **MI** prefers words with lower frequency and therefore the results |
* **T-score** is based on the co-occurrence frequency and therefore the results of T-score and frequency almost coincide. This association measure prefers words with a high frequency and therefore there are mostly grammatical words and punctuation marks in the first positions. Established collocations may be found in the lower positions of the list. | * **T-score** is based on the co-occurrence frequency and therefore the results of T-score and frequency almost coincide. This association measure prefers words with a high frequency and therefore there are mostly grammatical words and punctuation marks in the first positions. Established collocations may be found in the lower positions of the list. | ||
Line 58: | Line 58: | ||
* The negative numbers indicate the positions preceding the key word, while the positive ones refer to the right positions. | * The negative numbers indicate the positions preceding the key word, while the positive ones refer to the right positions. | ||
* Minimum frequency in corpus: establishes minimum overall frequency of a unit in order to be included in the collocate list | * Minimum frequency in corpus: establishes minimum overall frequency of a unit in order to be included in the collocate list | ||
- | * Minimum frequency in given range: provided that we specified the context span for collocate search from -3 to 3, then the minimum frequency in given range optiom | + | * Minimum frequency in given range: provided that we specified the context span for collocate search from -3 to 3, then the minimum frequency in given range option |
</ | </ | ||
<WRAP round help 40%> | <WRAP round help 40%> | ||
- | Look at the lists of words below. Using the EEBO corpus, find out which collocate with the following three near synonyms: //godly, divine or sacred//? | + | Look at the lists of words below. Using the EEBO corpus, find out which words collocate with the following three near synonyms: //godly, divine or sacred//? |
</ | </ | ||
Line 76: | Line 76: | ||
^5th collocate |men|Person|humane| | ^5th collocate |men|Person|humane| | ||
+ | ---- | ||
+ | **If you are ready, you can continue to [[en: | ||
+ | |||
+ | ---- | ||