AplikaceAplikace
Nastavení

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revisionPrevious revision
Next revision
Previous revision
Last revisionBoth sides next revision
en:obc:intro_to_metadata [2020/02/19 12:22] michalskrabalen:obc:intro_to_metadata [2020/02/27 12:22] jankocek
Line 54: Line 54:
 Verbs in the progressive passive tense are formed by the auxiliary verb //be// followed by the present participle form //being// plus the past participle of a full verb, e.g. //I am being watched//, //the house was being built.// Searching for such constructions is done best by the use of tags (see [[en:obc:spell3|Lesson 4]]). Verbs in the progressive passive tense are formed by the auxiliary verb //be// followed by the present participle form //being// plus the past participle of a full verb, e.g. //I am being watched//, //the house was being built.// Searching for such constructions is done best by the use of tags (see [[en:obc:spell3|Lesson 4]]).
  
-For the auxiliary verb, we need to search for //am//, //are//, //is//, ''was ''and //were// (if we wish to include both present and past progressive passive) – tagged as VBM, VBR, VBZ, VBDZ and VBDR respectively. The tags should be used in the query instead of the full forms of the verbs, as the tags encompass the contracted forms as well as any unusual spellings which you would not be able to find just by searching for the full forms. In this case, it is not advisable to use all tags starting with V (using e.g. “V.*” expression), as the concordance would then include other verb forms as well. Rather, it is necessary to type out all the tags and separate them with the vertical bar |, which can be used inside the token:+For the auxiliary verb, we need to search for //am//, //are//, //is//, ''was ''and //were// (if we wish to include both present and past progressive passive) – tagged as VBM, VBR, VBZ, VBDZ and VBDR respectively. The tags should be used in the query instead of the full forms of the verbs, as the tags encompass the contracted forms as well as any unusual spellings which you would not be able to find just by searching for the full forms. In this case, it is not advisable to use all tags starting with V (using e.g. ''“V.*”'' expression), as the concordance would then include other verb forms as well. Rather, it is necessary to type out all the tags and separate them with the vertical bar |, which can be used inside the token:
  
 ''[tag="VBM|VBR|VBZ|VBDZ|VBDR"]'' ''[tag="VBM|VBR|VBZ|VBDZ|VBDR"]''
Line 68: Line 68:
 ''[tag="VBM|VBR|VBZ|VBDZ|VBDR"] [word="being"] [tag="VVN.*"]'' ''[tag="VBM|VBR|VBZ|VBDZ|VBDR"] [word="being"] [tag="VVN.*"]''
  
-If you wish to see an overview of the structural attributes of the whole concordance along with their frequencies, click on **Frequency → Text Types**.+If you wish to see an overview of the structural attributes of the whole concordance along with their frequencies, click on //Frequency → Text Types//.
  
 {{:en:obc:l5_1.png?direct&600|}} {{:en:obc:l5_1.png?direct&600|}}
Line 76: Line 76:
 {{:en:obc:l5_2.png?direct&400|}} {{:en:obc:l5_2.png?direct&400|}}
  
-It is important to note here, that some of the utterances are not tagged fully; in this case, there are 48 utterances that are missing the information about the decade in which they were written. You can use the negative filter (p/**n**) to discard them and work only with the fully annotated data.+It is important to note here, that some of the utterances are not tagged fully; in this case, there are 48 utterances that are missing the information about the decade in which they were written. You can use the negative filter (p///n//) to discard them and work only with the fully annotated data.
  
 By clicking on the header of each column, you can change the sorting – alphabetically according to the labels of that attribute (here decades), according to the frequency or i.p.m. Here i.p.m. (Items Per Million) indicates the relative frequency of the given form in relation to the overall size of the part of the corpus tagged with the respective value of the structural attribute (e.g. in this case the number of occurrences per million tokens in each decade). The relative frequency allows for comparison of the number of occurrences in differently-sized parts of the corpus. By clicking on the header of each column, you can change the sorting – alphabetically according to the labels of that attribute (here decades), according to the frequency or i.p.m. Here i.p.m. (Items Per Million) indicates the relative frequency of the given form in relation to the overall size of the part of the corpus tagged with the respective value of the structural attribute (e.g. in this case the number of occurrences per million tokens in each decade). The relative frequency allows for comparison of the number of occurrences in differently-sized parts of the corpus.
Line 86: Line 86:
 {{:en:obc:l5_3.png?direct&400|}} {{:en:obc:l5_3.png?direct&400|}}
  
-Here you can see all the information available for the given utterance. As was mentioned above, some information may be missing. You can access the whole text of the proceeding including the scan of the original publication by clicking on the link under **text.url**.+Here you can see all the information available for the given utterance. As was mentioned above, some information may be missing. You can access the whole text of the proceeding including the scan of the original publication by clicking on the link under //text.url//.
  
 <WRAP round help 40%> <WRAP round help 40%>
Line 97: Line 97:
 </WRAP> </WRAP>
  
-Solution:+You will find the solution [[en:obc:solution#lesson_5|here]]. 
  
-[[https://kontext.korpus.cz/view?q=~jIj4JkJbEVZs|Split infinitive]]:+----
  
-Query: ''[word="to"[tag="RR"[tag="VVI"]''+**If you are ready, you can continue to [[en:obc:specific_query|Lesson 6]].**
  
-**Frequency → Text Types** +----
- +
-{{:en:obc:l5_5.png?direct&400|}} +
- +
-{{:en:obc:l5_4.png?direct&400|}} +
- +
-[[https://kontext.korpus.cz/view?q=~tAru1r2aLncK|Double comparative]]: +
- +
-Query: ''[word="more"] [tag="JJR"]'' +
- +
-**Frequency → Text Types** +
- +
-{{:en:obc:l5_6.png?direct&400|}} +
- +
-{{:en:obc:l5_7.png?direct&400|}} +
- +
-You will find solution here.  +
- +
-You can now proceed to [[en:obc:specific_query|Lesson 6]].+