AplikaceAplikace
Nastavení

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revisionPrevious revision
Next revision
Previous revision
en:pojmy:ud [2022/08/13 12:32] – [Universal Dependencies – UD] Alexandr Rosenen:pojmy:ud [2023/04/04 12:08] (current) – [Other categories] Alexandr Rosen
Line 1: Line 1:
 ====== Universal Dependencies – UD ====== ====== Universal Dependencies – UD ======
  
-[[https://universaldependencies.org|Universal Dependencies]] is a framework for consistent annotation of grammar (parts of speech, morphological features, and syntactic dependencies) across different human languages. UD is an open community effort with over 300 contributors producing nearly 200 treebanks in over 100 languages. A recent version of the [[https://wiki.korpus.cz/doku.php/en:cnk:intercorp|InterCorp]] parallel corpus ([[https://wiki.korpus.cz/doku.php/en:cnk:intercorp:verze13ud|13ud]]) has been annotated in terms of morphological categories, syntactic functions and syntactic structure following the UD guidelines and using the tools developed within the UD project.+[[https://universaldependencies.org|Universal Dependencies]] is a framework for consistent annotation of grammar (parts of speech, morphological features, and syntactic dependencies) across different human languages. UD is an open community effort with over 300 contributors producing nearly 200 treebanks in over 100 languages. A recent version of the [[en:cnk:intercorp|InterCorp]] parallel corpus ([[en:cnk:intercorp:verze13ud|13ud]]) has been annotated in terms of morphological categories, syntactic functions and syntactic structure following the UD guidelines and using the tools developed within the UD project.
  
 General guidelines for annotation are provided on the UD project website ([[https://universaldependencies.org/guidelines.html|UD Guidelines]]), including a detailed description of: General guidelines for annotation are provided on the UD project website ([[https://universaldependencies.org/guidelines.html|UD Guidelines]]), including a detailed description of:
Line 9: Line 9:
  
 Key specifics of the UD annotation as used in InterCorp: Key specifics of the UD annotation as used in InterCorp:
-  * In other releases of InterCorp, word class and morphological categories of a word are specified as the value of the ''tag'' attribute. For most languages, InterCorp release 13ud retains these language-specific tags in the ''xpos'' attribute. However, the UD **word class** and **morphological categories**, denoted uniformly for all languages, are listed separately as values of the ''upos'' and ''feats'' attributes (see below [[en:pojmy:ud#parts_of_speech|Parts of speech]], and [[en:cnk:intercorp:verze13ud#other_categories|Other categories]], respectively). Frequently used morphological categories from the ''feats'' list have been promoted to the status of regular attributes at the same level as ''upos''. This applies, for example, to morphological case, number, gender or person (''case'', ''number'', ''gender'', ''person'').  +  * In other releases of InterCorp, word class and morphological categories of a word are specified as the value of the ''tag'' attribute. For most languages, InterCorp release 13ud retains these language-specific tags in the ''xpos'' attribute. However, the UD **word class** and **morphological categories**, denoted uniformly for all languages, are listed separately as values of the ''upos'' and ''feats'' attributes (see below [[en:pojmy:ud#parts_of_speech|Parts of speech]], and [[en:pojmy:ud#other_categories|Other categories]], respectively). Frequently used morphological categories from the ''feats'' list have been promoted to the status of regular attributes at the same level as ''upos''. This applies, for example, to morphological case, number, gender or person (''case'', ''number'', ''gender'', ''person'').  
-  * For use in KonText, **fused forms** or //aggregates//, ie word forms composed of two or even three syntactic words, were modified as divided tokens. In English it concerns, for example, the forms //isn't// or //cannot//. For more details see [[en:cnk:intercorp:verze13ud#multi-part_tokens|Multi-part tokens]] below. +  * For use in KonText, **fused forms** or //aggregates//, ie word forms composed of two or even three syntactic words, were modified as divided tokens. In English it concerns, for example, the forms //isn't// or //cannot//. For more details see [[en:pojmy:ud#multi-part_tokens|Multi-part tokens]] below. 
-  * Each word is assigned its **syntactic function** (''deprel'' – see [[en:cnk:intercorp:verze13ud#syntactic_functions|Syntactic functions]]) and its syntactic governor in the dependency tree (''head''). To facilitate orientation in the syntactic structure, each word is also annotated with references to important properties of its head (lemma, part of speech and morphological categories), see [[en:cnk:intercorp:verze13ud#references_to_syntactic_heads|References to syntactic head]]. If a content word occurs with a **function word** (eg. preposition, auxiliary verb, subordinate conjunction, determiner), the content word includes some properties of the function word (see [[en:cnk:intercorp:verze13ud#references_to_function_words|References to function words]]). +  * Each word is assigned its **syntactic function** (''deprel'' – see [[en:pojmy:ud#syntactic_functions|Syntactic functions]]) and its syntactic governor in the dependency tree (''head''). To facilitate orientation in the syntactic structure, each word is also annotated with references to important properties of its head (lemma, part of speech and morphological categories), see [[en:pojmy:ud#references_to_syntactic_heads|References to syntactic head]]. If a content word occurs with a **function word** (eg. preposition, auxiliary verb, subordinate conjunction, determiner), the content word includes some properties of the function word (see [[en:pojmy:ud#references_to_function_words|References to function words]]). 
-  * **Annotations between languages ​​differ** in the number of categorial attributes and in links to function words, see {{cnk:intercorp:ud_ic_attributes.pdf | List of attributes by language}}, described below in [[https://wiki.korpus.cz/doku.php/en:cnk:intercorp:verze13ud#description_of_the_list_of_attributes|Description of the list of attributes]]. +  * **Annotations between languages ​​differ** in the number of categorial attributes and in links to function words, see {{cnk:intercorp:ud_ic_attributes.pdf | List of attributes by language}}, described below in [[en:pojmy:ud#description_of_the_list_of_attributes|Description of the list of attributes]]. 
-  * KonText makes supports queries by word class and other morphological categories using the **''Insert tag''** function, which inserts a UD POS (''upos'') and any category from the ''feats'' list into the query. The ''Insert tag'' feature is available for all linguistically annotated languages. +  * KonText supports queries by word class and other morphological categories using the **''Insert tag''** function, which inserts a UD POS (''upos'') and any category from the ''feats'' list into the query. The ''Insert tag'' feature is available for all linguistically annotated languages.
 ===== Morphological annotation ===== ===== Morphological annotation =====
  
Line 51: Line 50:
   * A list of such pairs is the value of the ''feats'' attribute.   * A list of such pairs is the value of the ''feats'' attribute.
   * Categories in the ''feats'' attribute are separated by "|", e.g. the Russian form //школы// /'ʂkolɨ/ 'school' in genitive singular is marked as ''feats=%%"Animacy=Inan|Case=Gen|Gender=Fem|Number=Sing"%%''   * Categories in the ''feats'' attribute are separated by "|", e.g. the Russian form //школы// /'ʂkolɨ/ 'school' in genitive singular is marked as ''feats=%%"Animacy=Inan|Case=Gen|Gender=Fem|Number=Sing"%%''
-  * In an advanced query using the CQL query language each category can be specified separately: the Czech form //moře// 'sea' is one of the answers to the query ''%%[upos="NOUN" & feats="Number=Sing"]%%'' The Russian form is found follwoing the query ''[upos=%%"NOUN"%% & feats=%%"Gender=Fem"%% & feats=%%"Case=Gen"%%]''. The order of categories in the query is irrelevant. +  * In an advanced query using the CQL query language each category can be specified separately: the Czech form //moře// 'sea' is one of the answers to the query ''%%[upos="NOUN" & feats="Number=Sing"]%%'' The Russian form is found following the query ''[upos=%%"NOUN"%% & feats=%%"Gender=Fem"%% & feats=%%"Case=Gen"%%]''. The order of categories in the query is irrelevant. 
   * The value of ''feats'' can also be treated as a string of characters using regular expressions, e.g. ''[upos=%%"NOUN"%% & feats=%%".*Case=Gen.*Gender=Fem.*"%%]''. Here the order of categories in the query should correspond to their order in the corpus. The result is the same in both cases.   * The value of ''feats'' can also be treated as a string of characters using regular expressions, e.g. ''[upos=%%"NOUN"%% & feats=%%".*Case=Gen.*Gender=Fem.*"%%]''. Here the order of categories in the query should correspond to their order in the corpus. The result is the same in both cases.
-  * Some of the categories in ''feats'' are listed also outside the list as **categorial attributes** at the same level as ''upos''. As a result, a query for a singular noun can be simply as follows: ''%%[upos="NOUN" & number="Sing"]%%''. Similarly, the query  for the Russian form ''[upos=%%"NOUN"%% & %%gender="Fem"%% & %%case="Gen"%%]'' gives the same result as the two queries above. Categorial attributes can be also used to generate frequency lists.((Note that for technical reasons the names of the categorial attributes are all in lower case, including names such as ''VerbForm'' (in ''feats''), rendered as ''verb_form'', or ''NumType'', rendered as ''num_type''. The attribute values, such as ''Fem'', retain the initial upper case character, but are enclosed in double quotes, like other non-embedded attributes.))  Such attributes appear on the <fc #f4a460>light brown</fc> background in {{cnk:intercorp:ud_ic_atributy.pdf|Attribute list by language}} or in KonText in the lower part of the table shown in ''View'' / ''Corpus-specific settings...'' .+  * Some of the categories in ''feats'' are listed also outside the list as **categorial attributes** at the same level as ''upos''. As a result, a query for a singular noun can be simply as follows: ''%%[upos="NOUN" & number="Sing"]%%''. Similarly, the query  for the Russian form ''[upos=%%"NOUN"%% & %%gender="Fem"%% & %%case="Gen"%%]'' gives the same result as the two queries above. Categorial attributes can be also used to generate frequency lists.((Note that for technical reasons the names of the categorial attributes are all in lower case, including names such as ''VerbForm'' (in ''feats''), rendered as ''verb_form'', or ''NumType'', rendered as ''num_type''. The attribute values, such as ''Fem'', retain the initial upper case character, but are enclosed in double quotes, like other attribute values outside ''feats''.))  Such attributes appear on the <fc #f4a460>light brown</fc> background in {{cnk:intercorp:ud_ic_attributes.pdf|Attribute list by language}} or in KonText in the lower part of the table shown in ''View'' / ''Corpus-specific settings...'' .
  
 ^ category ^ gloss ^ example values ^ ^ category ^ gloss ^ example values ^
Line 83: Line 82:
   * Some tokens, in the UD parlance called **fused words**, or **aggregates** in some Czech corpus-related literature, consist of multiple parts. These parts correspond to different nodes in the syntactic structure. In English, such tokens represent **contractions**, consisting of a verb and the negative particle such as //isn't// or //cannot//   * Some tokens, in the UD parlance called **fused words**, or **aggregates** in some Czech corpus-related literature, consist of multiple parts. These parts correspond to different nodes in the syntactic structure. In English, such tokens represent **contractions**, consisting of a verb and the negative particle such as //isn't// or //cannot//
   * The orthographic form of these words is preserved in the corpus, the individual parts are separated only in the annotation - e.g. in the value of the ''lemma'' attribute, with the "|" sign as the separator. It is therefore possible to search for them like other words, by typing the full form into the search box in a simple query (e.g. //ses// in Czech, //can't// in English or //byłbym// in Polish), or in the advanced query using the CQL search language give the same strings as the value of the **''word''** attribute .   * The orthographic form of these words is preserved in the corpus, the individual parts are separated only in the annotation - e.g. in the value of the ''lemma'' attribute, with the "|" sign as the separator. It is therefore possible to search for them like other words, by typing the full form into the search box in a simple query (e.g. //ses// in Czech, //can't// in English or //byłbym// in Polish), or in the advanced query using the CQL search language give the same strings as the value of the **''word''** attribute .
-  * In some languages, including English and Czech, a part of the fused token has a different form when occuring in a different context as an orthographically separate word. E.g. //n't//, a part of //isn't//, corresponds to //not//, the Czech auxiliary clitic //s//, a part of //ses//, corresponds to //jsi//. Both variants are represented in the annotation: the **''iword''** attribute shows the original form ''is|n't'' or ''se|s'', while the **''sword''** attribute shows the unabreviated, "reconstructed" version: ''is|not'' or ''se|jsi''.((Aggregates are present in the following languages: ar, ca, cs, de, el, en, es, fi, fr, he, it, pl, pt, tr and uk. A list of all aggregates for a given language is displayed as the frequency distribution of word forms following the query %%[sword = ".|.+"]%%.))+  * In some languages, including English and Czech, a part of the fused token has a different form when occurring in a different context as an orthographically separate word. E.g. //n't//, a part of //isn't//, corresponds to //not//, the Czech auxiliary clitic //s//, a part of //ses//, corresponds to //jsi//. Both variants are represented in the annotation: the **''iword''** attribute shows the original form ''is|n't'' or ''se|s'', while the **''sword''** attribute shows the unabbreviated, "reconstructed" version: ''is|not'' or ''se|jsi''.((Aggregates are present in the following languages: ar, ca, cs, de, el, en, es, fi, fr, he, it, pl, pt, tr and uk. A list of all aggregates for a given language is displayed as the frequency distribution of word forms following the query %%[sword = ".|.+"]%%.))
   * In addition to the English tokens //isn't// (''is|n't'' – ''is|not'') or //cannot// (''can|not''),((The first form, preceding the dash, is the original form, i.e. the value of the ''iword'' attribute, the second form, after the dash, is the reconstructed form, i.e. the value of the ''sword'' attribute. If a parenthesis includes just one form, the two options are identical, or the given language does not provide reconstructed forms.)) in Czech there are tokens such as //abychom// (''a|bychom'' – ''aby|bychom''), //bylas// (''byla|s'' – ''byla|jsi'') or //oč// (''o|č'' – ''o|co''); in German //zur// (''zu|r'' – ''zu|der'') or //am// (''a|m'' – ''an|dem''); in Polish //miałam// (''miała|m''), //żebyś// (''że|by|ś'') or //chciałbym// (''chciał|by|m''); in French //des// (''de|s'' – ''de|les''), //aux// (''au|x'' – ''à|les'') or //auquel// (''au|quel'' – ''à|lequel'').   * In addition to the English tokens //isn't// (''is|n't'' – ''is|not'') or //cannot// (''can|not''),((The first form, preceding the dash, is the original form, i.e. the value of the ''iword'' attribute, the second form, after the dash, is the reconstructed form, i.e. the value of the ''sword'' attribute. If a parenthesis includes just one form, the two options are identical, or the given language does not provide reconstructed forms.)) in Czech there are tokens such as //abychom// (''a|bychom'' – ''aby|bychom''), //bylas// (''byla|s'' – ''byla|jsi'') or //oč// (''o|č'' – ''o|co''); in German //zur// (''zu|r'' – ''zu|der'') or //am// (''a|m'' – ''an|dem''); in Polish //miałam// (''miała|m''), //żebyś// (''że|by|ś'') or //chciałbym// (''chciał|by|m''); in French //des// (''de|s'' – ''de|les''), //aux// (''au|x'' – ''à|les'') or //auquel// (''au|quel'' – ''à|lequel'').
  
Line 98: Line 97:
   * In some languages, some deprels may have **subtypes**. The subtype name follows the colon after the deprel name, e.g. ''acl:relcl'' indicates an attribute expressed by a relative clause. The list below contains only subtypes relevant to English and represented in the corpus. Functions with subtypes for all languages are listed at [[https://universaldependencies.org/u/dep/index.html|Universal Dependency Relations]].   * In some languages, some deprels may have **subtypes**. The subtype name follows the colon after the deprel name, e.g. ''acl:relcl'' indicates an attribute expressed by a relative clause. The list below contains only subtypes relevant to English and represented in the corpus. Functions with subtypes for all languages are listed at [[https://universaldependencies.org/u/dep/index.html|Universal Dependency Relations]].
   * When querying a deprel that may have a subtype, a possible subtype should be taken into account. For example, to find all words with the deprel ''acl'', whether or not the deprel has a subtype, use the expression ''%%deprel="acl.*"%%'' instead of ''%%deprel="acl"%%''. To find all auxiliary verbs, use the expression ''%%deprel="aux.*"%%'' instead of ''%%deprel="aux"%%''. To find all subjects, use the expression ''%%deprel="nsubj.*"%%''.   * When querying a deprel that may have a subtype, a possible subtype should be taken into account. For example, to find all words with the deprel ''acl'', whether or not the deprel has a subtype, use the expression ''%%deprel="acl.*"%%'' instead of ''%%deprel="acl"%%''. To find all auxiliary verbs, use the expression ''%%deprel="aux.*"%%'' instead of ''%%deprel="aux"%%''. To find all subjects, use the expression ''%%deprel="nsubj.*"%%''.
-  * When a queried deprel targets a **coordinated structure**, only the first conjunct is found. The second and subsequent conjuncts are marked as ''%%deprel="conj"%%''. The syntactic function of the entire coordination is thus specified by the ''deprel'' attribute of the first cunjunct, the head of all other conjuncts. To query the "true" deprel of a non-initial conjunct (''%%deprel="conj"%%''), use the ''p_deprel'' attribute. See [[https://wiki.korpus.cz/doku.php/cnk:intercorp:verze13ud#koordinace|Coordination]] below for details.+  * When a queried deprel targets a **coordinated structure**, only the first conjunct is found. The second and subsequent conjuncts are marked as ''%%deprel="conj"%%''. The syntactic function of the entire coordination is thus specified by the ''deprel'' attribute of the first conjunct, the head of all other conjuncts. To query the "true" deprel of a non-initial conjunct (''%%deprel="conj"%%''), use the ''p_deprel'' attribute. See [[en:pojmy:ud#coordination|Coordination]] below for details.
  
  
Line 140: Line 139:
 | **nmod:tmod** | [[https://universaldependencies.org/en/dep/nmod-tmod.html | temporal modifier ]] | //In Plenary **today** I supported the amendment.// | | **nmod:tmod** | [[https://universaldependencies.org/en/dep/nmod-tmod.html | temporal modifier ]] | //In Plenary **today** I supported the amendment.// |
 | **nsubj** | [[https://universaldependencies.org/u/dep/obj.html | nominal subject]] | //**Those** **who** venture upon its currents look for prosperity or fame, even if **they** often founder in its depths.// | | **nsubj** | [[https://universaldependencies.org/u/dep/obj.html | nominal subject]] | //**Those** **who** venture upon its currents look for prosperity or fame, even if **they** often founder in its depths.// |
-| **nsubj:pass** | [[https://universaldependencies.org/u/dep/nsubj-pass.html | nominal subject of poassive clause]] | //The **horses** were adorned with just one red scarf.// | +| **nsubj:pass** | [[https://universaldependencies.org/u/dep/nsubj-pass.html | nominal subject of passive clause]] | //The **horses** were adorned with just one red scarf.// | 
 | **nummod** | [[https://universaldependencies.org/cs/dep/nummod.html | numeric modifier ]] | // Dissolution does but give birth to fresh modes of organization, and **one** death is the parent of a **thousand** lives.// | | **nummod** | [[https://universaldependencies.org/cs/dep/nummod.html | numeric modifier ]] | // Dissolution does but give birth to fresh modes of organization, and **one** death is the parent of a **thousand** lives.// |
 | **obj** | [[https://universaldependencies.org/u/dep/obj.html | object ]] | // But who can stop the **people**? **What** do you mean? I don't know **what** to do. // | | **obj** | [[https://universaldependencies.org/u/dep/obj.html | object ]] | // But who can stop the **people**? **What** do you mean? I don't know **what** to do. // |
Line 157: Line 156:
  
   * In addition to the pointer to its head (''head'' as the word ID of the head, i.e. its word order position within the sentence, or ''parent'' as its position relative to the given word), some other attributes of the head are listed for each token: lemma (''p_lemma''), POS (''p_upos''), morphological category (''p_feats''), and syntactic function (''p_deprel'').   * In addition to the pointer to its head (''head'' as the word ID of the head, i.e. its word order position within the sentence, or ''parent'' as its position relative to the given word), some other attributes of the head are listed for each token: lemma (''p_lemma''), POS (''p_upos''), morphological category (''p_feats''), and syntactic function (''p_deprel'').
-  * A token may also have attributes that specify the properties of a fuction word that depends on the token. For example, the lemma of a preposition is shown by the attribute ''case_lemma'', morphological categories of an auxiliary by ''aux_feats'', morphological categories of a copula by ''cop_feats'', part of speech of a determiner by ''det_upos'', lemma of a marker by ''mark_lemma''.+  * A token may also have attributes that specify the properties of a function word that depends on the token. For example, the lemma of a preposition is shown by the attribute ''case_lemma'', morphological categories of an auxiliary by ''aux_feats'', morphological categories of a copula by ''cop_feats'', part of speech of a determiner by ''det_upos'', lemma of a marker by ''mark_lemma''.
   * Similar means of representing syntactic structure are used by other syntactically annotated corpora available in the KonText browser (e.g. ''syn2020'').   * Similar means of representing syntactic structure are used by other syntactically annotated corpora available in the KonText browser (e.g. ''syn2020'').
  
Line 182: Line 181:
 === Basic query === === Basic query ===
  
-  * A basic query for a word form or phrase is entered in the same way as in previous releases of InterCorp.((In a basic query, it is no longer necessary in some languages to separate parts of the aggregate with a space, eg //był//, //by//, and  //m// of the Polish agglutinated form //byłbym // or //is// and //n't// of the English contraction //isn't//, even in a longer expression (//aren't I//). However, a basic query for //is// or //n't// will not show concordances including the for //isn't//.))+  * A basic query for a word form or phrase is entered in the same way as in previous releases of InterCorp.((In a basic query, it is no longer necessary in some languages to separate parts of the aggregate with a space, eg //był//, //by//, and  //m// of the Polish agglutinated form //byłbym // or //is// and //n't// of the English contraction //isn't//, even in a longer expression (//aren't I//). However, a basic query for //is// or //n't// will not show concordances including the form //isn't//.))
  
 === Query for a lemma and a morphological tag === === Query for a lemma and a morphological tag ===
Line 189: Line 188:
   * According to UD, part of speech and morphological categories are listed separately as values ​​of the attributes ''upos'' and ''feats'', respectively. Their values ​​can be entered using the ''Insert tag'' function.    * According to UD, part of speech and morphological categories are listed separately as values ​​of the attributes ''upos'' and ''feats'', respectively. Their values ​​can be entered using the ''Insert tag'' function. 
   * Parts of speech (''upos'') are the same for all languages. E.g. a query for proper names without using the ''Insert tag'' function can be specified as follows: %%[upos = "PROPN"]%%.   * Parts of speech (''upos'') are the same for all languages. E.g. a query for proper names without using the ''Insert tag'' function can be specified as follows: %%[upos = "PROPN"]%%.
-  * Other morphological categories are listed under the ''feats'' attribute. Some of them are available separately under categorial attributes. For details see [[https://wiki.korpus.cz/doku.php/en:cnk:intercorp:verze13ud#other_categories | Other categories]] above. +  * Other morphological categories are listed under the ''feats'' attribute. Some of them are available separately under categorial attributes. For details see [[en:pojmy:ud#other_categories | Other categories]] above. 
  
 === Query for a part of speech and morphological categories using the menu === === Query for a part of speech and morphological categories using the menu ===
Line 197: Line 196:
 === Query for a syntactic function === === Query for a syntactic function ===
  
-   * Syntactic function is specified for each token as the value of the ''deprel'' attribute (see [[https://wiki.korpus.cz/doku.php/en:cnk:intercorp:verze13ud#syntactic_functions | Syntactic functions ]] above.+   * Syntactic function is specified for each token as the value of the ''deprel'' attribute (see [[en:pojmy:ud#syntactic_functions | Syntactic functions ]] above.
    * E.g. a query to show the occurrences of the verb //run// in the function of the governor of an adnominal clause, is entered as %%[lemma="run" & deprel="acl"]%%. Results include examples such as //Everyone of the rabbits was seized by the instinct **to __run__ away**, to go underground. Some people have the idea that rabbits spend a good deal of their time **__running__ away from foxes**.//    * E.g. a query to show the occurrences of the verb //run// in the function of the governor of an adnominal clause, is entered as %%[lemma="run" & deprel="acl"]%%. Results include examples such as //Everyone of the rabbits was seized by the instinct **to __run__ away**, to go underground. Some people have the idea that rabbits spend a good deal of their time **__running__ away from foxes**.//
  
Line 239: Line 238:
   * The governing verbs can be listed using frequency distribution according to the ''p_lemma'' attribute (in the KonText menu: ''%%Frequency / Custom... / Attribute: p_lemma%%'').   * The governing verbs can be listed using frequency distribution according to the ''p_lemma'' attribute (in the KonText menu: ''%%Frequency / Custom... / Attribute: p_lemma%%'').
   * The query does not assume any specific word order, the noun could also precede the preposition, which would indeed be highly unlikely.   * The query does not assume any specific word order, the noun could also precede the preposition, which would indeed be highly unlikely.
 +
 +
 +=== Verbs taking an indirect object ===
 +
 +<code>[deprel="iobj"]</code>
 +
 +  * [[https://www.korpus.cz/kontext/view?q=~MwKKiaMYIgcg|This query]] finds indirect objects.
 +  * The lemma of the indirect object's head can be listed using frequency distribution according to the attribute ''p_lemma''.
 +
  
 === Direct or indirect objects, also as conjuncts === === Direct or indirect objects, also as conjuncts ===
Line 247: Line 255:
   * Note that for coordinated constituents, a separate concordance is shown for each conjunct.   * Note that for coordinated constituents, a separate concordance is shown for each conjunct.
   * Either the keyword's ''deprel'' denotes the direct or indirect object (''%%deprel="i?obj"%%'', or -- equivalently -- ''%%deprel="obj|iobj"%%''), or the keyword's ''deprel'' is ''conj'' (''%%deprel="conj"%%'') and depends on a direct or indirect object (''%%p_deprel="i?obj"%%''), i.e. it is the non-initial conjunct in a coordinated constituent functioning as direct or indirect object.   * Either the keyword's ''deprel'' denotes the direct or indirect object (''%%deprel="i?obj"%%'', or -- equivalently -- ''%%deprel="obj|iobj"%%''), or the keyword's ''deprel'' is ''conj'' (''%%deprel="conj"%%'') and depends on a direct or indirect object (''%%p_deprel="i?obj"%%''), i.e. it is the non-initial conjunct in a coordinated constituent functioning as direct or indirect object.
 +
 +
 +
  
 === Proper nouns as subjects, also as conjuncts === === Proper nouns as subjects, also as conjuncts ===