Differences
This shows you the differences between two versions of the page.
en:pojmy:syntakticka_komplexita [2024/09/30 18:29] – [Measures for sentences] alexandrrosen | en:pojmy:syntakticka_komplexita [2024/10/18 20:39] (current) – [References] alexandrrosen | ||
---|---|---|---|
Line 1: | Line 1: | ||
====== Syntactic Complexity ====== | ====== Syntactic Complexity ====== | ||
- | InterCorp release 16ud is annotated by several measures of syntactic complexity. They are specified as metadata for each sentence and each text, for each linguistically annotated language. In KonText, they can be displayed and queried like any other metadata items, such as author or sentence ID. | + | InterCorp release 16ud is annotated by several measures of syntactic complexity. They are specified as metadata for each sentence and each text, for each linguistically annotated language. In KonText, they can be displayed and queried like any other metadata items, such as text author or sentence ID. |
In addition to the syntactic complexity measures, each text of sufficient length also includes two measures of **[[en: | In addition to the syntactic complexity measures, each text of sufficient length also includes two measures of **[[en: | ||
Line 14: | Line 14: | ||
* **maxNPDepth**: | * **maxNPDepth**: | ||
* For a bare head, the measure equals 0. | * For a bare head, the measure equals 0. | ||
- | * Function words (such as determiners or postpositions) introduce an additional level of embedding. | + | * Function words (such as determiners or prepositions) introduce an additional level of embedding. |
* Punctuation is ignored. | * Punctuation is ignored. | ||
* Coordination does not introduce an additional level of embedding. | * Coordination does not introduce an additional level of embedding. | ||
Line 36: | Line 36: | ||
===== Measures for texts ===== | ===== Measures for texts ===== | ||
- | The following measures are average values based on the measures for sentences. The mdd value is counted as the average for all words in the text. | + | The following measures are average values based on the measures for sentences. The **mdd** value is counted as the average for all words in the text. Average values for all combinations of a language and a text type in InterCorp v16ud are shown in the table [[https:// |
* **maxNPLengthAvg**: | * **maxNPLengthAvg**: | ||
Line 123: | Line 123: | ||
Jagaiah, T., Olinghouse, N.G. & Kearns, D.M. (2020). Syntactic complexity measures: variation by genre, grade-level, | Jagaiah, T., Olinghouse, N.G. & Kearns, D.M. (2020). Syntactic complexity measures: variation by genre, grade-level, | ||
- | [[https://docs.google.com/document/d/1nSPzyhT6oHKUDN8A_uYmWrZH6tAmxTH_pUMOdjg01Eg/edit? | + | Rosen, A. (2024): Lexical and syntactic variability |
+ | of languages and text genres – a corpus-based study. | ||
- | [[https:// | + | |
+ | Rosen, A. (2024). | ||
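
The "Measures for texts" hunk states that text-level measures are averages of the sentence-level ones, while **mdd** is averaged over all words in the text. The following is a minimal sketch of that distinction, not the InterCorp implementation: the record layout, the `word_distances` field (one value per word), and the sample numbers are assumptions made for illustration.

```python
from statistics import mean

# Hypothetical per-sentence records; field names and values are illustrative only.
sentences = [
    {"maxNPLength": 4, "word_distances": [1, 2, 1, 3]},
    {"maxNPLength": 2, "word_distances": [1, 1]},
]

def text_averages(sentences):
    """Compute text-level values from sentence-level records."""
    # Sentence-based measure: plain average over sentences.
    max_np_length_avg = mean(s["maxNPLength"] for s in sentences)
    # mdd: pool the per-word values of all sentences, then average over words,
    # so longer sentences weigh more than shorter ones.
    all_distances = [d for s in sentences for d in s["word_distances"]]
    return {"maxNPLengthAvg": max_np_length_avg, "mdd": mean(all_distances)}

result = text_averages(sentences)
# For comparison only: the average of per-sentence means, which generally differs.
mean_of_sentence_means = mean(mean(s["word_distances"]) for s in sentences)
```

With this sample, the word-level average and the average of per-sentence means differ, which is why the wording "for all words in the text" matters.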