AplikaceAplikace
Nastavení

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revisionPrevious revision
Next revision
Previous revision
Next revisionBoth sides next revision
en:pojmy:syntakticka_komplexita [2024/05/24 21:11] – [Syntactic Complexity] alexandrrosenen:pojmy:syntakticka_komplexita [2024/07/02 19:51] – [Measures for sentences] alexandrrosen
Line 4: Line 4:
  
 ===== Measures for sentences =====  ===== Measures for sentences ===== 
 +
 +
 +Two measures (maxNPLength and maxNPDepth) concern noun phrases, defined as subtrees headed by words whose upos is NOUN, PNOM or PRON.
 +
 +Except for the mdd measure, punctuation and coordination is excluded. 
  
   * maxNPLength: number of words in the longest noun phrase   * maxNPLength: number of words in the longest noun phrase
   * maxNPDepth: number of embeddings in the noun phrase with the longest chain of embeddings    * maxNPDepth: number of embeddings in the noun phrase with the longest chain of embeddings 
   * sLength: sentence length = no. of words in the sentence (punctuation excluded)   * sLength: sentence length = no. of words in the sentence (punctuation excluded)
-  * subRatio: subordination ratio = (no. of T-units + no. of clauses) / no. of T-units((T-unit is a main clause including all its embedded/dependent clauses. Each top-level clausal conjunct, including any embedded/dependent clauses, counts as a T-unit.))+  * subRatio: subordination ratio = (no. of T-units + no. of subordinate clauses) / no. of T-units((T-unit is a main clause including all its embedded/dependent clauses. Each top-level clausal conjunct, including any embedded/dependent clauses, counts as a T-unit.))
   * maxTreeDepth: maximum number of clause embeddings (coordination does not count)   * maxTreeDepth: maximum number of clause embeddings (coordination does not count)
   * mdd: mean dependency distance: average number of word boundaries between words and their heads   * mdd: mean dependency distance: average number of word boundaries between words and their heads
  
 ===== Measures for texts =====  ===== Measures for texts ===== 
 +
 +The following measures are average values based on the measures for sentences. The mdd value is counted as the average for all words in the text.
 +
 +  * maxNPLengthAvg: average number of words in the longest noun phrase
 +  * maxNPDepthAvg: average number of embeddings in the noun phrase with the longest chain of embeddings 
 +  * sLengthAvg: average sentence length = no. of words in the sentence (punctuation excluded)
 +  * subRatioAvg: average subordination ratio = (no. of T-units + no. of clauses) / no. of T-units
 +  * maxTreeDepthAvg: average maximum number of clause embeddings (coordination does not count)
 +  * mdd: mean dependency distance: average number of word boundaries between words and their heads
 +
 +In addition to syntactic complexity measures each text of sufficient length includes also two measures of [[en:pojmy:lexikalni_bohatost|lexical diversity]].