Differences
This shows you the differences between two versions of the page.
Next revision | Previous revisionNext revisionBoth sides next revision | ||
en:pojmy:syntakticka_komplexita [2024/05/24 20:28] – created alexandrrosen | en:pojmy:syntakticka_komplexita [2024/05/24 21:11] – [Syntactic Complexity] alexandrrosen | ||
---|---|---|---|
Line 1: | Line 1: | ||
====== Syntactic Complexity ====== | ====== Syntactic Complexity ====== | ||
- | InterCorp release 16ud is annotated by sethe veral measures of syntactic complexity. They are specified as metadata for each sentence and each text, for each linguistically annotated language. | + | InterCorp release 16ud is annotated by several |
+ | ===== Measures for sentences ===== | ||
- | * maxNPLength | + | * maxNPLength: number of words in the longest noun phrase |
- | * maxNPDepth | + | * maxNPDepth: number of embeddings in the noun phrase with the longest chain of embeddings |
- | * sLength | + | * sLength: sentence length = no. of words in the sentence (punctuation excluded) |
- | * subRatio | + | * subRatio: subordination ratio = (no. of T-units + no. of clauses) / no. of T-units((T-unit is a main clause including all its embedded/ |
- | * maxTreeDepth | + | * maxTreeDepth: maximum number of clause embeddings (coordination does not count) |
- | * mdd | + | * mdd: mean dependency distance: average number of word boundaries between words and their heads |
- | + | ||
+ | ===== Measures for texts ===== |