
Differences

Previous revision: en:pojmy:lexikalni_bohatost [2024/08/28 23:33] alexandrrosen
Current revision: en:pojmy:lexikalni_bohatost [2024/09/08 14:25] (current) alexandrrosen
Line 12:
 The measures are based on the type-token ratio. They give the average number of distinct types (word forms or lemmas) in a moving window of 1000 tokens. If the text has fewer than 1000 tokens, the measures are undefined and both attributes take the underscore character (''_'') as their value.
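
The moving-window computation described above can be sketched as follows. This is a minimal illustration, not the corpus's actual implementation: the function name `moving_window_types`, the step size of one token, and the plain list-of-strings input are all assumptions. It returns the average count of distinct types per window; dividing that value by the window size would give the corresponding type-token ratio.

```python
from collections import Counter


def moving_window_types(tokens, window=1000):
    """Average number of distinct types over every window of `window` tokens.

    Hypothetical helper: assumes the window slides one token at a time.
    Returns "_" when the text is shorter than a single window.
    """
    if len(tokens) < window:
        return "_"  # measure undefined for short texts

    # Count the types in the first window.
    counts = Counter(tokens[:window])
    total = len(counts)
    n_windows = 1

    # Slide the window: drop the outgoing token, add the incoming one.
    for i in range(window, len(tokens)):
        outgoing = tokens[i - window]
        counts[outgoing] -= 1
        if counts[outgoing] == 0:
            del counts[outgoing]
        counts[tokens[i]] += 1
        total += len(counts)
        n_windows += 1

    return total / n_windows


# Toy example with a window of 3 tokens instead of 1000.
print(moving_window_types("a b a c b a".split(), window=3))
```

Passing word forms yields the form-based measure and passing lemmas yields the lemma-based one; the incremental update keeps the cost linear in the length of the text.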
      
 +===== References =====
 +
 +[[https://docs.google.com/document/d/1nSPzyhT6oHKUDN8A_uYmWrZH6tAmxTH_pUMOdjg01Eg/edit?usp=sharing|InterCorp a Universal Dependencies: nové možnosti výzkumu]] (workshop held on 20 and 27 March 2024 as part of the Theoretical and Methodological Seminar of the Institute of Czech Language and Theory of Communication)
 +
 +[[https://drive.google.com/file/d/1L9yTjj0bTrGgf8lDcOAsJoJOoeYEoPEm/view?usp=sharing|Exploring InterCorp v16ud: the potential of a multilingual parallel treebank with complexity and diversity metrics]] (slides from the seminar at the University of Warsaw, 10 July 2024)