AplikaceAplikace
Nastavení

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Next revision
Previous revision
en:cnk:etalon [2021/06/01 21:21] – created Jan Křivanen:cnk:etalon [2021/06/02 19:14] (current) – [Accessing the corpus] Hana Skoumalová
Line 33: Line 33:
 ===== Morphological annotation ===== ===== Morphological annotation =====
  
-The Etalon corpus is segmented, lemmatized, and morphologically annotated in the same way as [[en:cnk:syn2020#annotation_of_syn2020changes_compared_to_other_corpora_of_the_syn_series|SYN2020]]: the corpus contains attributes [[en:cnk:syn2020#multiple_lemmatization_and_tagging_aggregate|word, sforma]], [[en:cnk:syn2020#lemmatization|lemma, sublemma]], [[en:cnk:syn2020#morphological_tagging|tag]] and [[en:cnk:syn2020#verb_tagging_verbtag|verbtag]]. +The Etalon corpus is segmented, lemmatized, and morphologically annotated in the same way as [[en:cnk:syn2020#annotation_of_syn2020changes_compared_to_other_corpora_of_the_syn_series|SYN2020]]: the corpus contains attributes [[en:cnk:syn2020#multiple_lemmatization_and_tagging_aggregate|word, synword]], [[en:cnk:syn2020#lemmatization|lemma, sublemma]], [[en:cnk:syn2020#morphological_tagging|tag]] and [[en:cnk:syn2020#verb_tagging_verbtag|verbtag]]. 
  
 ===== Accessing the corpus ===== ===== Accessing the corpus =====
Line 40: Line 40:
  
   - CNK corpus via the [[en:manualy:kontext:index|Kontext]] interface.   - CNK corpus via the [[en:manualy:kontext:index|Kontext]] interface.
-  - Data in vertical form: this data can be downloaded from the LINDAT / CLARIN repository (for non-commercial use). This data is divided into segments of a maximum of 100 words (without punctuation) and the segments are shuffled.+  - Data in vertical form: this data can be downloaded from the [[http://hdl.handle.net/11234/1-3698|LINDAT/CLARIN]] repository (for non-commercial use). This data is divided into segments of a maximum of 100 words (without punctuation) and the segments are shuffled.
  
 ===== Acknowledgments ===== ===== Acknowledgments =====