Differences
This shows you the differences between two versions of the page.
en:cnk:etalon [2021/06/01 21:21] – created Jan Křivan | en:cnk:etalon [2021/06/02 19:14] (current) – [Accessing the corpus] Hana Skoumalová | ||
---|---|---|---|
Line 33: | Line 33: | ||
===== Morphological annotation ===== | ===== Morphological annotation ===== | ||
- | The Etalon corpus is segmented, lemmatized, and morphologically annotated in the same way as [[en: | + | The Etalon corpus is segmented, lemmatized, and morphologically annotated in the same way as [[en: |
===== Accessing the corpus ===== | ===== Accessing the corpus ===== | ||
Line 40: | Line 40: | ||
- CNK corpus via the [[en: | - CNK corpus via the [[en: | ||
- | - Data in vertical form: this data can be downloaded from the LINDAT / CLARIN repository (for non-commercial use). This data is divided into segments of a maximum of 100 words (without punctuation) and the segments are shuffled. | + | - Data in vertical form: this data can be downloaded from the [[http:// |
===== Acknowledgments ===== | ===== Acknowledgments ===== |
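
The vertical-data item above says the downloadable data is divided into segments of at most 100 words (punctuation not counted) and that the segments are shuffled. The following is a minimal sketch of what such a procedure could look like, not the procedure actually used to prepare the corpus. The function names, the Unicode-category test for punctuation, and the fixed random seed are all assumptions made for illustration.

```python
import random
import unicodedata


def is_punctuation(token):
    """Assumed test: a token is punctuation if every character is Unicode punctuation."""
    return bool(token) and all(
        unicodedata.category(ch).startswith("P") for ch in token
    )


def split_into_segments(tokens, max_words=100):
    """Split a token list into segments of at most max_words non-punctuation tokens.

    Punctuation tokens do not count toward the limit and stay attached to the
    segment of the word they follow.
    """
    segments = []
    current = []
    words = 0
    for tok in tokens:
        # Start a new segment only when a further word would exceed the limit.
        if not is_punctuation(tok) and words == max_words:
            segments.append(current)
            current, words = [], 0
        current.append(tok)
        if not is_punctuation(tok):
            words += 1
    if current:
        segments.append(current)
    return segments


def shuffle_segments(segments, seed=0):
    """Return a shuffled copy of the segments (seeded, so the order is reproducible)."""
    rng = random.Random(seed)
    shuffled = list(segments)
    rng.shuffle(shuffled)
    return shuffled
```

Shuffling at the segment level keeps each segment internally intact while making it impossible to read the original texts in sequence, which is consistent with distributing the data for non-commercial use only.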