Differences
This shows you the differences between two versions of the page.
| en:cnk:etalon [2021/06/01 21:21] – created jankrivan | en:cnk:etalon [2021/06/02 19:14] (current) – [Accessing the corpus] hanaskoumalova | ||
|---|---|---|---|
| Line 33: | Line 33: | ||
| ===== Morphological annotation ===== | ===== Morphological annotation ===== | ||
| - | The Etalon corpus is segmented, lemmatized, and morphologically annotated in the same way as [[en: | + | The Etalon corpus is segmented, lemmatized, and morphologically annotated in the same way as [[en: |
| ===== Accessing the corpus ===== | ===== Accessing the corpus ===== | ||
| Line 40: | Line 40: | ||
| - CNK corpus via the [[en: | - CNK corpus via the [[en: | ||
| - | - Data in vertical form: this data can be downloaded from the LINDAT / CLARIN repository (for non-commercial use). This data is divided into segments of a maximum of 100 words (without punctuation) and the segments are shuffled. | + | - Data in vertical form: this data can be downloaded from the [[http:// |
| ===== Acknowledgments ===== | ===== Acknowledgments ===== | ||
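
The revised "Accessing the corpus" text says the downloadable vertical data is divided into segments of at most 100 words (not counting punctuation) and that the segments are shuffled. A minimal Python sketch of that idea follows. It is not the procedure actually used to prepare the Etalon data: the function names are hypothetical, treating a token as punctuation when all of its characters fall in a Unicode "P" category is an assumption, and so is the choice to keep trailing punctuation in the segment that precedes it.

```python
import random
import unicodedata


def is_punctuation(token):
    """Assumption: a token is punctuation if every character is in a Unicode P* category."""
    return bool(token) and all(
        unicodedata.category(ch).startswith("P") for ch in token
    )


def split_into_segments(tokens, max_words=100):
    """Split a token list into segments holding at most max_words non-punctuation tokens."""
    segments, current, words = [], [], 0
    for tok in tokens:
        is_word = not is_punctuation(tok)
        # Start a new segment only when the next *word* would exceed the limit,
        # so punctuation following the last word stays in the same segment.
        if is_word and words == max_words:
            segments.append(current)
            current, words = [], 0
        current.append(tok)
        if is_word:
            words += 1
    if current:
        segments.append(current)
    return segments


def shuffled_segments(tokens, max_words=100, seed=None):
    """Return the segments in random order; pass a seed for a reproducible order."""
    segments = split_into_segments(tokens, max_words)
    random.Random(seed).shuffle(segments)
    return segments
```

For a list `tokens` of word and punctuation strings, `shuffled_segments(tokens, seed=0)` returns the same segments as `split_into_segments(tokens)`, only reordered. Shuffling at the segment level keeps each segment internally readable while making the original running text hard to reconstruct.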