AplikaceAplikace
Nastavení

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revisionPrevious revision
Next revisionBoth sides next revision
en:cnk:czesl-man [2020/11/17 15:13] alexandrrosenen:cnk:czesl-man [2020/11/17 15:27] alexandrrosen
Line 1: Line 1:
 +====== CzeSL-man – a corpus of non-native Czech with manual error annotation in a simplified tiered scheme ======
 +
 +//CzeSL-man// is the name used in the search interface [[http://www.korpus.cz/kontext|KonText]] for //CzeSL-man v1 searchable//, a corpus including annotated texts of non-native speakers of Czech. It is part of the texts from the [[cnk:czesl-sgt|CzeSL-SGT]] corpus. The corpus in the format of the [[https://bitbucket.org/czesl/feat/|feat]] annotation editor can be downloaded under the name //CzeSL-man v1 downloadable// from [[https://bitbucket.org/czesl/czesl-man/|here]].
 +
 +The manual error annotation used in //CzeSL-man v1 searchable// is a simplified version of a two-stage annotation scheme created for the [[http://utkl.ff.cuni.cz/learncorp/|CzeSL]] project. A consequence of the simplification is the reversal of the source text and its annotation. The base text is a corrected version of the original text. The words of the corrected version are therefore tokens of this corpus. The original text is available in the annotation. Not all words from the original text are retained and their order may be affected by the word order of the correction.
 +
 +The annotation also contains types of errors and – for the corrected text – morphosyntactic categories, lemmas, dependency syntactic structure and functions. The texts are also equipped with metadata about the author and the text.
 +
 +For more information on the //CzeSL// learner corpus project, including an overview of all versions of the //CzeSL// learner corpus with links to search or download options, see [[http://utkl.ff.cuni.cz/learncorp/|CzeSL – a Learner Corpus of Czech]] and [[https://dspace.cuni.cz/handle/20.500.11956/123103|Rosen et al. (2020)]].
 +
 ===== Citing CzeSL-SGT ===== ===== Citing CzeSL-SGT =====