Differences
This shows you the differences between two versions of the page.
Both sides previous revisionPrevious revision | Last revisionBoth sides next revision | ||
en:cnk:czesl-plain [2018/08/07 12:39] – alexandrrosen | en:cnk:czesl-plain [2018/08/07 12:47] – metadata missing alert alexandrrosen | ||
---|---|---|---|
Line 19: | Line 19: | ||
The essays and handwritten school exams were collected as manuscripts, | The essays and handwritten school exams were collected as manuscripts, | ||
+ | |||
+ | |||
+ | Texts of non-native speakers (the **ciz** part), extended by some newer texts, are available as the CzeSL-sgt corpus, together with metadata and automatically performed morphosyntactic and error annotation, including the identification of incorrect forms. The CzeSL-plain corpus is also available from the LINDAT-Clarin repository as AKCES 3 a AKCES4. See also CzeSL – a Learner Corpus of Czech. | ||
Although the CzeSL-plain corpus does not contain any linguistic annotation at the moment, its next release will include more texts (the corpus is thus non-reference) and provide automatic identification of incorrect forms and morphosyntactic tags. Some of the texts included in the CzeSL-plain corpus are annotated by correct forms, error labels, morphosyntactic tags and lemmas and are due for release under a different purpose-built search interface. | Although the CzeSL-plain corpus does not contain any linguistic annotation at the moment, its next release will include more texts (the corpus is thus non-reference) and provide automatic identification of incorrect forms and morphosyntactic tags. Some of the texts included in the CzeSL-plain corpus are annotated by correct forms, error labels, morphosyntactic tags and lemmas and are due for release under a different purpose-built search interface. |