Differences

This shows you the differences between two versions of the page.

en:cnk:czesl-sgt [2015/10/07 12:03] – created alexandrrosen
en:cnk:czesl-sgt [2015/10/24 11:28] – [Citing CzeSL-SGT] version in English vaclavhorky
Line 1: Line 1:
-[[http://ucnk.ff.cuni.cz/english/czesl-sgt.php|CzeSL-SGT – a corpus of non-native speakers’ Czech with automatic annotation]]
 +~~NOTOC~~ 
 +====== CzeSL-SGT – a corpus of non-native speakers’ Czech with automatic annotation ====== 
 + 
 +The CzeSL-SGT corpus (//**Cze**ch as a **S**econd **L**anguage with **S**pelling, **G**rammar and **T**ags//) includes transcriptions of essays written by non-native speakers of Czech, extending the “foreign” (ciz) part of the [[cnk:CzeSL-plain]] corpus by texts collected in 2013. 
 + 
 +Word forms are tagged with word class, morphological categories and base forms (lemmas). Some forms are corrected, and the corrected texts are tagged again. Original and corrected forms are then compared and error labels are assigned. Because the annotation is assigned automatically, it inevitably contains some errors. 
 + 
 +Most texts come with metadata about the author and the text. 
 + 
 +The corpus is available either for on-line searching using the [[http://www.korpus.cz/kontext|search interface]] of the Czech National Corpus, or for [[http://hdl.handle.net/11858/00-097C-0000-0023-95B1-E|download as a whole]] from the [[http://www.lindat.cz/|LINDAT]] data repository. 
 + 
 +For more about the CzeSL-SGT corpus, see [[http://utkl.ff.cuni.cz/%7Erosen/public/2014-czesl-sgt-en.pdf]]. 
 + 
 +===== Citing CzeSL-SGT ===== 
 + 
 +<WRAP round tip 70%> 
 +Šebesta, K. -- Bedřichová, Z. -- Šormová, K. -- Štindlová, B. -- Hrdlička, M. -- Hrdličková, T. -- Hana, J. -- Petkevič, V. -- Jelínek, T. -- Škodová, S. -- Poláčková, M. -- Janeš, P. -- Lundáková, K. -- Skoumalová, H. -- Sládek, Š. -- Pierscieniak, P. -- Toufarová, D. -- Richter, M. -- Straka, M. -- Rosen, A.: //CzeSL-SGT: korpus češtiny nerodilých mluvčích s automaticky provedenou anotací, version 2 from 28 Sep 2014//. Ústav Českého národního korpusu FF UK, Praha 2014. Available on-line: http://www.korpus.cz 
 +</WRAP>