Differences

This shows you the differences between two versions of the page.

Previous revision: en:cnk:czesl-plain [2015/10/24 11:27] – [Citing CzeSL] version in English (vaclavhorky)
Next revision: en:cnk:czesl-plain [2018/08/07 12:39] (alexandrrosen)
Line 11:
   * **kval** – academic texts obtained from non-native speakers of Czech studying at Czech universities in Masters or doctoral programmes;
   * **rom** – transcripts of texts written at school by pupils and students with Romani background in communities endangered by social exclusion.
 +
 +The corpus does not include any other data about the author or about the text itself.
  
 The texts in all three groups were produced by speakers who have not (yet) acquired the Czech linguistic skills of an adult native speaker. As an acquisition corpus, the texts may serve both for research in the field of learning and teaching and for practical educational purposes. The first two datasets concern Czech as a second/foreign language, constituting an L2 acquisition (learner) subcorpus, while the third dataset is a subcorpus focusing on L1 acquisition (Czech is not considered to be a foreign language for the students with Romani background monitored in this project). This is the very first publicly available corpus of this type for Czech.