AplikaceAplikace
Nastavení

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revisionPrevious revision
Next revisionBoth sides next revision
en:cnk:koditex [2018/06/05 11:28] – [Chunks] petrapoukarovaen:cnk:koditex [2018/10/25 16:05] – [Sources of data] michalkren
Line 134: Line 134:
 ===== Sources of data ===== ===== Sources of data =====
  
-The vast majority of the material in the Koditex corpus draws on the resources of the Czech National Corpus (CNC); types of language data which are not collected by the CNC were acquired from other research centers. We would also like to thank Karel Pala and Vít Baisa from the [[https://nlp.fi.muni.cz/en/NLPCentre|NLPC at Masaryk University]], and Josef Šlerka and his team at Socialinsider, for providing raw data for the //wik// class and //mul// division, respectively.+The vast majority of the material in the Koditex corpus draws on the resources of the Czech National Corpus (CNC); types of language data which are not collected by the CNC were acquired from other research centers. We would also like to thank Martin Prošek and Petr Kaderka from the [[http://www.ujc.cas.cz/en|Czech Language Institute]] of the Czech Academy of Sciences for providing data from the [[http://ujc.dialogy.cz|DIALOG]] corpus, Karel Pala and Vít Baisa from the [[https://nlp.fi.muni.cz/en/NLPCentre|NLPC at Masaryk University]], and Josef Šlerka and his team at Socialinsider, for providing raw data for the //wik// class and //mul// division, respectively.
  
 The Koditex corpus was created by sampling various sources and using a number of tools, all of which are cited here: The Koditex corpus was created by sampling various sources and using a number of tools, all of which are cited here: