AplikaceAplikace
Nastavení

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revisionPrevious revision
Next revision
Previous revision
Next revisionBoth sides next revision
en:cnk:koditex [2018/06/05 11:28] – [Chunks] petrapoukarovaen:cnk:koditex [2018/10/25 16:06] – [Sources of data] michalkren
Line 134: Line 134:
 ===== Sources of data ===== ===== Sources of data =====
  
-The vast majority of the material in the Koditex corpus draws on the resources of the Czech National Corpus (CNC); types of language data which are not collected by the CNC were acquired from other research centers. We would also like to thank Karel Pala and Vít Baisa from the [[https://nlp.fi.muni.cz/en/NLPCentre|NLPC at Masaryk University]], and Josef Šlerka and his team at Socialinsider, for providing raw data for the //wik// class and //mul// division, respectively.+The vast majority of the material in the Koditex corpus draws on the resources of the Czech National Corpus (CNC); types of language data which are not collected by the CNC were acquired from other research centers. We would also like to thank Martin Prošek and Petr Kaderka from the [[http://www.ujc.cas.cz/en|Czech Language Institute]] of the Czech Academy of Sciences for providing data from the [[http://ujc.dialogy.cz/?q=en/node/80|DIALOG]] corpus, Karel Pala and Vít Baisa from the [[https://nlp.fi.muni.cz/en/NLPCentre|NLPC at Masaryk University]], and Josef Šlerka and his team at Socialinsider, for providing raw data for the //wik// class and //mul// division, respectively.
  
 The Koditex corpus was created by sampling various sources and using a number of tools, all of which are cited here: The Koditex corpus was created by sampling various sources and using a number of tools, all of which are cited here: