AplikaceAplikace
Nastavení

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revisionPrevious revision
Next revision
Previous revision
en:cnk:koditex [2018/10/25 16:05] – [Sources of data] michalkrenen:cnk:koditex [2018/11/01 16:15] (current) – [How to cite Koditex] vaclavcvrcek
Line 134: Line 134:
 ===== Sources of data ===== ===== Sources of data =====
  
-The vast majority of the material in the Koditex corpus draws on the resources of the Czech National Corpus (CNC); types of language data which are not collected by the CNC were acquired from other research centers. We would also like to thank Martin Prošek and Petr Kaderka from the [[http://www.ujc.cas.cz/en|Czech Language Institute]] of the Czech Academy of Sciences for providing data from the [[http://ujc.dialogy.cz|DIALOG]] corpus, Karel Pala and Vít Baisa from the [[https://nlp.fi.muni.cz/en/NLPCentre|NLPC at Masaryk University]], and Josef Šlerka and his team at Socialinsider, for providing raw data for the //wik// class and //mul// division, respectively.+The vast majority of the material in the Koditex corpus draws on the resources of the Czech National Corpus (CNC); types of language data which are not collected by the CNC were acquired from other research centers. We would also like to thank Martin Prošek and Petr Kaderka from the [[http://www.ujc.cas.cz/en|Czech Language Institute]] of the Czech Academy of Sciences for providing data from the [[http://ujc.dialogy.cz/?q=en/node/80|DIALOG]] corpus, Karel Pala and Vít Baisa from the [[https://nlp.fi.muni.cz/en/NLPCentre|NLPC at Masaryk University]], and Josef Šlerka and his team at Socialinsider, for providing raw data for the //wik// class and //mul// division, respectively.
  
 The Koditex corpus was created by sampling various sources and using a number of tools, all of which are cited here: The Koditex corpus was created by sampling various sources and using a number of tools, all of which are cited here:
Line 153: Line 153:
  
 <WRAP round tip 70%> <WRAP round tip 70%>
-Zasina, Adrian J., David Lukeš, Zuzana Komrsková, Petra Poukarová  & Anna Řehořková. 2018. Koditex (A corpus of diversified texts)Faculty of Arts, Institute of the Czech National Corpus, Charles University in Prague.+Zasina, A. J. – Lukeš, D. – Komrsková, Z. – Poukarová, P. – Řehořková, A.: //KoditexA corpus of diversified texts//. Institute of the Czech National Corpus, Faculty of Arts, Charles UniversityPrague 2018. Available at WWW: www.korpus.cz
 </WRAP> </WRAP>
 +