AplikaceAplikace
Nastavení

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revisionPrevious revision
Next revisionBoth sides next revision
en:cnk:diakorp [2015/11/04 16:59] – [The List of Texts of the DIACORP Corpus] annazitovaen:cnk:diakorp [2015/11/04 17:02] – [Diakorp] annazitova
Line 4: Line 4:
 Diakorp represents the diachronic section of the Czech National Corpus and aims to cover the texts of a total of seven centuries of the Czech language development. The first completed version (approximately 700 000 word forms) of the corpus was made accessible to the public in September 2005. Making the data public after the processing phase continues at a pace of about 250 000 word forms yearly. Diakorp represents the diachronic section of the Czech National Corpus and aims to cover the texts of a total of seven centuries of the Czech language development. The first completed version (approximately 700 000 word forms) of the corpus was made accessible to the public in September 2005. Making the data public after the processing phase continues at a pace of about 250 000 word forms yearly.
  
-Due to the length of the time span aimed to be covered and due to the decision to include whole texts instead of samples, Diakorp was not designed to be a representative nor balanced corpus (whether in terms of register variability or period size). These aspects will be regarded in the CNC'new line of diachronic corpora (in preparation).+Due to the length of the time span aimed to be covered and due to the decision to include whole texts instead of samples, Diakorp was not designed to be a representative nor balanced corpus (whether in terms of register variability or period size). These aspects will be regarded in new line of CNC diachronic corpora (in preparation).