Both sides previous revisionPrevious revisionNext revision | Previous revision |
en:cnk:syn:verze12 [2023/12/28 17:18] – [Structure and annotation of SYN version 12] michalkren | en:cnk:syn:verze12 [2023/12/29 09:15] (current) – [How to cite SYN version 12] michalkren |
---|
</WRAP> | </WRAP> |
| |
Every **SYN corpus** contains all the [[en:pojmy:synchronni|synchronic]] [[en:pojmy:psany|written]] corpora of the [[en:cnk:syn|SYN]] series published up until the time of the given version's publication. The corpus SYN version 12 therefore contains the [[en:cnk:syn2000|SYN2000]], [[en:cnk:syn2005|SYN2005]], [[en:cnk:syn2006pub|SYN2006PUB]], [[en:cnk:syn2009pub|SYN2009PUB]], [[en:cnk:syn2010|SYN2010]],[[en:cnk:syn2013pub|SYN2013PUB]], [[en:cnk:syn2015|SYN2015]] and [[en:cnk:syn2020|SYN2020]] corpora; additionally, it contains a journalistic component predominantly from 2010–2021 (already included into [[en:cnk:syn:verze4|SYN version 4]] -- [[en:cnk:syn:verze10|SYN version 11]]) corpora, and as yet **unpublished journalistic texts from 2022** in yearly volume almost 150 mil. words. | Every **SYN corpus** contains all the [[en:pojmy:synchronni|synchronic]] [[en:pojmy:psany|written]] corpora of the [[en:cnk:syn|SYN]] series published up until the time of the given version's publication. The corpus SYN version 12 therefore contains the [[en:cnk:syn2000|SYN2000]], [[en:cnk:syn2005|SYN2005]], [[en:cnk:syn2006pub|SYN2006PUB]], [[en:cnk:syn2009pub|SYN2009PUB]], [[en:cnk:syn2010|SYN2010]],[[en:cnk:syn2013pub|SYN2013PUB]], [[en:cnk:syn2015|SYN2015]] and [[en:cnk:syn2020|SYN2020]] corpora; additionally, it contains a journalistic component predominantly from 2010–2021 (already included into [[en:cnk:syn:verze4|SYN version 4]] -- [[en:cnk:syn:verze11|SYN version 11]]) corpora, and as yet **unpublished journalistic texts from 2022** in yearly volume almost 150 mil. words. |
| |
The SYN corpus is not [[en:pojmy:reprezentativnost|representative]]; the dominant component is journalism, which is the result of the predominance of journalistic corpora [[en:cnk:syn2006pub|SYN2006PUB]], [[en:cnk:syn2009pub|SYN2009PUB]], [[en:cnk:syn2013pub|SYN2013PUB]] and the journalistic component from 2010--2022. | The SYN corpus is not [[en:pojmy:reprezentativnost|representative]]; the dominant component is journalism, which is the result of the predominance of journalistic corpora [[en:cnk:syn2006pub|SYN2006PUB]], [[en:cnk:syn2009pub|SYN2009PUB]], [[en:cnk:syn2013pub|SYN2013PUB]] and the journalistic component from 2010--2022. |
| |
<WRAP round tip 70%> | <WRAP round tip 70%> |
Křen, M. – Cvrček, V. – Hnátková, M. – Jelínek, T. – Kocek, J. – Kováříková, D. – Křivan, J. – Milička, J. – Petkevič, V. – Procházka, P. – Skoumalová, H. – Šindlerová, J. – Škrabal, M.: //Corpus SYN, version 12 from 29. 12. 2023//. Ústav Českého národního korpusu FF UK, Praha 2023. Available online: https://www.korpus.cz. | Křen, M. – Cvrček, V. – Čapka, T. – Hnátková, M. – Jelínek, T. – Kocek, J. – Kováříková, D. – Křivan, J. – Milička, J. – Petkevič, V. – Skoumalová, H. – Šindlerová, J. – Škrabal, M.: //Corpus SYN, version 12 from 29. 12. 2023//. Ústav Českého národního korpusu FF UK, Praha 2023. Available online: https://www.korpus.cz. |
| |
| |