en:cnk:syn [2022/12/21 13:09] – [SYN corpus] michalkren | en:cnk:syn [2023/12/29 12:21] (current) – michalkren |
---|
^ <fs medium> SYN corpus versions</fs> ^^^^ | ^ <fs medium> SYN corpus versions</fs> ^^^^ |
^ version ^ year of publication ^ size (no. of words) ^ content ^ | ^ version ^ year of publication ^ size (no. of words) ^ content ^ |
| ^ [[en:cnk:syn:verze12|SYN version 12]] | 2023 | 5.175G | [[en:cnk:syn2000|SYN2000]], [[en:cnk:syn2005|SYN2005]], [[en:cnk:syn2006PUB|SYN2006PUB]], [[en:cnk:syn2009PUB|SYN2009PUB]], [[en:cnk:syn2010|SYN2010]], [[en:cnk:syn2013PUB|SYN2013PUB]], [[en:cnk:syn2015|SYN2015]], [[en:cnk:syn2020|SYN2020]], other journalistic texts | |
^ [[en:cnk:syn:verze11|SYN version 11]] | 2022 | 5.032G | [[en:cnk:syn2000|SYN2000]], [[en:cnk:syn2005|SYN2005]], [[en:cnk:syn2006PUB|SYN2006PUB]], [[en:cnk:syn2009PUB|SYN2009PUB]], [[en:cnk:syn2010|SYN2010]], [[en:cnk:syn2013PUB|SYN2013PUB]], [[en:cnk:syn2015|SYN2015]], [[en:cnk:syn2020|SYN2020]], other journalistic texts | | ^ [[en:cnk:syn:verze11|SYN version 11]] | 2022 | 5.032G | [[en:cnk:syn2000|SYN2000]], [[en:cnk:syn2005|SYN2005]], [[en:cnk:syn2006PUB|SYN2006PUB]], [[en:cnk:syn2009PUB|SYN2009PUB]], [[en:cnk:syn2010|SYN2010]], [[en:cnk:syn2013PUB|SYN2013PUB]], [[en:cnk:syn2015|SYN2015]], [[en:cnk:syn2020|SYN2020]], other journalistic texts | |
^ [[en:cnk:syn:verze10|SYN version 10]] | 2022 | 4.882G | [[en:cnk:syn2000|SYN2000]], [[en:cnk:syn2005|SYN2005]], [[en:cnk:syn2006PUB|SYN2006PUB]], [[en:cnk:syn2009PUB|SYN2009PUB]], [[en:cnk:syn2010|SYN2010]], [[en:cnk:syn2013PUB|SYN2013PUB]], [[en:cnk:syn2015|SYN2015]], [[en:cnk:syn2020|SYN2020]], other journalistic texts | | ^ [[en:cnk:syn:verze10|SYN version 10]] | 2022 | 4.882G | [[en:cnk:syn2000|SYN2000]], [[en:cnk:syn2005|SYN2005]], [[en:cnk:syn2006PUB|SYN2006PUB]], [[en:cnk:syn2009PUB|SYN2009PUB]], [[en:cnk:syn2010|SYN2010]], [[en:cnk:syn2013PUB|SYN2013PUB]], [[en:cnk:syn2015|SYN2015]], [[en:cnk:syn2020|SYN2020]], other journalistic texts | |
====== Advantages of the SYN corpus ====== | ====== Advantages of the SYN corpus ====== |
| |
* access to extensive language data (more than 4 billion words) | * access to extensive language data (more than 5 billion words) |
* possibility to search all the SYN-series corpora at the same time | * possibility to search all the SYN-series corpora at the same time |
* possibility to create subcorpora that correspond to the original corpora | * possibility to create subcorpora that correspond to the original corpora |