| Both sides previous revisionPrevious revisionNext revision | Previous revision |
| en:cnk:syn:verze9 [2021/12/17 09:10] – [SYN version 9] michalkren | en:cnk:syn:verze9 [2026/01/23 11:52] (current) – [Structure and annotation of SYN version 9] krivan |
|---|
| ====== Structure and annotation of SYN version 9 ====== | ====== Structure and annotation of SYN version 9 ====== |
| |
| Generally speaking, structure and annotation of SYN version 9 are based on that of the SYN2020 corpus. In particular, hierarchy of structural tags for SYN version 9 has been taken over from SYN2020, as well as the [[en:cnk:syn2020#annotation_of_syn2020changes_compared_to_other_corpora_of_the_syn_series|lemmatization and morphological tagging]]. Please note that **SYN version 9 differs in this from its predecessor, [[en:cnk:syn:verze8|SYN version 8]]**. | Generally speaking, structure and annotation of SYN version 9 are based on that of the SYN2020 corpus. Hierarchy of structural tags for SYN version 9 has been taken over from SYN2020. Morphological tagging, lemmatization, and tokenization of the corpus are performed fully automatically according to the [[en:cnk:anotacni_standard_cnk|unified CNC annotation scheme]]. Please note that **SYN version 9 differs in this from its predecessor, [[en:cnk:syn:verze8|SYN version 8]]**. |
| |
| This correspondence of structure and annotation between SYN version 9 and [[en:cnk:syn2020|SYN2020]] only has the following exceptions: | This correspondence of structure and annotation between SYN version 9 and [[en:cnk:syn2020|SYN2020]] only has the following exceptions: |
| |
| <WRAP round tip 70%> | <WRAP round tip 70%> |
| Křen, M. – Cvrček, V. – Henyš, J. – Hnátková, M. – Jelínek, T. – Kocek, J. – Kováříková, D. – Křivan, J. – Milička, J. – Petkevič, V. – Procházka, P. – Skoumalová, H. – Šindlerová, J. – Škrabal, M.: //Corpus SYN, verze 9 from 5. 12. 2021//. Ústav Českého národního korpusu FF UK, Praha 2021. Available online: https://www.korpus.cz. | Křen, M. – Cvrček, V. – Henyš, J. – Hnátková, M. – Jelínek, T. – Kocek, J. – Kováříková, D. – Křivan, J. – Milička, J. – Petkevič, V. – Procházka, P. – Skoumalová, H. – Šindlerová, J. – Škrabal, M.: //Corpus SYN, version 9 from 5. 12. 2021//. Ústav Českého národního korpusu FF UK, Praha 2021. Available online: https://www.korpus.cz. |
| |
| |
| Hnátková, M. – Křen, M. – Procházka, P. – Skoumalová, H. (2014): [[http://www.lrec-conf.org/proceedings/lrec2014/pdf/294_Paper.pdf|The SYN-series corpora of written Czech]]. In //Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)//, 160–164. Reykjavík: ELRA. ISBN 978-2-9517408-8-4. | Hnátková, M. – Křen, M. – Procházka, P. – Skoumalová, H. (2014): [[http://www.lrec-conf.org/proceedings/lrec2014/pdf/294_Paper.pdf|The SYN-series corpora of written Czech]]. In //Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)//, 160–164. Reykjavík: ELRA. ISBN 978-2-9517408-8-4. |
| | |
| | Jelínek, T. – Křivan, J. – Petkevič, V. – Skoumalová, H. – Šindlerová, J. (2021): [[https://doi.org/10.1007/978-3-030-83527-9_4|SYN2020: A new corpus of Czech with an innovated annotation]]. In: K. Ekštein – F. Pártl – M. Konopík (eds.), //Text, Speech, and Dialogue.// TSD 2021. Lecture Notes in Computer Science, vol. 12848. Cham: Springer, 48–59. |
| | |
| | Křivan, J. – Šindlerová, J. (2022): [[http://sas.ujc.cas.cz/archiv.php?lang=en&art=4508|Změny v morfologické anotaci korpusů řady SYN: nové možnosti zkoumání české gramatiky a lexikonu]]. //Slovo a slovesnost//, 83, 2/2022, 122–145. |
| | |
| </WRAP> | </WRAP> |
| |