Differences
This shows you the differences between two versions of the page.
Both sides previous revisionPrevious revision | Next revisionBoth sides next revision |
en:cnk:fictree [2017/12/18 09:40] – [1. A CNC corpus in the KonText interface] luciechlumska | en:cnk:fictree [2017/12/18 09:44] – [3. Data annotated in the Universal Dependencies standard] luciechlumska |
---|
===== 3. Data annotated in the Universal Dependencies standard ===== | ===== 3. Data annotated in the Universal Dependencies standard ===== |
| |
The morphological and syntactic annotation according to the UD guidelines was assigned by converting the original PDT annotation. The conversion procedure was designed by Dan Zeman and implemented in [[https://github.com/ufal/treex|Treex]]. | The morphological and syntactic annotation according to the UD guidelines was performed by converting the original PDT annotation. The conversion procedure was designed by Dan Zeman and implemented in [[https://github.com/ufal/treex|Treex]]. |
The data are available on the [[http://universaldependencies.org/treebanks/cs_fictree/index.html|Universal Dependencies]] webpage. They are in the [[http://universaldependencies.org/format.html|CONLL-U format]]. The original texts are divided into segments of maximum 100 tokens, the segments are shuffled and divided into a train, val and test data set. The FicTree treebank in UD standard is also accessible using the query tool [[https://lindat.mff.cuni.cz/services/pmltq/|PML-TQ]]. | The data are available on the [[http://universaldependencies.org/treebanks/cs_fictree/index.html|Universal Dependencies]] webpage. They are in the [[http://universaldependencies.org/format.html|CONLL-U format]]. The original texts are divided into segments of maximum 100 tokens, the segments are shuffled and divided into a train, val and test data set. The FicTree treebank in UD standard is also accessible using the query tool [[https://lindat.mff.cuni.cz/services/pmltq/|PML-TQ]]. |
| |