Differences
This shows you the differences between two versions of the page.
en:cnk:intercorp:verze11 [2019/10/06 20:43] – [Taggers/lemmatizers:] michalskrabal | en:cnk:intercorp:verze11 [2019/12/19 23:51] – [Texts in the corpus] alexandrrosen |
---|
| |
| |
[{{:cnk:intercorp:intercorp_wordcounts_v11.png|Setup of the parallel corpus – the core and collections}}] | [{{:cnk:intercorp:intercorp_wordcounts_v12.png|Setup of the parallel corpus – the core and collections}}] |
| |
[{{:cnk:intercorp:intercorp_wordcounts2_v11.png|Setup of the parallel corpus – the core}}] | [{{:cnk:intercorp:intercorp_wordcounts2_v12.png|Setup of the parallel corpus – the core}}] |
| |
[{{:cnk:intercorp:intercorp_wordcounts3_v11.png|Setup of the parallel corpus – collections}}] | [{{:cnk:intercorp:intercorp_wordcounts3_v12.png|Setup of the parallel corpus – collections}}] |
| |
===== Corpus size in thousands of words ===== | ===== Corpus size in thousands of words ===== |
* [[https://peteris.rocks/blog/latvian-part-of-speech-tagging/|LVTagger]] for Latvian (thanks to Pēteris Paikens and Michal Škrabal) | * [[https://peteris.rocks/blog/latvian-part-of-speech-tagging/|LVTagger]] for Latvian (thanks to Pēteris Paikens and Michal Škrabal) |
* [[http://ufal.mff.cuni.cz/udpipe|UDPipe]] for Belarusian and Ukrainian (thanks to Bohdan Moskalevskyi) | * [[http://ufal.mff.cuni.cz/udpipe|UDPipe]] for Belarusian and Ukrainian (thanks to Bohdan Moskalevskyi) |
| * [[https://taku910.github.io/mecab/|MeCab]] and [[https://osdn.net/projects/unidic/|UniDic]] for Japanese |
| |
| |