en:cnk:uvod [2022/01/14 15:54] – [Corpora of the Czech National Corpus project] alexandrrosen | en:cnk:uvod [2022/01/14 15:55] – [Corpora of the Czech National Corpus project] alexandrrosen |
---|
^ corpus ^ size (word count) ^ lemmas ^ morphological tags ^ year ^ characteristic features ^ | ^ corpus ^ size (word count) ^ lemmas ^ morphological tags ^ year ^ characteristic features ^ |
| **Parallel corpora** |||||| | | **Parallel corpora** |||||| |
| [[en:cnk:intercorp|InterCorp]] ([[en:cnk:intercorp:verze13ud|Release 13ud]], [[en:cnk:intercorp:verze14|Release 14]]) | 1.8G | (✓) | (✓) | 2008–2022 | versioned parallel corpus for 41 languages | | | [[en:cnk:intercorp|InterCorp]] ([[en:cnk:intercorp:verze13ud|release 13ud]], [[en:cnk:intercorp:verze14|release 14]]) | 1.8G | (✓) | (✓) | 2008–2022 | versioned parallel corpus for 41 languages | |
| **Comparable corpora** |||||| | | **Comparable corpora** |||||| |
| [[en:cnk:aranea|Aranea]] | 1G | ✓ | ✓ | 2014 | comparable web corpora for several languages (cs, de, en, es, fi, fr, hu, it, nl, pl, pt, ru, sk, zh) | | | [[en:cnk:aranea|Aranea]] | 1G | ✓ | ✓ | 2014 | comparable web corpora for several languages (cs, de, en, es, fi, fr, hu, it, nl, pl, pt, ru, sk, zh) | |