Differences

This shows you the differences between two versions of the page.

en:cnk:orwell [2015/10/22 22:49] – created Václav Horký
en:cnk:orwell [2015/10/23 15:40] (current) Václav Cvrček (admin)
Line 2:
 ====== Corpus ORWELL ======
  
-This corpus was created as part of the EU Multext-East project and it is formed by the text of George Orwell's novel **[[https://en.wikipedia.org/wiki/Nineteen_Eighty-Four|1984]]** (from the English original translated by Eva Šimečková; Prague: Naše vojsko, 1991). The corpus contains c. 80 thousand words and 20 thousand punctuation marks, that is approximately 100 thousand of corpus positions and it is morphologically tagged. The relatively small size of this corpus allowed the hand-correction of mistakes, which were created during the automatic morphological analysis, which means it is almost flawlessly tagged.
+This corpus was created as part of the EU [[http://nl.ijs.si/ME|Multext-East]] project and it is formed by the text of George Orwell's novel **[[https://en.wikipedia.org/wiki/Nineteen_Eighty-Four|1984]]** (from the English original translated by Eva Šimečková; Prague: Naše vojsko, 1991). The corpus contains c. 80 thousand words and 20 thousand punctuation marks, that is approximately 100 thousand of corpus positions and it is morphologically tagged. The relatively small size of this corpus allowed the hand-correction of mistakes, which were created during the automatic morphological analysis, which means it is almost flawlessly tagged.