AplikaceAplikace
Nastavení

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revisionPrevious revision
Next revision
Previous revision
Next revisionBoth sides next revision
en:manualy:syd [2016/12/07 20:43] – [The Synchronic part] veronikapojarovaen:manualy:syd [2016/12/11 10:57] – [The Synchronic part] veronikapojarova
Line 22: Line 22:
   * [[en:cnk:oral2006|Oral2006]] + [[en:cnk:oral2008|Oral2008]] + [[en:cnk:oral2013|Oral2013]] for spoken informal language   * [[en:cnk:oral2006|Oral2006]] + [[en:cnk:oral2008|Oral2008]] + [[en:cnk:oral2013|Oral2013]] for spoken informal language
  
-In the synchronic part of the analysis it is possible to use [[wp>Lemma_(psycholinguistics)|lemmatization]] (i.e. to search for an entire lexeme including all of its possible forms), however extra care must be taken when assessing the results. While the SYN series corpora use standard lemmatization, data for spoken Czech and for correspondence are not lemmatized, and therefore the extent of the lemma is estimated based on the written language (the query is first assessed in the SYN2010 corpus and based on the forms identified a query for the non-lemmatized corpora is constructed.+In the synchronic part of the analysis it is possible to use [[en:pojmy:lemma|lemmatization]] (i.e. to search for an entire lexeme including all of its possible forms), however extra care must be taken when assessing the results. While the SYN series corpora use standard lemmatization, data for spoken Czech and for correspondence are not lemmatized, and therefore the extent of the lemma is estimated based on the written language (the query is first assessed in the SYN2010 corpus and based on the forms identified a query for the non-lemmatized corpora is constructed.
  
-Synchronní část poskytuje informaci o rozložení jevů v psaných textech (na základě [[pojmy:atributy_strukturni|strukturních atributů]] [[pojmy:txtype|txtype]] [[pojmy:genre|genre]]) i v mluveném jazyce (na základě atribtutů pohlavívěkvždělání a regionální příslušnost). Všechny údaje jsou relativizovány s ohledem na velikost dané kategorie v korpusech.+The synchronic part provides information about the distribution of phenomena in written texts (based on the [[en:pojmy:atributy_strukturni|structural attributes]] //[[en:pojmy:txtype|txtype]]// and //[[en:pojmy:genre|genre]]//and in spoken language (based on the attributes of genderageeducation and region). All data are made relative with regard to the size of the given category in the corpora.
  
-Pro analýzu lexikálních odlišností zkoumaných variant poskytuje aplikace SyD i zjednodušenou verzi [[pojmy:kolokace|kolokačních]] paradigmat k jednotlivým dotazům.+For the analysis of the lexical differences of the examined variants, the SyD application offers a simplified version of [[en:pojmy:kolokace|collocational]] paradigms to the individual queries.
  
 ===== The Diachronic part ===== ===== The Diachronic part =====