Differences
This shows you the differences between two versions of the page.
en:manualy:syd [2016/12/11 10:57] – [The Synchronic part] veronikapojarova | en:manualy:syd [2021/03/09 15:10] (current) – jankocek |
---|
====== SyD ====== | ====== SyD ====== |
| |
{{ kurz:syd-logo.png?direct&200|}} | {{ :manualy:syd_logo.png?nolink&200|}} |
| |
The SyD application (for the analysis of **Sy**nchronic and **D**iachronic variants) serves primarily to study competing linguistic phenomena. It supplements the more universal [[en:pojmy:korpusovy_manazer|corpus managers]] and can quickly and easily provide corpus results to lay users. As its name suggests, the application has two (essentially separate) parts: | The SyD application (for the analysis of **Sy**nchronic and **D**iachronic variants) serves primarily to study competing linguistic phenomena. It supplements the more universal [[en:pojmy:korpusovy_manazer|corpus managers]] and can quickly and easily provide corpus results to lay users. As its name suggests, the application has two (essentially separate) parts: |
===== The Diachronic part ===== | ===== The Diachronic part ===== |
| |
The basis for the diachronic analysis is the Diakon corpus, which is composed of the [[en:cnk:diakorp|Diakorp]] corpus texts, expanded with data from earlier forms of Czech which have not yet been reviewed manually. A makeshift list of source texts before the year 1989 is available in the [[en:seznamy:index#zdrojove_texty_diachronnich_korpusu|lists]] section. The most modern period is represented by a selection from the synchronic corpora of the [[en:cnk:syn|SYN]] series. Because the older texts are not yet [[wp>Lemma_(psycholinguistics)|lemmatized]], only the [[en:pojmy:word|word]] attribute (word form) can be used for queries. | The basis for the diachronic analysis is the Diakon corpus, which is composed of the [[en:cnk:diakorp|Diakorp]] corpus texts, expanded with data from earlier forms of Czech which have not yet been reviewed manually. A makeshift list of source texts before the year 1989 is available in the [[en:seznamy:index#zdrojove_texty_diachronnich_korpusu|lists]] section. The most modern period is represented by a selection from the synchronic corpora of the [[en:cnk:syn|SYN]] series. Because the older texts are not yet [[en:pojmy:lemma|lemmatized]], only the [[en:pojmy:word|word]] attribute (word form) can be used for queries. |
| |
The SyD application will evaluate all queries and find their relative frequencies in various time periods. Because the coverage of the individual time periods is not optimal, it is advisable to treat the temporal information as an approximation, and to use a moving average (with an adjustable window) for displaying trends. The timeline display also includes an error rate, which should be taken into account when analyzing data. | The SyD application will evaluate all queries and find their relative frequencies in various time periods. Because the coverage of the individual time periods is not optimal, it is advisable to treat the temporal information as an approximation, and to use a moving average (with an adjustable window) for displaying trends. The timeline display also includes an error rate, which should be taken into account when analyzing data. |
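
The idea of smoothing relative frequencies with a moving average can be illustrated with a minimal Python sketch. It does not reproduce SyD's actual computation, which this page does not specify. The function names, the per-million normalization, the centered window that shrinks at the edges, and the sample counts are all assumptions made for illustration.

```python
from typing import List


def relative_frequency(hits: int, period_size: int, per: int = 1_000_000) -> float:
    """Occurrences per `per` tokens within one time period (assumed normalization)."""
    if period_size <= 0:
        raise ValueError("period_size must be positive")
    return hits * per / period_size


def moving_average(values: List[float], window: int = 3) -> List[float]:
    """Centered moving average; the window is truncated at both ends of the series."""
    if window < 1 or window % 2 == 0:
        raise ValueError("window must be a positive odd integer")
    half = window // 2
    smoothed = []
    for i in range(len(values)):
        lo = max(0, i - half)
        hi = min(len(values), i + half + 1)
        chunk = values[lo:hi]
        smoothed.append(sum(chunk) / len(chunk))
    return smoothed


# Hypothetical hit counts of one word form in five consecutive periods,
# together with hypothetical period sizes in tokens.
hits = [2, 10, 4, 12, 6]
sizes = [1_000_000, 2_000_000, 1_000_000, 2_000_000, 1_000_000]

freqs = [relative_frequency(h, s) for h, s in zip(hits, sizes)]
trend = moving_average(freqs, window=3)
```

A wider window gives a smoother trend line but hides short-term changes, which is presumably why the window is adjustable in the application.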