Both sides previous revisionPrevious revisionNext revision | Previous revisionNext revisionBoth sides next revision |
en:manualy:kontext:konkordance [2023/05/17 13:25] – [Text surroundings of the KWIC] michalskrabal | en:manualy:kontext:konkordance [2023/05/17 13:46] – [Shuffle] michalskrabal |
---|
==== Syntactic graph ==== | ==== Syntactic graph ==== |
| |
V případě, že je korpus [[en:pojmy:syntakticka_analyza|syntakticky označkován]] (např. [[en:cnk:syn2015|SYN2015]]), nachází se mezi zaškrtávacím políčkem a metainformací o textu ikonka {{:manualy:kontext:syntax-tree-icon.png?nolink&20|}} sloužící k vyvolání **syntaktického grafu**. | If the corpus is [[en:pojmy:syntakticka_analyza|tagged syntatically]] (e.g. [[en:cnk:syn2015|SYN2015]]), there is an icon {{:manualy:kontext:syntax-tree-icon.png?nolink&20|}} between the checkbox and the text meta-information, used to show the **syntactic graph** of the given sentence. |
FIXME | |
| |
==== Manual labelling of concordance lines ==== | ==== Manual labelling of concordance lines ==== |
==== Shuffle ==== | ==== Shuffle ==== |
| |
FIXME In the default settings, the concordance is ordered according to the order in which the search results (individual concordance lines) are found in the corpus (e.g. in the corpus [[en:cnk:syn2015|SYN2015]] the first texts are fiction, then non-fiction and finally journalistic). To má výhodu zejména v rychlejším vyhledání odpovídajících řádků. But in situations when the concordance is extensive and we need to acquire a representative sample (e.g. for manual analysis), je vhodnější pracovat s náhodně promíchanými řádky. Toho lze dosáhnout právě volbou **Concordance → Shuffle**. Výsledkem operace je promíchání jednotlivých řádek konkordance, které je sice náhodné, ale zároveň opakovatelné. | In the default settings, the concordance is ordered according to the order in which the search results (individual concordance lines) are found in the corpus (e.g. in the corpus [[en:cnk:syn2015|SYN2015]] the first texts are fiction, then non-fiction and finally journalistic). This has the advantage of making it quicker to find matching rows. Nonetheless, if the concordance is extensive, and one needs to acquire a representative sample (e.g. for manual analysis), it is preferable to work with randomly shuffled lines. This can be done with the **Concordance → Shuffle** option. The resulting shuffle of the concordance lines is random yet repeatable. |
| |
We recommend that the option //Shuffle// be used automatically, which ensures that every concordance, before being displayed, is randomized in this way. Trvalé nastavení promíchávání konkordančních řádek lze nastavit v menu **Zobrazení → Obecné volby zobrazení** (volba //Automaticky promíchat konkordanční řádky//). Such an approach functions as an effective prevention against drawing incorrect conclusions from studying a sample of results which originate from an unrepresentative set of texts. FIXME | Using the Shuffle option automatically is recommended, as it ensures that every concordance, before being displayed, is randomized in this way. The permanent setting of concordance lines shuffling is achievable in the **View → General view options** menu (the //Shuffle concordance lines by default// option). Such an approach functions as an effective prevention against drawing incorrect conclusions from studying a sample of results which originate from an unrepresentative set of texts. |
| |
The result of the operation **Shuffle** is the shuffling of the individual concordance lines. This shuffle is random, but it is repeatable. For every concordance there is a definite shuffle algorithm which causes the results after the first, second, third... //n// shuffle are the same after repeated trials on the same query. This guarantees the replicability of the corpus experiments and when the shuffled concordance is used. | If concordance lines shuffling is enabled by default, the Shuffle option will perform another random rearrangement. For each concordance, an explicit shuffling algorithm causes the results after the first, second, third... nth shuffle to match on repeated attempts on the same query. This guarantees the repeatability of experiments on corpora even when using a shuffled concordance. |
| |
| |
==== Sample==== | ==== Sample==== |