Differences

This shows you the differences between two versions of the page.

en:manualy:kontext:konkordance [2023/05/17 13:29] – [Syntactic graph] michalskrabal
en:manualy:kontext:konkordance [2023/07/10 17:00] lukes
Line 2: Line 2:
  
 [{{:en:manualy:kontext:konkordance-liska.png?direct&450 |Concordance list}}]
-The basic type of query evaluation is [[en:pojmy:konkordance|concordance]], in particular, the concordance list. It is a list of all verbs or phrases which fit the ([[en:pojmy:kwic|KWIC]]) query, along with their right and left contexts, possibly also with information about the source text. A long concordance list is usually divided into several pages, and it is possible to switch between them with the help of the arrows placed in the header and footer of the concordance list. Display parameter settings (the number of rows on a page, length of context etc.) can be changed with the [[en:manualy:kontext:moznosti_zobrazeni|View]] option.
+The basic kind of query output is a [[en:pojmy:konkordance|concordance]], in particular the concordance list. It is a list of all words or phrases which match the ([[en:pojmy:kwic|KWIC]]) query, along with their right and left contexts, possibly also with information about the source text. A long concordance list is usually divided into several pages, and it is possible to switch between them using the arrows placed in the header and footer of the concordance list. Display parameter settings (the number of rows on a page, length of context etc.) can be changed using the [[en:manualy:kontext:moznosti_zobrazeni|View]] menu.
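The idea of a concordance list with left and right contexts, split into pages, can be illustrated with a minimal Python sketch. It is not how KonText is implemented: the names `ConcordanceLine`, `concordance` and `page`, the exact-match query and the token-based context width are all assumptions made for illustration only.

```python
from typing import List, NamedTuple


class ConcordanceLine(NamedTuple):
    """One row of a concordance list (hypothetical structure)."""
    left: str   # left context
    kwic: str   # the matching keyword (key word in context)
    right: str  # right context


def concordance(tokens: List[str], query: str, context: int = 3) -> List[ConcordanceLine]:
    """Return one line per token equal to `query` (case-insensitive),
    with up to `context` tokens of context on each side."""
    lines = []
    for i, token in enumerate(tokens):
        if token.lower() == query.lower():
            left = " ".join(tokens[max(0, i - context):i])
            right = " ".join(tokens[i + 1:i + 1 + context])
            lines.append(ConcordanceLine(left, token, right))
    return lines


def page(lines: List[ConcordanceLine], number: int, page_size: int = 2) -> List[ConcordanceLine]:
    """Return the 1-based page `number` of the concordance list."""
    start = (number - 1) * page_size
    return lines[start:start + page_size]


tokens = "the cat sat on the mat and the dog sat on the rug".split()
hits = concordance(tokens, "sat", context=2)
for line in page(hits, 1):
    print(f"{line.left:>10} | {line.kwic} | {line.right}")
```

The `context` and `page_size` parameters play the role of the display settings mentioned above (length of context, number of rows on a page).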
 + 
+The following sections describe the functionality of the page, basic work with a concordance list, sorting, shuffling the list and creating a sample. Further work with the concordance (i.e. [[en:manualy:kontext:filtr|filtering]], [[en:manualy:kontext:frekvence|frequency]], and [[en:manualy:kontext:kolokace|collocation]] analysis) is covered on separate pages corresponding to other menu items.
  
-The following sections describe the functionality of the page, basic work with a concordance list, sorting, shuffling the list and creating a sample. Further work with the concordance, i.e. ([[en:manualy:kontext:filtr|filtering]], [[en:manualy:kontext:frekvence|frequency]], and [[[[en:manualy:kontext:kolokace|collocation]] analysis), is devoted to separate pages corresponding to other menu items. 
 ===== Navigation with concordance editing sequence =====
  
-Under the KonText logo, there is navigation that provides basic information about the searched corpus (item **Corpus**) and allows to modify the initial query (item **Query**) as well as to change any subsequent modifications above the concordance list (shuffling and other items corresponding to the modifications).
+Under the KonText logo, a navigation overview provides basic information about the searched corpus (item **Corpus**) and allows modifying the initial query (item **Query**) as well as changing any subsequent actions performed on the concordance list (e.g. shuffling, filters etc.).
  
  
Line 14: Line 15:
 \\ \\
 \\ \\
-The text of the link in the Query item always shows the query entered by the user in a simplified way, followed by the number of occurrences in parentheses. The query can be displayed (including the selected corpus and all conditions) and edited using the link, i.e. its parameters can be changed directly above the searched concordance (without the need to enter a new query). This feature is especially useful if you want to refine or update the original query. When editing a query, you must also decide whether the following operations should be performed on the concordance. They include a useful shuffle option (this can be run once from the menu **Concordance → Shuffle** or set as default in the menu **View → General view options**, see the section below for details), so it is usually a good idea to leave the option **Perform automatically also subsequent operations** checked.
+The text of the link in the **Query** item always shows a simplified representation of the query entered by the user, followed by the number of occurrences in parentheses. The query can be displayed (including the selected corpus and all conditions) and edited using the link, i.e. its parameters can be changed directly above the search results (without the need to enter a new query). This feature is especially useful if you want to refine or update the original query. When editing a query, you must also decide whether the following operations should be performed on the concordance. These include a useful shuffle option (this can be run manually from the menu **Concordance → Shuffle** or set as default in the menu **View → General view options**, see below for details), so it is usually a good idea to leave the option **Perform automatically also subsequent operations** checked.
  
-The following line items represent the steps in the edit sequence that change the form and extent of the concordance. These can be handled in a similar way to the initial query.
+The subsequent navigation items represent the steps in the edit sequence that change the form and extent of the concordance. These can be manipulated in a similar way to the initial query.
  
 [{{:en:manualy:kontext:popis-dotazu_en.png?direct&600|Description of a more complex query }}]
Line 22: Line 23:
  
  
-The query overview with the sequence of all concordance modifications is also available in tabular form by clicking the last link **Details**.
+The query overview with the sequence of all concordance modifications is also available in tabular form by clicking the last link, **Details**.
  
-Besides editing the query, the user can also return to the individual steps and intermediate phases, renewing the form of the concordance from any of the previous steps. E.g. we can easily return to the default (and subsequently shuffled) concordance list using the **View** link on the second line with the shuffle operation.
+Besides editing the query, the user can also return to the form of the concordance from any of the previous steps and intermediate phases. E.g. we can easily return to the default concordance list (including shuffling) using the **View** link in the second row with the shuffle operation.
  
-Query overview thus allows for an exact query specification for future use, e.g. in a research report, scientific paper etc. It is a more complex and more general variant of the simple //Undo// button in a web browser. While the Query overview allows you to browse and edit operations on a concordance, the Undo button allows you to browse (but only in a fixed order) both the concordance and, say, the frequency or collocation analysis results.
+The query overview thus allows for an exact query specification for future use, e.g. in a research report, scientific paper etc. It is a more complex and more general variant of the web browser's simple built-in //Undo// button. While the Query overview allows you to browse and edit operations on a concordance, the Undo button allows you to browse (but only in a fixed order) both the concordance and, say, the frequency or collocation analysis results.
  
-===== Working with concordance =====
+===== Working with concordance =====
  
 ==== Introduction ====
Line 47: Line 48:
 In **newer written corpora** (starting with [[en:cnk:syn2020|SYN2020]] and [[en:cnk:syn:verze9|SYN version 9]]) it is possible to switch between two window display options: a default view corresponding to the [[en:pojmy:token|tokenization]] of the corpus, or a formatted text view which to some extent also preserves the typographic form of the source text (this can be used, for example, to display poetry more clearly).
  
-[{{:kurz:dialog_p.png?nolink&direct&450 |Expanding the context in spoken corpora and its representation by utterances FIXME}}]
+[{{:en:manualy:kontext:dialog_p_en.png?&direct&450 |Expanding the context in spoken corpora and its representation by utterances}}]
 In **spoken corpora**, switching between two display options -- linear display (as in written corpora) or display by individual speakers’ utterances -- is possible too. Obviously, the Speech view makes it easier to navigate through spoken language transcripts.
  
Line 92: Line 93:
 ==== Shuffle ====
    
-FIXME In the default settings, the concordance is ordered according to the order in which the search results (individual concordance lines) are found in the corpus (e.g. in the corpus [[en:cnk:syn2015|SYN2015]] the first texts are fiction, then non-fiction and finally journalistic). This is advantageous mainly because the matching lines are found faster. But in situations when the concordance is extensive and we need to acquire a representative sample (e.g. for manual analysis), it is more suitable to work with randomly shuffled lines. This can be achieved with the **Concordance → Shuffle** option. The result of the operation is a shuffling of the individual concordance lines that is random but at the same time repeatable.
+In the default settings, the concordance is ordered according to the order in which the search results (individual concordance lines) are found in the corpus (e.g. in the corpus [[en:cnk:syn2015|SYN2015]] the first texts are fiction, then non-fiction and finally journalistic). This has the advantage of making it quicker to find matching rows. Nonetheless, if the concordance is extensive and one needs to acquire a representative sample (e.g. for manual analysis), it is preferable to work with randomly shuffled lines. This can be done with the **Concordance → Shuffle** option. The resulting shuffle of the concordance lines is random yet repeatable.
  
-We recommend that the option //Shuffle// be used automatically, which ensures that every concordance, before being displayed, is randomized in this way. Shuffling of concordance lines can be set permanently in the menu **View → General view options** (the //Shuffle concordance lines by default// option). Such an approach functions as an effective prevention against drawing incorrect conclusions from studying a sample of results which originate from an unrepresentative set of texts. FIXME
+Using the Shuffle option automatically is recommended, as it ensures that every concordance, before being displayed, is randomized in this way. Shuffling of concordance lines can be enabled permanently in the **View → General view options** menu (the //Shuffle concordance lines by default// option). Such an approach effectively prevents drawing incorrect conclusions from a sample of results which originate from an unrepresentative set of texts.
  
-The result of the operation **Shuffle** is the shuffling of the individual concordance lines. This shuffle is random, but it is repeatable. For every concordance there is a definite shuffle algorithm which causes the results after the first, second, third... //n// shuffle are the same after repeated trials on the same query. This guarantees the replicability of the corpus experiments and when the shuffled concordance is used.
+If shuffling of concordance lines is enabled by default, the Shuffle option will perform another random rearrangement. For each concordance, a fixed shuffling algorithm ensures that the results after the first, second, third... //n//th shuffle are the same on repeated attempts with the same query. This guarantees the repeatability of experiments on corpora even when using a shuffled concordance.
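A shuffle that is random yet repeatable can be sketched with a seeded pseudo-random generator. The following Python example only illustrates the principle. The function name `shuffle_lines`, the `seed` parameter and the use of Python's `random` module are assumptions, as the manual does not describe the algorithm KonText actually uses.

```python
import random
from typing import List, Sequence


def shuffle_lines(lines: Sequence[str], times: int = 1, seed: int = 0) -> List[str]:
    """Return a copy of `lines` after `times` successive shuffles.

    The generator is always started from the same (assumed) seed, so the
    result of the 1st, 2nd, ... nth shuffle is identical on every run.
    """
    rng = random.Random(seed)  # private generator, independent of global state
    result = list(lines)       # the input list is left untouched
    for _ in range(times):
        rng.shuffle(result)
    return result


lines = [f"concordance line {i}" for i in range(10)]

first_run = shuffle_lines(lines, times=2)
second_run = shuffle_lines(lines, times=2)
print(first_run == second_run)  # the second shuffle is reproducible
```

Because the outcome depends only on the input lines, the number of shuffles and the seed, a shuffled sample drawn in this way can be reproduced later.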
- +
  
 ==== Sample ====