Both sides previous revisionPrevious revisionNext revision | Previous revision |
en:manualy:kontext:frekvence [2023/03/07 17:02] – [Custom settings of frequency distribution] jankrivan | en:manualy:kontext:frekvence [2023/03/13 14:17] (current) – [Custom settings of frequency distribution] lukes |
---|
After clicking on the heading of the column, the table will automatically be rearranged according to the selected column. This way, it is possible to create a list that is arranged alphabetically (in addition to the usual list arranged according to the frequency). | After clicking on the heading of the column, the table will automatically be rearranged according to the selected column. This way, it is possible to create a list that is arranged alphabetically (in addition to the usual list arranged according to the frequency). |
| |
FIXME The **Share the table** function (the link is placed in the row above the table) generates a permanent link to the current concordance, which can be sent directly from the form window to the specified e-mail address or later mentioned in an article, study, etc. FIXME | The **Share the table** function (the link is placed in the row above the table) generates a permanent link to the table, which can be sent directly from the form window to the specified e-mail address or later mentioned in an article, study, etc. |
| |
==== Chart view ==== | ==== Chart view ==== |
The graphical display allows you to visualize the information presented in the previous section (absolute and relative frequencies of items with their confidence intervals) in the form of two types of graphs: either a horizontal **bar chart** or a "**word cloud**" graph. | The graphical display allows you to visualize the information presented in the previous section (absolute and relative frequencies of items with their confidence intervals) in the form of two types of graphs: either a horizontal **bar chart** or a "**word cloud**" graph. |
| |
[{{:en::manualy:kontext:fqdist-word-drevo_en.png?direct&400|Visualization type: bar }}] | [{{:en::manualy:kontext:fqdist-word-drevo_en.png?direct&350|Visualization type: bar }}] |
\\ | \\ |
By default, a bar chart with relative frequencies including 95% confidence intervals is displayed. | By default, a bar chart with relative frequencies including 95% confidence intervals is displayed. |
Finally, the graph can be switched to a "word cloud," which displays a group of examined items (in our example, word forms) in sizes corresponding relatively to their frequencies. For this type of graph, only the option to export the graph and limit the number of items in the graph are relevant in the user settings. | Finally, the graph can be switched to a "word cloud," which displays a group of examined items (in our example, word forms) in sizes corresponding relatively to their frequencies. For this type of graph, only the option to export the graph and limit the number of items in the graph are relevant in the user settings. |
| |
[{{:en::manualy:kontext:fqdist-word-cloud_en.png?direct&400|Visualization type: Word cloud }}] | [{{:en::manualy:kontext:fqdist-word-cloud_en.png?direct&350|Visualization type: Word cloud }}] |
\\ | \\ |
| |
===== Custom settings of frequency distribution ===== | ===== Custom settings of frequency distribution ===== |
| |
The form which appears after clicking on the option **Frequency distribution → Custom** consists of two sections: | The form which appears after clicking on the menu item **Frequency → Custom** offers four options: |
| |
- form for multilevel frequency distribution (which can be used to analyze [[en:pojmy:atributy_pozicni|positional attributes]]) such as word, lemma, sublemma, tag, verbtag, etc.) | - multilevel frequency distribution (which can be used to analyze [[en:pojmy:atributy_pozicni|positional attributes]]) such as word, lemma, sublemma, tag, verbtag, etc.) |
- form for frequency distribution according to the [[en:pojmy:atributy_strukturni|structure attributes]] (such as ''[[en:pojmy:txtype|txtype]]'', ''[[en:pojmy:medium|med]]'' or ''[[en:pojmy:srclang|srclang]]'') | - frequency distribution according to the [[en:pojmy:atributy_strukturni|structure attributes]] (such as ''[[en:pojmy:txtype|txtype]]'', ''[[en:pojmy:medium|med]]'' or ''[[en:pojmy:srclang|srclang]]'') |
- FIXME dispersion showing the distribution of the searched concordances across the entire corpus FIXME | - dispersion plot showing the distribution of the searched concordance across the entire corpus |
- form for frequency distribution reflecting the two-attribute interrelationship (both positional and structure attributes) | - 2-dimensional frequency distribution reflecting the relationship between two attributes (both positional and structure attributes) |
| |
[{{ :en:manualy:kontext:fqdist-pozice_en.png?direct&400|Form for multilevel frequency distribution ([[en:pojmy:atributy_pozicni|positional attributes]]) }}] | [{{ :en:manualy:kontext:fqdist-pozice_en.png?direct&300|Form for multilevel frequency distribution ([[en:pojmy:atributy_pozicni|positional attributes]]) }}] |
| |
==== Frequency distribution according to the positional attributes ==== | ==== Frequency distribution according to the positional attributes ==== |
Afterwards, it is necessary to select whether frequency distribution should be calculated regardless of the letter case. Selection of the option [[wp>Case_sensitivity|case-insensitive]] causes that all of the items are interpreted as having lower case, regardless of what type of case they actually have in the corpus. | Afterwards, it is necessary to select whether frequency distribution should be calculated regardless of the letter case. Selection of the option [[wp>Case_sensitivity|case-insensitive]] causes that all of the items are interpreted as having lower case, regardless of what type of case they actually have in the corpus. |
| |
[{{ :en:manualy:kontext:fqdist-reference_en.png?direct&400|Form for frequency distribution according to [[en:pojmy:atributy_strukturni|structural attributes]] }}] | [{{ :en:manualy:kontext:fqdist-reference_en.png?direct&300|Form for frequency distribution according to [[en:pojmy:atributy_strukturni|structural attributes]] }}] |
| |
In case of custom settings of frequency distribution, we do not need to restrict ourselves to KWIC only (unlike when working with quick selection). It can be calculated from any context position to the right or left from the wanted word. The item //position// in the form enables us to select not only positions from the left (the preceding) context (6L-1L), but also KWIC itself and positions to the right (the following) context (1R-6R). The numbering of the positions (according to both current and older notation) is summed up in the following table: | In case of custom settings of frequency distribution, we do not need to restrict ourselves to KWIC only (unlike when working with quick selection). It can be calculated from any context position to the right or left from the wanted word. The item //position// in the form enables us to select not only positions from the left (the preceding) context (6L-1L), but also KWIC itself and positions to the right (the following) context (1R-6R). The numbering of the positions (according to both current and older notation) is summed up in the following table: |
=== Usage example: frequency list according to text types === | === Usage example: frequency list according to text types === |
| |
[{{ :en:manualy:kontext:fqdist-txtype-drevo_en.png?direct&400|Frequency list of the text types and their groups of lemma //dřevo// }}] | [{{ :en:manualy:kontext:fqdist-txtype-drevo_en.png?direct&300|Frequency list of the text types and their groups of lemma //dřevo// }}] |
| |
The following example shows how to use frequency list when working with the [[en:cnk:syn2020|SYN2020]] corpus to search for a query of [[en:pojmy:lemma|lemma]] //dřevo// (''[lemma=%%"%%dřevo%%"%%]''): Frequency distribution of the values of structural attributes ''txtype'' and ''txtype_group'' of lemma //dřevo// (excluding the values with zero frequency). | The following example shows how to use frequency list when working with the [[en:cnk:syn2020|SYN2020]] corpus to search for a query of [[en:pojmy:lemma|lemma]] //dřevo// (''[lemma=%%"%%dřevo%%"%%]''): Frequency distribution of the values of structural attributes ''txtype'' and ''txtype_group'' of lemma //dřevo// (excluding the values with zero frequency). |
==== Disperze ==== | ==== Disperze ==== |
| |
FIXME Funkce [[pojmy:frekvence#disperze_jevu|Disperze]] umožňuje graficky znázornit rozložení daného vyhledaného jevu napříč textem/korpusem. V úvodním formuláři je třeba nastavit počet úseků (nejvýše 1000), na něž bude korpus pro účel zobrazení disperze rozdělen. Ve výsledném grafu jsou pak na ose //y// zaneseny počty výskytů vyhledaného jevu pro každý úsek. | The [[pojmy:frekvence#disperze_jevu|Dispersion]] function allows you to graphically represent the distribution of a given searched phenomenon across the text/corpus. In the initial form you need to set the number of sections (maximum 1000) into which the corpus will be divided for the purpose of displaying the dispersion. The resulting graph then shows the number of occurrences of the searched phenomenon within each section on the y-axis. |
[{{:manualy:kontext:disperze.png?direct&450|Disperze lemmatu //dřevo// (rozdělení na 100 úseků) v SYN2020 FIXME}}] | |
| [{{en:manualy:kontext:disperze.png?direct&450|Dispersion of the lemma //dřevo// (division into 100 sections) in SYN2020}}] |
| |
| |