Differences
This shows you the differences between two versions of the page.
Both sides previous revisionPrevious revisionNext revision | Previous revisionNext revisionBoth sides next revision | ||
en:kurz:hledani_v_paralelnim_korpusu [2016/06/22 16:12] – Alexandr Rosen | en:kurz:hledani_v_paralelnim_korpusu [2019/12/19 23:42] – [Access to the corpus] Alexandr Rosen | ||
---|---|---|---|
Line 1: | Line 1: | ||
- | ====Using KonText to search a parallel corpus==== | + | =====Using KonText to search a parallel corpus===== |
We make an effort to keep the following instructions up-to-date. However, the search interface is being developed and inconsistencies may occur. We are sorry about any such problems. Please report such cases using the [[https:// | We make an effort to keep the following instructions up-to-date. However, the search interface is being developed and inconsistencies may occur. We are sorry about any such problems. Please report such cases using the [[https:// | ||
- | ===Main differences in comparison with the Park interface=== | + | ====Main differences in comparison with the Park interface==== |
* Same environment for searching both monolingual and parallel corpora | * Same environment for searching both monolingual and parallel corpora | ||
Line 17: | Line 17: | ||
* Query results cannot be displayed horizontally, | * Query results cannot be displayed horizontally, | ||
- | ===Access to the corpus=== | + | ====Access to the corpus==== |
- | InterCorp is accessible using the same login as for other CNC corpora. If you do not have a login for the CNC corpora yet you can get it for free by registering on the [[https:// | + | InterCorp is accessible using the same login as for other CNC corpora. If you do not have a login for the CNC corpora yet you can get it for free by registering on the [[https:// |
[[https:// | [[https:// | ||
- | ===Selecting languages=== | + | ====Selecting languages==== |
Click on one of the languages, such as **InterCorp v9 Czech** to choose the primary language for your search. For the primary language a non-empty query is required. The query box for this language must be filled in. The order of the languages also matters when you wish to create a subcorpus (see below). The range of texts to create subcorpora can be specified only for the primary language. In other respects, the order of languages is irrelevant. | Click on one of the languages, such as **InterCorp v9 Czech** to choose the primary language for your search. For the primary language a non-empty query is required. The query box for this language must be filled in. The order of the languages also matters when you wish to create a subcorpus (see below). The range of texts to create subcorpora can be specified only for the primary language. In other respects, the order of languages is irrelevant. | ||
Line 29: | Line 29: | ||
After choosing the primary language a brief description of the selected part of the corpus appears in the page heading together with its size, measured in the number of tokens (so-called positions, i.e. word forms and punctuation symbols). To add an additional language choose the relevant corpus part within the frame **Aligned corpora** and then click on **Add**. For the additional language a query need not be entered. Tick **include empty lines** if you wish the result to include concordances that do not have an equivalent in the given language. More languages can be added in a similar way. Searching one part of the parallel corpus only, i.e. within a single language, is also possible. If so, do not add other languages and proceed to selecting the type of query and specifying the query itself. | After choosing the primary language a brief description of the selected part of the corpus appears in the page heading together with its size, measured in the number of tokens (so-called positions, i.e. word forms and punctuation symbols). To add an additional language choose the relevant corpus part within the frame **Aligned corpora** and then click on **Add**. For the additional language a query need not be entered. Tick **include empty lines** if you wish the result to include concordances that do not have an equivalent in the given language. More languages can be added in a similar way. Searching one part of the parallel corpus only, i.e. within a single language, is also possible. If so, do not add other languages and proceed to selecting the type of query and specifying the query itself. | ||
- | ===Entering a query=== | + | ====Entering a query==== |
You can choose from six **Query Types** (see below). All types of queries except **Basic** are case-sensitive and can handle regular expressions. For the query type **Word Form** the default is case-insensitive but **Match case** can be turned on. For the second and other languages you can also specify whether the concordances should or should not include terms specified in the query box. | You can choose from six **Query Types** (see below). All types of queries except **Basic** are case-sensitive and can handle regular expressions. For the query type **Word Form** the default is case-insensitive but **Match case** can be turned on. For the second and other languages you can also specify whether the concordances should or should not include terms specified in the query box. | ||
- | * **Basic** - searches for the given word form, case-insensitive, | + | * **Basic** - searches for the given word form, case-insensitive, |
* **Lemma** - searches for all forms of the given lemma | * **Lemma** - searches for all forms of the given lemma | ||
* **Phrase** - searches for the given sequence of word forms | * **Phrase** - searches for the given sequence of word forms | ||
Line 47: | Line 47: | ||
Click on **Make Concordance** to evaluate the query. | Click on **Make Concordance** to evaluate the query. | ||
- | ===Query results=== | + | ====Query results==== |
The response to the query is shown as a list of concordances. The menu on the top of the page can be used to modify the view, to store, sort or filter the concordances, | The response to the query is shown as a list of concordances. The menu on the top of the page can be used to modify the view, to store, sort or filter the concordances, | ||
- | ===Filtering - restricting the range of searched texts using the metadata=== | + | ====Filtering - restricting the range of searched texts using the metadata==== |
By default, searching is performed on all texts in the selected languages. The range of texts can be restricted before evaluating the query by clicking on the **Specify query according to the meta-information** item below. The restrictions apply to the primary language. To select which texts should be included tick appropriate values of the relevant attributes. If no value for a given attribute is selected, the search range is not restricted by that attribute. This is equivalent to selecting all values by clicking the **Select All** button. | By default, searching is performed on all texts in the selected languages. The range of texts can be restricted before evaluating the query by clicking on the **Specify query according to the meta-information** item below. The restrictions apply to the primary language. To select which texts should be included tick appropriate values of the relevant attributes. If no value for a given attribute is selected, the search range is not restricted by that attribute. This is equivalent to selecting all values by clicking the **Select All** button. | ||
Line 57: | Line 57: | ||
If you wish to search only original Czech fiction, check the boxes **fiction** and **cs** in the corresponding lists of attributes. To see the specific texts selected using the restrictions above, click on **refine selection** below – the titles show up below **div.title**. | If you wish to search only original Czech fiction, check the boxes **fiction** and **cs** in the corresponding lists of attributes. To see the specific texts selected using the restrictions above, click on **refine selection** below – the titles show up below **div.title**. | ||
- | ===Subcorpora=== | + | ====Subcorpora==== |
Click on **Subcorpora/ | Click on **Subcorpora/ |