Differences
This shows you the differences between two versions of the page.
Next revision | Previous revisionNext revisionBoth sides next revision | ||
en:kurz:hledani_v_paralelnim_korpusu [2016/06/22 15:59] – created alexandrrosen | en:kurz:hledani_v_paralelnim_korpusu [2016/12/11 10:58] – veronikapojarova | ||
---|---|---|---|
Line 1: | Line 1: | ||
- | ====Using KonText to search a parallel corpus==== | + | =====Using KonText to search a parallel corpus===== |
- | ===Main differences in comparison with the Park interface=== | + | We make an effort to keep the following instructions up-to-date. However, the search interface is being developed and inconsistencies may occur. We are sorry about any such problems. Please report such cases using the [[https:// |
+ | |||
+ | |||
+ | ====Main differences in comparison with the Park interface==== | ||
* Same environment for searching both monolingual and parallel corpora | * Same environment for searching both monolingual and parallel corpora | ||
Line 14: | Line 17: | ||
* Query results cannot be displayed horizontally, | * Query results cannot be displayed horizontally, | ||
- | ===Access to the corpus=== | + | ====Access to the corpus==== |
InterCorp is accessible using the same login as for other CNC corpora. If you do not have a login for the CNC corpora yet you can get it for free by registering on the [[https:// | InterCorp is accessible using the same login as for other CNC corpora. If you do not have a login for the CNC corpora yet you can get it for free by registering on the [[https:// | ||
Line 20: | Line 23: | ||
[[https:// | [[https:// | ||
- | ===Selecting languages=== | + | ====Selecting languages==== |
Click on one of the languages, such as **InterCorp v9 Czech** to choose the primary language for your search. For the primary language a non-empty query is required. The query box for this language must be filled in. The order of the languages also matters when you wish to create a subcorpus (see below). The range of texts to create subcorpora can be specified only for the primary language. In other respects, the order of languages is irrelevant. | Click on one of the languages, such as **InterCorp v9 Czech** to choose the primary language for your search. For the primary language a non-empty query is required. The query box for this language must be filled in. The order of the languages also matters when you wish to create a subcorpus (see below). The range of texts to create subcorpora can be specified only for the primary language. In other respects, the order of languages is irrelevant. | ||
Line 26: | Line 29: | ||
After choosing the primary language a brief description of the selected part of the corpus appears in the page heading together with its size, measured in the number of tokens (so-called positions, i.e. word forms and punctuation symbols). To add an additional language choose the relevant corpus part within the frame **Aligned corpora** and then click on **Add**. For the additional language a query need not be entered. Tick **include empty lines** if you wish the result to include concordances that do not have an equivalent in the given language. More languages can be added in a similar way. Searching one part of the parallel corpus only, i.e. within a single language, is also possible. If so, do not add other languages and proceed to selecting the type of query and specifying the query itself. | After choosing the primary language a brief description of the selected part of the corpus appears in the page heading together with its size, measured in the number of tokens (so-called positions, i.e. word forms and punctuation symbols). To add an additional language choose the relevant corpus part within the frame **Aligned corpora** and then click on **Add**. For the additional language a query need not be entered. Tick **include empty lines** if you wish the result to include concordances that do not have an equivalent in the given language. More languages can be added in a similar way. Searching one part of the parallel corpus only, i.e. within a single language, is also possible. If so, do not add other languages and proceed to selecting the type of query and specifying the query itself. | ||
- | ===Entering a query=== | + | ====Entering a query==== |
You can choose from six **Query Types** (see below). All types of queries except **Basic** are case-sensitive and can handle regular expressions. For the query type **Word Form** the default is case-insensitive but **Match case** can be turned on. For the second and other languages you can also specify whether the concordances should or should not include terms specified in the query box. | You can choose from six **Query Types** (see below). All types of queries except **Basic** are case-sensitive and can handle regular expressions. For the query type **Word Form** the default is case-insensitive but **Match case** can be turned on. For the second and other languages you can also specify whether the concordances should or should not include terms specified in the query box. | ||
- | * **Basic** - searches for the given word form, case-insensitive, | + | * **Basic** - searches for the given word form, case-insensitive, |
* **Lemma** - searches for all forms of the given lemma | * **Lemma** - searches for all forms of the given lemma | ||
* **Phrase** - searches for the given sequence of word forms | * **Phrase** - searches for the given sequence of word forms | ||
Line 44: | Line 47: | ||
Click on **Make Concordance** to evaluate the query. | Click on **Make Concordance** to evaluate the query. | ||
- | ===Query results=== | + | ====Query results==== |
The response to the query is shown as a list of concordances. The menu on the top of the page can be used to modify the view, to store, sort or filter the concordances, | The response to the query is shown as a list of concordances. The menu on the top of the page can be used to modify the view, to store, sort or filter the concordances, | ||
- | ===Filtering - restricting the range of searched texts using the metadata=== | + | ====Filtering - restricting the range of searched texts using the metadata==== |
By default, searching is performed on all texts in the selected languages. The range of texts can be restricted before evaluating the query by clicking on the **Specify query according to the meta-information** item below. The restrictions apply to the primary language. To select which texts should be included tick appropriate values of the relevant attributes. If no value for a given attribute is selected, the search range is not restricted by that attribute. This is equivalent to selecting all values by clicking the **Select All** button. | By default, searching is performed on all texts in the selected languages. The range of texts can be restricted before evaluating the query by clicking on the **Specify query according to the meta-information** item below. The restrictions apply to the primary language. To select which texts should be included tick appropriate values of the relevant attributes. If no value for a given attribute is selected, the search range is not restricted by that attribute. This is equivalent to selecting all values by clicking the **Select All** button. | ||
Line 54: | Line 57: | ||
If you wish to search only original Czech fiction, check the boxes **fiction** and **cs** in the corresponding lists of attributes. To see the specific texts selected using the restrictions above, click on **refine selection** below – the titles show up below **div.title**. | If you wish to search only original Czech fiction, check the boxes **fiction** and **cs** in the corresponding lists of attributes. To see the specific texts selected using the restrictions above, click on **refine selection** below – the titles show up below **div.title**. | ||
- | ===Subcorpora=== | + | ====Subcorpora==== |
Click on **Subcorpora/ | Click on **Subcorpora/ | ||
- | In addition to ticking the appropriate attribute values the subcorpus can be specified in more detail by using the Custom ' | + | In addition to ticking the appropriate attribute values the subcorpus can be specified in more detail by using the **Custom ' |
+ | < | ||
txtype=" | txtype=" | ||
- | The same result would be achieved by checking the boxes drama and cs in the corresponding lists of attributes. To create the subcorpus, name it in the box New subcorpus name and click on Create Subcorpus. While entering a query the subcorpus can be invoked by choosing one of the Available subcorpora in the Subcorpora menu item. | + | </ |
+ | |||
+ | The same result would be achieved by checking the boxes **drama** and **cs** in the corresponding lists of attributes. To create the subcorpus, name it in the box **New subcorpus name** and click on **Create Subcorpus**. While entering a query the subcorpus can be invoked by choosing one of the **Available subcorpora** in the **Subcorpora** menu item. |