AplikaceAplikace
Nastavení

This is an old revision of the document!


Menu: Corpora

Available Corpora

A list of all the corpora available to the user are accessible via the menu itemCorpora → Available corpora. Due to the large number of corpora and their respective versions, following the first login, the user is shown a pre-filtered list of corpora with the label “Czech” (containing both the SYN series corpora, and the ORAL series, and many specialized and hosted corpora). A complete list of all corpora in alphabetical order appears after clicking on the label “Reset,“ on the far left. With all following visits the KonText interface will remember the user’s most recent settings and will display a list just as how the user had himself compiled it during his last visit.

Next to corpora which are limited in some way, usually due to licensing, there is an icon in the shape of a lock. If the user is interested in gaining access to such a corpus, he can put in his request by clicking on the icon and the corpus will be, if possible, made accessible to him.

Similarly as with corpus selection, the list of corpora can be filtered based on various criteria before the search itself. One of the possibilities is the use of the so-called labels characterizing each corpus. Furthermore it is possible to filter by the name of the corpus or its part, or according to its size (bookmark Advanced). By clicking on the star in the right-hand column we can add the corpus to our Favourites, on the other hand by turning it off we remove the corpus from the favourites.

Subcorpora and parallel corpora in the favourites list

As a favourite item we may label not only an entire independent corpus, but also a corpus including Subcorpora or aligned groups of two or three corpora within a parallel corpus InterCorp, which significantly speeds up our work. Owing to the fact that not all combinations of Subcorpora and/or aligned corpora can appear in the list of available corpora, it is necessary to add them to the Favourites list when they are selected as the current corpus. It is generally the case that by clicking on the star next to the corpus (subcorpus) name at a time when the given corpus (subcorpus) is selected as current for searching, the entire combination is added to the Favourites (including aligned corpora if there are any).

Working with subcorpora

Creating virtual subcorpora (i.e. subsets of texts from the original corpus) is concentrated in the KonText interface as part of the second item on the main menu. Here it is possible firstly to create one’s own subcorpus and secondly to manage the current subcorpora (searching, deleting etc.).

Subcorpora are tied to the user account. Virtual subcorpora are therefore available to registered users from any computer, provided that they sign in with their username and password.

Generally speaking, a subcorpus is only an additional condition which is applied to all queries in the search. For example, if we are searching for the lemma dřevo in the fiction subcorpus SYN2010:beletrie, the query will automatically add the condition within, which specifies the texts of the corpus SYN2010 in which the search is to be conducted.

Creating a new subcorpus

Creating a new subcorpus