AplikaceAplikace
Nastavení

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revisionPrevious revision
Next revision
Previous revision
Last revisionBoth sides next revision
en:manualy:kontext:subkorpus [2016/06/10 14:29] – [Using subcorpora] veronikapojarovaen:manualy:kontext:subkorpus [2017/12/15 14:15] – [Creating a new subcorpus] michalskrabal
Line 3: Line 3:
 ====== Available Corpora ====== ====== Available Corpora ======
  
-A list of all the corpora available to the user are accessible via the menu item**Corpora → Available corpora**. Due to the large number of corpora and their respective versions, **following the first login**, the user is shown a pre-filtered list of corpora with the label “Czech” (containing both the SYN series corpora, and the ORAL series, and many specialized and hosted corpora). A complete **list of all corpora in alphabetical order** appears after clicking on the label “Reset,“ on the far left. With all following visits the KonText interface will remember the user’s most recent settings and will display a list just as how the user had himself compiled it during his last visit.+A list of all the corpora available to the user are accessible via the menu item **Corpora → Available corpora**. Due to the large number of corpora and their respective versions, **following the first login**, the user is shown a pre-filtered list of corpora with the label “Czech” (containing both the SYN series corpora, and the ORAL series, and many specialized and hosted corpora). A complete **list of all corpora in alphabetical order** appears after clicking on the label “Reset,“ on the far left. With all following visits the KonText interface will remember the user’s most recent settings and will display a list just as how the user had himself compiled it during his last visit.
  
 Next to corpora which are limited in some way, usually due to licensing, there is an **icon in the shape of a lock**. If the user is interested in gaining access to such a corpus, he can put in his request by clicking on the icon and the corpus will be, if possible, made accessible to him. Next to corpora which are limited in some way, usually due to licensing, there is an **icon in the shape of a lock**. If the user is interested in gaining access to such a corpus, he can put in his request by clicking on the icon and the corpus will be, if possible, made accessible to him.
Line 19: Line 19:
 Subcorpora are tied to the user account. Virtual subcorpora are therefore available to [[en:kurz:zaciname|registered]] users from any computer, provided that they sign in with their username and password. Subcorpora are tied to the user account. Virtual subcorpora are therefore available to [[en:kurz:zaciname|registered]] users from any computer, provided that they sign in with their username and password.
  
-Generally speaking, a subcorpus is only an additional condition which is applied to all queries in the search. For example, if we are searching for the lemma //dřevo// in the fiction subcorpus SYN2010:beletrie, the query will automatically add the condition [[en:pojmy:within|within]], which specifies the texts of the corpus [[cnk:syn2010|SYN2010]] in which the search is to be conducted.+Generally speaking, a subcorpus is only an additional condition which is applied to all queries in the search. For example, if we are searching for the lemma //dřevo// in the fiction subcorpus SYN2010:beletrie, the query will automatically add the condition [[en:pojmy:within|within]], which specifies the texts of the corpus [[en:cnk:syn2010|SYN2010]] in which the search is to be conducted.
  
 ===== Creating a new subcorpus ===== ===== Creating a new subcorpus =====
  
-[{{ :manualy:kontext:subkorpus_vytvorit.png?direct&300|Creating a new subcorpus }}]+[{{ :en:manualy:kontext:subkorpus_vytvorit.png?direct&300|Creating a new subcorpus }}]
  
 In the case that we want to, in the long term, work only with a specific group of texts in the given corpus, it pays off to create and save our own subcorpus on the server (on the other hand, with ad hoc searches in a subgroup of texts it is better to select the option [[en:manualy:kontext:novy_dotaz#specifikovat_dotaz_podle_metainformaci|Specify query according to the meta-information]] when typing a new query). In the case that we want to, in the long term, work only with a specific group of texts in the given corpus, it pays off to create and save our own subcorpus on the server (on the other hand, with ad hoc searches in a subgroup of texts it is better to select the option [[en:manualy:kontext:novy_dotaz#specifikovat_dotaz_podle_metainformaci|Specify query according to the meta-information]] when typing a new query).
  
-If we select **Corpora → Create new subcorpus**in the menu, a form for creating a permanent virtual subcorpus will appear. When creating a subcorpus it is necessary to specify:+If we select **Corpora → Create new subcorpus** in the menu, a form for creating a permanent virtual subcorpus will appear. When creating a subcorpus it is necessary to specify:
  
   - a default corpus, from which the text will be selected   - a default corpus, from which the text will be selected
Line 33: Line 33:
   - a condition based on which we select the text for the subcorpus   - a condition based on which we select the text for the subcorpus
  
-The condition can be specified with a [[en:pojmy:dotazovaci_jazyk|CQL]] query using the command [[kurz:subkorpusy|within]], or by selecting values of [[en:pojmy:atributy_strukturni|structural attributes]] from the ready selection. On the list of structural attribute values are numbers representing the text’s size in the given category (the number refers to the number of words or number of documents in the given category). Based on these numbers it is possible to create subcorpora with specific proportions.+The condition can be specified with a [[en:pojmy:dotazovaci_jazyk|CQL]] query using the command [[en:kurz:subkorpusy|within]], or by selecting values of [[en:pojmy:atributy_strukturni|structural attributes]] from the ready selection. On the list of structural attribute values are numbers representing the text’s size in the given category (the number refers to the number of words or number of documents in the given category). Based on these numbers it is possible to create subcorpora with specific proportions.
  
 Within this form it is possible to select those structural attribute values that interest us. The form does not contain all the structural attributes, but only those most frequently used in the given corpus (e.g. when searching in [[en:cnk:syn2010|SYN2010]] it is [[en:pojmy:txtype_group|txtype_group]], [[en:pojmy:txtype|txtype]], [[en:pojmy:genre|genre]], [[en:pojmy:medium|med]], [[en:pojmy:srclang|srclang]]). The abbreviations used can be found in the relevant section of [[en:seznamy:index|lists]]. Within this form it is possible to select those structural attribute values that interest us. The form does not contain all the structural attributes, but only those most frequently used in the given corpus (e.g. when searching in [[en:cnk:syn2010|SYN2010]] it is [[en:pojmy:txtype_group|txtype_group]], [[en:pojmy:txtype|txtype]], [[en:pojmy:genre|genre]], [[en:pojmy:medium|med]], [[en:pojmy:srclang|srclang]]). The abbreviations used can be found in the relevant section of [[en:seznamy:index|lists]].
Line 41: Line 41:
 ===== List of existing subcorpora ===== ===== List of existing subcorpora =====
  
-[{{ :manualy:kontext:subkorpus_prehled.png?direct&300|A list of the user’s existing subcorpora }}]+[{{ :en:manualy:kontext:subkorpus_prehled.png?direct&300|A list of the user’s existing subcorpora }}]
  
 The section **Corpora → My Subcorpora** provides a list of all the subcorpora defined by the user. Next to their name in the table is also their size (in the number of [[en:pojmy:pozice|positions]]) and the date they were created. Simultaneously the user may delete the subcorpora that he does not use anymore. The section **Corpora → My Subcorpora** provides a list of all the subcorpora defined by the user. Next to their name in the table is also their size (in the number of [[en:pojmy:pozice|positions]]) and the date they were created. Simultaneously the user may delete the subcorpora that he does not use anymore.