Differences

This shows you the differences between two versions of the page.

en:manualy:kontext:novy_dotaz [2016/11/08 16:57] veronikapojarovaen:manualy:kontext:novy_dotaz [2016/12/11 11:02] veronikapojarova
Line 3: Line 3:
 [{{:en:manualy:kontext:novy_dotaz_en.png?direct&300|Form for creating a query}}]
 
-With the selection **Query → New query**, it is always possible to begin a new corpus search. By clicking on this option we abandon our previous query and any results produced with it, and we begin with a new search. The following text primarily deals with creating queries in single language corpora; the specifics of searching the parallel corpora [[en:cnk:intercorp|InterCorp]] are described in detail in the [[en:kurz:hledani_v_paralelnim_korpusu#paralelni_korpusy_v_rozhrani_kontext|bonus tutorial]] of the basic course on working with the ČNK (Czech National Corpus).
+With the selection **Query → New query**, it is always possible to begin a new corpus search. By clicking on this option we abandon our previous query and any results produced with it, and we begin with a new search. The following text primarily deals with creating queries in single language corpora; the specifics of searching the parallel corpora [[en:cnk:intercorp|InterCorp]] are described in detail in the [[en:kurz:hledani_v_paralelnim_korpusu#paralelni_korpusy_v_rozhrani_kontext|bonus tutorial]] of the basic course on working with the CNC (Czech National Corpus).
 
 After clicking on the item New Query, a basic search menu appears for the user. Within this form it is possible to select a corpus in which the search will be conducted, and the [[en:eebo:first_query#query_types|type of query]] which will be used. The query itself is inserted into the input line. Another part of the form is an interactive **international keyboard** for writing special characters (especially when searching in non-Czech texts and when inserting special characters of the Corpus Query Language, [[en:pojmy:cql|CQL]]). Previously used queries can be found under the link //Previous Queries//, located above the query line.
Line 15: Line 15:
 [{{ :en:manualy:kontext:komponenta-vyber-korpusu.png?direct&350|Corpus selection: featured and favorite corpora}}]
 
-    - **My list** with a quick, single click selection of corpora. This quick selection contains **//favourite corpora//**, which can be selected by the user, and also the so-called **//featured corpora//**: a default list of several corpora, which the ČNK considers to be particularly important in the individual areas of production. Having them all in one place simplifies the selection of a corpus especially for beginning users of the ČNK. Favourite corpora can be selected either on the page with all the [[en:manualy:kontext:subkorpus?&#dostupne_korpusy|available corpora]], or when working with them at the time of query input (such corpora are labelled with a yellow star).
-    - **All corpora** with the possibility of searching all available corpora with the aid of so-called **labels**, which are used to characterize the corpora (a typical corpus has several labels, e.g. SYN2010: written, synchronic, Czech, SYN family, representative). For example, if you are looking for a web corpus of Czech, all you have to do is select the labels "Czech" + "web", and all the relevant corpora made available by the ČNK will appear. The search may be further refined by typing part of the corpus name or its description into the search bar, and the resulting list of corpora is interactively filtered based on the keywords. However, it must be noted that for spatial reasons the list only shows the first 25 items; if the list is too long, the query must be specified further with the addition of another label, or by searching for a part of its name.
+    - **My list** with a quick, single click selection of corpora. This quick selection contains **//favourite corpora//**, which can be selected by the user, and also the so-called **//featured corpora//**: a default list of several corpora, which the CNC considers to be particularly important in the individual areas of production. Having them all in one place simplifies the selection of a corpus especially for beginning users of the CNC. Favourite corpora can be selected either on the page with all the [[en:manualy:kontext:subkorpus?&#dostupne_korpusy|available corpora]], or when working with them at the time of query input (such corpora are labelled with a yellow star).
+    - **All corpora** with the possibility of searching all available corpora with the aid of so-called **labels**, which are used to characterize the corpora (a typical corpus has several labels, e.g. SYN2010: written, synchronic, Czech, SYN family, representative). For example, if you are looking for a web corpus of Czech, all you have to do is select the labels "Czech" + "web", and all the relevant corpora made available by the CNC will appear. The search may be further refined by typing part of the corpus name or its description into the search bar, and the resulting list of corpora is interactively filtered based on the keywords. However, it must be noted that for spatial reasons the list only shows the first 25 items; if the list is too long, the query must be specified further with the addition of another label, or by searching for a part of its name.
  
 **Example**: The user searches in the tab //All corpora// for a current version of the English section of the parallel corpus [[en:cnk:intercorp|InterCorp]]. He first selects the labels "InterCorp" and "current version"; the first 25 corpora which conform to the specified conditions then appear on the list, although InterCorp contains many more languages. The corpora not displayed can be accessed with the help of further filtering, for example by typing part of the corpus name or language (please note that the names of the individual InterCorp versions are in English). After finding the desired corpus and clicking on it, the corpus becomes the current corpus for searching, and it is at the same time possible to mark it as favourite with a star. The corpus is then added to the list of favourite corpora and can be accessed quickly and easily with a single click.
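
The label-based selection described in this hunk can be approximated by a short sketch. It is only an illustration of the described behaviour, not the KonText implementation: the ''Corpus'' class, the ''filter_corpora'' function and the ''demo_intercorp_*'' catalogue entries are all hypothetical, and only the limit of 25 displayed items and the labels "InterCorp" and "current version" are taken from the text.

```python
from dataclasses import dataclass, field

MAX_SHOWN = 25  # the manual states that only the first 25 items are listed


@dataclass
class Corpus:
    # Hypothetical record; the real corpus metadata is not described in the text.
    name: str
    description: str = ""
    labels: frozenset = field(default_factory=frozenset)


def filter_corpora(corpora, labels=(), text=""):
    """Keep corpora that carry every selected label and whose name or
    description contains the typed text, then cut the list to MAX_SHOWN."""
    wanted = set(labels)
    needle = text.lower()
    hits = [
        c for c in corpora
        if wanted <= c.labels
        and (needle in c.name.lower() or needle in c.description.lower())
    ]
    return hits[:MAX_SHOWN]


# Made-up catalogue: 40 placeholder corpora plus one English section at the end.
tags = frozenset({"InterCorp", "current version"})
catalogue = [
    Corpus(f"demo_intercorp_{i:02d}", f"hypothetical language {i}", tags)
    for i in range(40)
]
catalogue.append(Corpus("demo_intercorp_en", "hypothetical English section", tags))

by_labels = filter_corpora(catalogue, ["InterCorp", "current version"])
refined = filter_corpora(catalogue, ["InterCorp", "current version"], "english")
print(len(by_labels), [c.name for c in refined])
```

With labels alone, the English section falls outside the 25 displayed items; typing part of its description brings it into the list, which mirrors the refinement step in the example.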
Line 26: Line 26:
 ^ Query type ^ What it’s for ^ How it works ^ What it does ^ Examples ^
 ^ Basic Query | for familiarization with the corpus| Searches for the expression as a node form regardless of case; in the case of a dictionary form (lemma), all of its possible forms are also searched for. | without [[en:pojmy:regularni_vyrazy|regular expressions]] (RE), [[wp>Case_sensitivity|case-insensitive]] | ''old house'' > //old house, older houses, oldest house…//\\ ''this time'' > //this time// |
-^ Lemma | for the analysis of an entire paradigm/lexeme | Finds all forms associated with the given [[wp>Lemma_(psycholinguistics)|lemma]]. | RE (it is possible to use regular expressions), case-sensitive, possibility of specifying word class | ''see'' > //see, saw, seen, seeing…//\\ ''new'' > //new, newer, newest…// |
+^ Lemma | for the analysis of an entire paradigm/lexeme | Finds all forms associated with the given [[en:pojmy:lemma|lemma]]. | RE (it is possible to use regular expressions), case-sensitive, possibility of specifying word class | ''see'' > //see, saw, seen, seeing…//\\ ''new'' > //new, newer, newest…// |
 ^ Phrase | for a multiword combination in the given form | Finds the exact wording of a phrase. | RE, case-sensitive | ''black dog'' > //black dog//\\ ''new car'' > //new car//\\ ''newer car'' > //newer car// |
 ^ Node form | for the analysis of one specific form | Finds the exact form. | RE, case-in/sensitive (possible to select Match case) | ''cat'' > //cat//\\ ''cats'' > //cats//\\ ''cat.*'' > //cat, cats, Cats, CATS…// |
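
The //Node form// row of the table can be illustrated with a minimal sketch. It uses Python's ''re'' module as a stand-in for the corpus search engine, which is an assumption: the real engine may use a different regular expression dialect. The function ''node_form_matches'' and the token list are hypothetical; the pattern ''cat.*'' and its expected matches come from the table.

```python
import re


def node_form_matches(pattern, token, match_case=False):
    """Return True if the whole token matches the pattern.

    Whole-token matching is assumed because the table maps ''cat'' to
    //cat// only; "Match case" switches off case-insensitive comparison.
    """
    flags = 0 if match_case else re.IGNORECASE
    return re.fullmatch(pattern, token, flags) is not None


tokens = ["cat", "cats", "Cats", "CATS", "concat", "dog"]

insensitive = [t for t in tokens if node_form_matches("cat.*", t)]
sensitive = [t for t in tokens if node_form_matches("cat.*", t, match_case=True)]

print(insensitive)  # ['cat', 'cats', 'Cats', 'CATS']
print(sensitive)    # ['cat', 'cats']
```

Because the pattern must cover the whole token, //concat// is not returned even though it contains //cat//.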
Line 37: Line 37:
   - Corpora which are not lemmatized, i.e. do not offer //lemma// as a [[en:eebo:first_query#query_types|query type]].
   - Some query types (only those where it makes sense) allow for the user to specify whether the query should be assessed with respect to capitalization ([[wp>Case_sensitivity|case-sensitive]]), or without considering upper/lower case ([[wp>Case_sensitivity|case-insensitive]]).
-  - In the case of the [[en:eebo:first_query#query_types|query types]] [[wp>Lemma_(psycholinguistics):lemma]] and [[en:pojmy:word]] it is also possible to specify word class (position attribute [[en:pojmy:pos|pos]]).
+  - In the case of the [[en:eebo:first_query#query_types|query types]] [[en:pojmy:lemma|lemma]] and [[en:pojmy:word]] it is also possible to specify word class (position attribute [[en:pojmy:pos|pos]]).
   - The [[en:pojmy:dotazovaci_jazyk|CQL]] query type also allows for the insertion of interactively generated [[en:pojmy:tag|morphological tags]] (with corpora which are tagged in this way) or conditions specifying texts in which the search is to be carried out (the condition [[en:pojmy:within]]).
   - A very specific way of inputting queries for [[en:kurz:hledani_v_paralelnim_korpusu|searches in parallel corpora]].
Line 89: Line 89:
   * option "Include non-words", which widens the search to words which are not composed only of alphabetic characters
 
-Among the output option settings we can find a selection of either the absolute [[en:pojmy:frekvence|frequency]], [[en:pojmy:arf|ARF]] or a document count. Furthermore there is also the possibility to choose a specific output attribute (or attributes). These attributes **need not be** identical to the positional attribute selected in the top section of the form, on which all the above mentioned filters are applied. This enables us to create e.g. a frequency list of all verbs by selecting the attribute [[en:pojmy:tag|tag]] in the top section, applying the condition for a verb as in [[en:seznamy:tagy#pozice_1_-_slovni_druh|V.*]] and finally by "switching" the output type to [[wp>Lemma_(psycholinguistics)|lemma]] – an example of such a query is shown in the picture.
+Among the output option settings we can find a selection of either the absolute [[en:pojmy:frekvence|frequency]], [[en:pojmy:arf|ARF]] or a document count. Furthermore there is also the possibility to choose a specific output attribute (or attributes). These attributes **need not be** identical to the positional attribute selected in the top section of the form, on which all the above mentioned filters are applied. This enables us to create e.g. a frequency list of all verbs by selecting the attribute [[en:pojmy:tag|tag]] in the top section, applying the condition for a verb as in [[en:seznamy:tagy#pozice_1_-_slovni_druh|V.*]] and finally by "switching" the output type to [[en:pojmy:lemma|lemma]] – an example of such a query is shown in the picture.
 
 ----