en:obc:specific_query, current revision 2020/02/27 12:23 (jankocek); previous revision 2020/02/19 12:35 (michalskrabal)
**Searching the corpus**
| |
To search the corpus with a specific query, open the KonText interface and make sure the OBC is selected. You can choose any query type; for this lesson, let’s use the basic type. Suppose we are interested in the language of women in the 19<sup>th</sup> century who were convicted of theft and either transported or sentenced to death, and we want to know how frequently they used interrogative sentences. To find interrogative sentences, simply type ''?'' into the search box. To specify the characteristics of the utterances we are looking for, click on //Restrict search//.
| |
{{:en:obc:l6_1.png?direct&500|}}
  * **Trial account**: includes information about the defendants, witnesses, victims, descriptions of the crimes and transcriptions of the testimonies
| |
First, to get the actual utterances of the defendants, select the //trialAccount// category only. As we are interested in the language of the 19<sup>th</sup> century, delimit the time span in the //text.year// box. In the //text.offenceCategory// box you will find many different combinations of [[https://www.oldbaileyonline.org/static/Crimes.jsp|offences]] and, as mentioned above, you need to be careful when making your selection. Multiple offences separated by a vertical bar indicate that there were multiple defendants at the trial. Working out which person committed which crime and what punishment each received can be quite demanding, as you would have to go through each trial account individually and read the transcription.
| |
So, to make sure you include only people convicted of theft, select the options which contain nothing but theft. Here you have a number of choices: //theft//, //violentTheft// or //theft | violentTheft//. Selecting all of them still ensures that only people convicted of theft are included in your search. However, because the other categories which include //theft// (e.g. //deception | sexual | theft//) are left out, the search will not cover //all// the trials which deal with the offence of theft.
{{:en:obc:l6_2.png?direct&800|}}
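The selection rule for combined offence labels can be illustrated outside KonText with a short Python sketch. It is not part of the interface: the function names are hypothetical, and the list of labels is a small hand-picked sample rather than the full contents of the //text.offenceCategory// box. The sketch keeps a label only if every offence in it is a kind of theft, and separately lists the labels that involve theft but are skipped.

```python
# Offence names that count as "only theft" for this lesson.
ONLY_THEFT = {"theft", "violentTheft"}


def offences(label):
    """Split a combined label such as 'theft | violentTheft' into its parts."""
    return [part.strip() for part in label.split("|") if part.strip()]


def is_theft_only(label):
    """True if the label is non-empty and every offence in it is a theft type."""
    parts = offences(label)
    return bool(parts) and all(part in ONLY_THEFT for part in parts)


def involves_theft(label):
    """True if at least one offence in the label is a theft type."""
    return any(part in ONLY_THEFT for part in offences(label))


# Hand-picked sample labels (an assumption, not the real list of options).
labels = [
    "theft",
    "violentTheft",
    "theft | violentTheft",
    "deception | sexual | theft",
    "deception",
]

# Labels to tick in the box.
selected = [label for label in labels if is_theft_only(label)]

# Labels that deal with theft but are left out of the search.
skipped = [
    label for label in labels
    if involves_theft(label) and not is_theft_only(label)
]

print(selected)
print(skipped)
```

The second list shows the trade-off described above: trials labelled with theft together with another offence are excluded, so the result does not cover every theft trial.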
| |
Next, the [[https://www.oldbaileyonline.org/static/Punishment.jsp|punishment]] needs to be selected. Find the //text.punishmentCategory// box and select //death//, //death | transport// and //transport// (for more information on offences, verdicts and punishments, see [[https://www.oldbaileyonline.org/static/History.jsp|here]]). You also need to select the role of the speaker, so as not to include utterances spoken by, for example, the judge. Go to the //utterance.speaker_role// box and select //Defendant//. Lastly, find the //utterance.speaker_sex// box and select //f// (female). You can narrow your search further by modifying any of the categories available. When you are satisfied with your selection, click the search button. You can view the Text types frequency list (//Frequency → Text Types//) to see all variables, including those which you did not specify in your query.
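As a rough summary, the restrictions chosen in this lesson can be written down as a set of allowed values per attribute. The following Python sketch is only an illustration of how the restrictions combine, not how KonText works internally. The attribute names are taken from the boxes above; the year range 1801-1900 is an assumed reading of "19th century", the //trialAccount// selection is left out because its attribute name is not given here, and the sample records are invented.

```python
# Allowed values per attribute, as selected in the Restrict search form.
SELECTION = {
    "text.offenceCategory": {"theft", "violentTheft", "theft | violentTheft"},
    "text.punishmentCategory": {"death", "death | transport", "transport"},
    "utterance.speaker_role": {"Defendant"},
    "utterance.speaker_sex": {"f"},
}

# Assumption: the 19th century is taken as the years 1801 to 1900 inclusive.
YEARS = range(1801, 1901)


def matches(meta):
    """Return True if an utterance's metadata satisfies every restriction."""
    if meta.get("text.year") not in YEARS:
        return False
    return all(meta.get(attr) in allowed for attr, allowed in SELECTION.items())


# Invented example records (hypothetical metadata, not corpus data).
defendant = {
    "text.year": 1823,
    "text.offenceCategory": "theft",
    "text.punishmentCategory": "transport",
    "utterance.speaker_role": "Defendant",
    "utterance.speaker_sex": "f",
}
judge = dict(defendant, **{"utterance.speaker_role": "Judge"})
too_early = dict(defendant, **{"text.year": 1790})

print(matches(defendant), matches(judge), matches(too_early))
```

Within one attribute the selected values are alternatives (any of them may match), while the attributes themselves must all be satisfied at once.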
| |
<WRAP round help 40%>
</WRAP>
| |
You can find the solution [[en:obc:solution#lesson_6|here]].
| |
----

**If you are ready, you can continue to [[en:obc:frequency_distribution|Lesson 7]].**

----