Differences

This shows you the differences between two versions of the page.

en:start [2016/11/09 16:33] – [What is a corpus?] Veronika Pojarová
en:start [2019/10/17 12:23] – [Manuals for CNC applications] Václav Cvrček
Line 42: Line 42:
 <html><br/></html> <html><br/></html>
  
-The [[http://www.korpus.cz|Czech National Corpus]] (ČNK) project was set up in 1994 in order to make accessible extensive linguistic data for teaching and research in the form of electronic [[en:pojmy:korpus|corpora]]. The Czech National Corpus currently ensures access to more than three billion [[en:pojmy:pozice|words]] contained in both [[en:pojmy:synchronni|synchronic]] and [[en:pojmy:diachronni|diachronic]] corpora, both [[en:cnk:struktura#korpusy_mluvene|spoken]] and [[en:cnk:struktura#korpusy_psane_soucasne_cestiny_rada_syn|written]], [[en:pojmy:paralelni|paralell]] and [[en:cnk:struktura#jednojazycne|monolinguistic]]. For working with them the ČNK develops [[https://wiki.korpus.cz/doku.php/start_new#manualove_stranky_aplikaci_cnk|specialized tools]].
+The [[http://www.korpus.cz|Czech National Corpus]] (CNC) project was set up in 1994 in order to make accessible extensive linguistic data for teaching and research in the form of electronic [[en:start#what_is_a_corpus|corpora]]. The Czech National Corpus currently ensures access to more than three billion words contained in both synchronic and diachronic corpora, both spoken and written, parallel and monolinguistic (see [[en:cnk:uvod|overview]]). For working with them the CNC develops [[en:start#manuals_for_cnc_applications|specialized tools]].
  
-The ČNK project is managed mainly by the workers of two departments under [[http://www.ff.cuni.cz|FF UK]] (Faculty of Arts, Charles University): [[http://ucnk.ff.cuni.cz|The Institute of the Czech National Corpus ]] and [[http://utkl.ff.cuni.cz|The Institute of Theoretical and Computational Linguistics]]. The collection of data and coordination of the individual activities is done with the aid of more than two hundred external workers from all over the Czech Republic.
+The CNC project is managed mainly by the workers of two departments of [[http://www.ff.cuni.cz|Faculty of Arts]], [[http://ww.cuni.cz|Charles University]]: [[http://ucnk.ff.cuni.cz|The Institute of the Czech National Corpus]] and [[http://utkl.ff.cuni.cz|The Institute of Theoretical and Computational Linguistics]]. The collection of data and coordination of the individual activities is done with the aid of more than two hundred external workers from all over the Czech Republic.
  
 ===== What information will you find here? ===== ===== What information will you find here? =====
  
-This wiki serves ČNK users not only as a source of information about the ČNK (description of public corpora and their documentation, application manuals), but also as a continuously edited database of corpus linguistic knowledge. The main parts of the wiki consist of the following:
+This wiki serves CNC users not only as a source of information about the CNC (description of public corpora and their documentation, application manuals), but also as a continuously edited database of corpus linguistic knowledge. The main parts of the wiki consist of the following:
  
 <HTML> <HTML>
 <!-- Entypo pictograms by Daniel Bruce — www.entypo.com --> <!-- Entypo pictograms by Daniel Bruce — www.entypo.com -->
 <table class="std"> <table class="std">
-<tr><td><a href="https://wiki.korpus.cz/doku.php/manualy:kontext:index"><img src="https://wiki.korpus.cz/lib/exe/fetch.php/manual.png" /></a></td>
+<tr><td><a href="https://wiki.korpus.cz/doku.php/en:manualy:kontext:index"><img src="https://wiki.korpus.cz/lib/exe/fetch.php/manual.png" /></a></td>
-    <td><a href="https://wiki.korpus.cz/doku.php/manualy:kontext:index">Manuals for ČNK applications</a></td></tr>
+    <td><a href="https://wiki.korpus.cz/doku.php/en:manualy:kontext:index">Manuals for CNC applications</a></td></tr>
-<tr><td><a href="https://wiki.korpus.cz/doku.php/cnk:uvod"><img src="https://wiki.korpus.cz/lib/exe/fetch.php/korpusy.png" /></a></td>
+<tr><td><a href="https://wiki.korpus.cz/doku.php/en:cnk:uvod"><img src="https://wiki.korpus.cz/lib/exe/fetch.php/korpusy.png" /></a></td>
-    <td><a href="https://wiki.korpus.cz/doku.php/cnk:uvod">Overview of corpora available within the ČNK</a></td></tr>
+    <td><a href="https://wiki.korpus.cz/doku.php/en:cnk:uvod">Overview of corpora available within the CNC</a></td></tr>
-<tr><td><a href="https://wiki.korpus.cz/doku.php/kurz:uvod"><img src="https://wiki.korpus.cz/lib/exe/fetch.php/kurz.png" /></a></td>
+<tr><td><a href="https://wiki.korpus.cz/doku.php/en:kurz:uvod"><img src="https://wiki.korpus.cz/lib/exe/fetch.php/kurz.png" /></a></td>
-    <td><a href="https://wiki.korpus.cz/doku.php/kurz:uvod">Tutorial for working with the EEBO in 8 lessons</a></td></tr>
+    <td><a href="https://wiki.korpus.cz/doku.php/en:kurz:uvod">Tutorial for working with the EEBO in 8 lessons</a></td></tr>
 <tr><td><a href="https://wiki.korpus.cz/doku.php/pojmy:prehled_pojmu"><img src="https://wiki.korpus.cz/lib/exe/fetch.php/slovnicek.png" /></a></td> <tr><td><a href="https://wiki.korpus.cz/doku.php/pojmy:prehled_pojmu"><img src="https://wiki.korpus.cz/lib/exe/fetch.php/slovnicek.png" /></a></td>
-    <td><a href="https://wiki.korpus.cz/doku.php/pojmy:prehled_pojmu">Index of basic concepts in corpus linguistics</a></td></tr>
+    <td><a href="https://wiki.korpus.cz/doku.php/pojmy:prehled_pojmu">Index of basic concepts in corpus linguistics (in Czech)</a></td></tr>
 <tr><td><a href="https://wiki.korpus.cz/doku.php/seznamy:index"><img src="https://wiki.korpus.cz/lib/exe/fetch.php/seznamy.png" /></a></td> <tr><td><a href="https://wiki.korpus.cz/doku.php/seznamy:index"><img src="https://wiki.korpus.cz/lib/exe/fetch.php/seznamy.png" /></a></td>
-    <td><a href="https://wiki.korpus.cz/doku.php/seznamy:index">List of sources and abbreviations</a></td></tr>
+    <td><a href="https://wiki.korpus.cz/doku.php/seznamy:index">List of sources and abbreviations (in Czech)</a></td></tr>
 </table> </table>
 </HTML> </HTML>
 ===== Frequently searched pages ===== ===== Frequently searched pages =====
  
-==== Manuals for ČNK applications ====
+==== Manuals for CNC applications ====
  
 <HTML> <HTML>
 <table class="apps"> <table class="apps">
-<tr><td><a href="https://wiki.korpus.cz/doku.php/manualy:kontext:index"><img src="https://wiki.korpus.cz/lib/exe/fetch.php/kontext_start.png" height=50 /></a></td>
-    <td><a href="https://wiki.korpus.cz/doku.php/manualy:syd"><img src="https://wiki.korpus.cz/lib/exe/fetch.php/syd_start.png" height=50 /></a></td>
-    <td><a href="https://wiki.korpus.cz/doku.php/manualy:treq"><img src="https://wiki.korpus.cz/lib/exe/fetch.php/treq_start.png" height=50 /></a></td>
-    <td><a href="https://wiki.korpus.cz/doku.php/manualy:kwords"><img src="https://wiki.korpus.cz/lib/exe/fetch.php/kwords_start.png" height=50 /></a></td>
-    <td><a href="https://wiki.korpus.cz/doku.php/manualy:morfio"><img src="https://wiki.korpus.cz/lib/exe/fetch.php/morfio_start.png" height=50 /></a></td></tr>
+<tr><td><a href="https://wiki.korpus.cz/doku.php/en:manualy:kontext:index"><img src="https://wiki.korpus.cz/lib/exe/fetch.php/kontext_start.png" height=50 /></a></td>
+<td><img src="https://wiki.korpus.cz/lib/exe/fetch.php/puntik.png" height=10 /></td>
+<td><a href="https://wiki.korpus.cz/doku.php/en:manualy:wag"><img src="https://wiki.korpus.cz/lib/exe/fetch.php/wag-logo_cs.png" height=36 /></a></td>
+</tr>
 </table> </table>
-</HTML> 
  
 +<table class="apps">
 +<tr>
 +    <td><a href="https://wiki.korpus.cz/doku.php/en:manualy:syd"><img src="https://wiki.korpus.cz/lib/exe/fetch.php/syd_start.png" height=50 /></a></td>
 +    <td><img src="https://wiki.korpus.cz/lib/exe/fetch.php/puntik.png" height=10 /></td>
 +    <td><a href="https://wiki.korpus.cz/doku.php/en:manualy:treq"><img src="https://wiki.korpus.cz/lib/exe/fetch.php/treq_start.png" height=50 /></a></td>
 +        <td><img src="https://wiki.korpus.cz/lib/exe/fetch.php/puntik.png" height=10 /></td>
 +    <td><a href="https://wiki.korpus.cz/doku.php/en:manualy:kwords"><img src="https://wiki.korpus.cz/lib/exe/fetch.php/kwords_start.png" height=50 /></a></td>
 +        <td><img src="https://wiki.korpus.cz/lib/exe/fetch.php/puntik.png" height=10 /></td>
 +    <td><a href="https://wiki.korpus.cz/doku.php/en:manualy:morfio"><img src="https://wiki.korpus.cz/lib/exe/fetch.php/morfio_start.png" height=50 /></a></td>
 +        <td><img src="https://wiki.korpus.cz/lib/exe/fetch.php/puntik.png" height=10 /></td>
 +    <td><a href="https://wiki.korpus.cz/doku.php/en:manualy:calc"><img src="https://wiki.korpus.cz/lib/exe/fetch.php/Calc_logotyp.png" height=50 /></a></td>
 +    </tr>
 +</table>
 +</HTML>
  
 ==== Useful links ==== ==== Useful links ====
  
  
-  * [[en:kurz:zaciname|How to begin working with the ČNK (registration, types of access)]]
-  * [[en:cnk:citace|How to cite ČNK corpora and tools]]
-  * [[en:seznamy:tagy|Description of morphological tags]]
-  * [[en:seznamy:index#zkratky_a_kody|Abbreviations and codes used for annotation of corpora]]
-  * [[en:pojmy:regularni_vyrazy|Regular expressions]] and [[en:kurz:pokrocile_dotazy|advanced queries]]
+  * [[en:kurz:zaciname|How to begin working with the CNC (registration, types of access)]]
+  * [[en:cnk:citace|How to cite CNC corpora and tools]]
+  * [[en:pojmy:regularni_vyrazy|Regular expressions]]
  
  
Line 93: Line 104:
 ===== What is a corpus? ===== ===== What is a corpus? =====
  
-A language [[en:pojmy:korpus|corpus]] is an extensive collection of **authentic textual data** (written or spoken) converted to **electronic form** in a uniform format, meaning that it can easily be **searched** for various linguistic phenomena -- especially words and phrases ([[en:pojmy:kolokace|collocations]]). Od obyčejného textového archívu či databáze jej odlišuje především to, že bývá pečlivě sestaven s ohledem na výzkumný účel (má například reprezentovat současný mluvený či psaný jazyk nebo jeho část, např. publicistické texty). Korpus zobrazuje jazykové jevy v jejich **přirozeném kontextu**, a umožňuje tak vytvářet na reálných datech podložený jazykový výzkum v rozsahu, který byl dříve nemyslitelný.
+A language corpus is an extensive collection of **authentic textual data** (written or spoken) converted to **electronic form** in a uniform format, meaning that it can easily be **searched** for various linguistic phenomena -- especially words and phrases (collocations). Corpora differ from a plain text archive or database primarily because they have been carefully compiled with the research purpose in mind (they should, for example, represent contemporary spoken language or written language or one of its parts, e.g. journalistic texts). A corpus displays linguistic phenomena in their **natural context**, which allows us to do language research based on actual data on scale so large that it would have previously been unthinkable.
  
  
 ===== User support ===== ===== User support =====
  
-Všem uživatelům je k dispozici [[https://podpora.korpus.cz/projects/poradna/boards|Poradna]], do které je možné klást libovolné dotazy týkající se práce s ČNK (vytváření dotazů, specifika korpusů apod.). Většina dotazů je zodpovězena do jednoho pracovního dne.
+The [[https://podpora.korpus.cz/projects/poradna/boards|Helpdesk]] is available to all users, who are invited to post questions concerning work with the CNC (creating queries, corpus specifics etc.). The majority of the questions is answered within one work day.
  
-Centrum uživatelské podpory zahrnuje také hlášení chyb v aplikacích ČNK a zasílání návrhů na jejich vylepšení. Odkaz na formulář takového hlášení najdete v každé aplikaci zcela vespod -- "//Nahlásit chybu//".
+The user support centre also includes error reports in CNC applications and sending improvement suggestions. The link to a form intended for such reports can be found at the very bottom of every application -- "//Report an error//".
  
 ---- ----
  
 <WRAP center round box 64%> <WRAP center round box 64%>
-[[en:kurz:zaciname|How to begin work with ČNK]] • [[en:manualy:kontext:index|KonText interface manual]] • [[en:cnk:uvod|ČNK corpora]]   • [[en:pojmy:prehled_pojmu|Index of concepts]] • [[en:kurz:uvod|Tutorial for working with EEBO in 8 lessons]] • [[en:cnk:citace|How to cite]]
+[[en:kurz:zaciname|How to begin working with the CNC]] • [[en:manualy:kontext:index|KonText interface manual]] • [[en:cnk:uvod|CNC corpora]] • [[en:kurz:uvod|Tutorial for working with EEBO in 8 lessons]] • [[en:cnk:citace|How to cite]]
 </WRAP> </WRAP>