


Alpha

Alpha application interface

The Alpha application is a translator from natural language to Corpus Query Language (CQL).

If you write your queries in Czech, Alpha translates them into CQL suitable for direct use in the KonText search engine, including morphological tags and similar attributes. For queries written in English, it responds with tags typical of Universal Dependencies. So far, these are usable only with the InterCorp corpora in version 13UD, which are annotated according to this standard.

A short tutorial and various ideas on how to use the tool can be found directly in the application; it appears when you press the question mark button. A more extensive introduction can be found in this tutorial video.

Application examples

Try the query In a German corpus, find all lemmata that begin with “Schei” and are dependent on an object. Alpha should translate this sentence as [lemma="Schei.*" & p_deprel="obj"]. After you press Enter or click the “Vyhledej v korpusu” (“Search in corpus”) button, the query is run by the KonText tool.
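To make the structure of the translated query concrete, here is a minimal Python sketch that assembles a single CQL token expression from attribute and pattern pairs. The helper cql_token is hypothetical and is not part of Alpha or KonText; it only illustrates how several constraints on one token are joined with & inside a single pair of square brackets. It assumes the patterns contain no double quotes and does no escaping.

```python
def cql_token(**constraints: str) -> str:
    """Build one CQL token expression, e.g. [lemma="Schei.*" & p_deprel="obj"].

    Hypothetical helper for illustration only; not part of Alpha or KonText.
    Patterns are inserted verbatim (no escaping of double quotes).
    """
    if not constraints:
        # An empty pair of brackets stands for any single token.
        return "[]"
    # Keyword arguments keep their order, so the output mirrors the call.
    parts = [f'{attr}="{pattern}"' for attr, pattern in constraints.items()]
    return "[" + " & ".join(parts) + "]"


# The German example from the text: lemma starts with "Schei" and the
# p_deprel attribute has the value "obj".
query = cql_token(lemma="Schei.*", p_deprel="obj")
print(query)
```

The printed string is the same expression that Alpha is expected to produce for the English sentence above, so it can be compared directly with the application's output.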

Or you can try a query in Czech: najdi slovo „na“ následované lemmatem „hrad“ (“find the word ‘na’ followed by the lemma ‘hrad’”), which translates as [lc="na"][lemma="hrad"]. Note that the words you want to find must be in quotes. This makes clear whether you want to find a noun, i.e. [upos="NOUN|PROPN"], or the words „a noun“, i.e. [word="a"][word="noun"].
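The difference between the quoted and unquoted readings can be sketched in a few lines of Python. The functions token and sequence below are hypothetical illustrations, not part of Alpha or KonText; they only show that consecutive tokens in CQL are written as adjacent bracketed expressions, and that the two readings of "a noun" lead to structurally different queries.

```python
def token(attr: str, pattern: str) -> str:
    """One CQL token constrained by a single attribute (hypothetical helper)."""
    return f'[{attr}="{pattern}"]'


def sequence(*tokens: str) -> str:
    """Consecutive tokens are written one after another (hypothetical helper)."""
    return "".join(tokens)


# Czech example from the text: the word "na" followed by the lemma "hrad".
na_hrad = sequence(token("lc", "na"), token("lemma", "hrad"))

# Unquoted reading: "a noun" names a part of speech, so one token is matched.
noun_as_category = token("upos", "NOUN|PROPN")

# Quoted reading: the literal words "a" and "noun", so two tokens are matched.
noun_as_words = sequence(token("word", "a"), token("word", "noun"))

for q in (na_hrad, noun_as_category, noun_as_words):
    print(q)
```

The three printed lines correspond to the three CQL expressions quoted in the paragraph above.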

How to cite Alpha

Milička, J.: Alpha: Natural Language to Corpus Query Language Translator. FF UK. Praha 2020. Available from WWW: <http://www.alpha.korpus.cz>.

  • Milička, J., & Šebestová, D. (2024). Query a corpus in near-natural language: A human-friendly corpus query language not only for linguists. In S. Buschfeld, P. Ronan, T. Neumaier, A. Weilinghoff, & L. Westermayer (Eds.), Crossing Boundaries through Corpora: Innovative Approaches to Corpus Linguistics. John Benjamins.