
Differences

This shows you the differences between two versions of the page.

en:pojmy:syntakticka_analyza [2021/01/20 11:25] – [Searching KonText for syntactic structures: syntactic attributes] Tomáš Jelínek
en:pojmy:syntakticka_analyza [2022/08/13 13:08] (current) – [Syntactic analysis and syntactic tagging] Alexandr Rosen
Line 1:
 ====== Syntactic analysis and syntactic tagging ======
  
-Some of CNC corpora (the first of which is [[en:cnk:syn2015|SYN2015]]) are syntactically annotated, marking dependency relations between two words in a sentence and the analytical functions of individual words. This syntactic annotation is based on the principles of the analytical-layer annotation used in the [[http://ufal.mff.cuni.cz/pdt2.0/index-cz.html|Prague Dependency Treebank]] (PDT).
+Some of CNC corpora (the first of which is [[en:cnk:syn2015|SYN2015]]) are syntactically annotated, marking dependency relations between two words in a sentence and the analytical functions of individual words. This syntactic annotation is based on the principles of the analytical-layer annotation used in the [[http://ufal.mff.cuni.cz/pdt2.0/index-cz.html|Prague Dependency Treebank]] (PDT). The [[en:cnk:intercorp|InterCorp]] parallel corpus in its release [[en:cnk:intercorp:verze13ud|13ud]] is syntactically (and also morphologically) annotated in an alternative way, following the guidelines of the international [[en:pojmy:ud|Universal Dependencies]] project.
  
 ===== The system of syntactic annotation: the analytical layer of the Prague Dependency Treebank =====
Line 32:
 ===== Visualisation of syntactic structures in KonText =====
  
-For every sentence in a syntactically annotated corpus (for now in [[en:cnk:syn2015|SYN2015]] and in [[en:cnk:syn2020|SYN2020]]), a syntactic structure can be visualised by clicking on the icon representing a syntactic tree on the left side of a concordance line (marked with a red circle in the following image):\\
+For every sentence in a syntactically annotated corpus (currently [[en:cnk:syn2015|SYN2015]] and [[en:cnk:syn2020|SYN2020]]), a syntactic structure can be visualised by clicking on a little icon representing a syntactic tree on the left side of a concordance line (marked with a red circle in the following image):\\
  
-{{:pojmy:zobrazenisyntaxe.png?500|Jak zobrazit syntaktickou strukturu}}\\
+{{:pojmy:zobrazenisyntaxe.png?500|Syntactic structure visualisation}}\\
  
-Prokliknutím se zobrazí syntaktická struktura dané věty. Pořadí slov ve větě odpovídá pořadí syntaktických slov ve struktuře (zleva doprava), řídící slova jsou ve struktuře výše, závislá níže. Jako příklad uvádíme část zobrazené struktury týkající se vedlejší věty //aby ses měla nač vymluvit// (věta z korpusu SYN2020, obsahuje tzv. [[cnk:syn2020:agregat|agregáty]], tj. tokeny zahrnující více syntaktických slov):\\
+By clicking on the icon, a representation of the syntactic structure is displayed (a syntactic tree). The left-to-right order in the syntactic representation corresponds to the order in the sentence; the dependent tokens are placed below the governing tokens. The following image represents the structure of a subordinate clause from the SYN2020 corpus "//aby ses měla nač vymluvit//" [so that you can find an excuse]. The sentence contains three so-called [[en:cnk:syn2020:agregat|aggregates]], i.e. tokens containing two or more syntactic words:\\
  
-{{:cnk:syn2020:agregaty_syntax.png?250|Syntaktická struktura věty: příklad}}\\
+{{:cnk:syn2020:agregaty_syntax.png?250|Example of syntactic structure in Kontext}}\\
  
 ===== Searching KonText for syntactic structures: syntactic attributes =====
  
-Searching in syntactically annotated corpora typically requires an interface specially designed to display the syntactic structure, for example the program [[https://ufal.mff.cuni.cz/tred/|TrEd]]. The [[en:manualy:kontext:index|KonText]] interface does not offer the option of viewing the syntactic structure, nonetheless it is possible to search for words and phrases according to syntactic parameters. For this purpose, each token is assigned several [[en:pojmy:atributy_pozicni|attributes]], in addition to the smaller number of attributes which are assigned only to selected tokens. All syntactic attributes are described in a [[en:seznamy:syntakticke_znacky|separate entry]]. The basic syntactic attributes assigned to all tokens are:
+It is possible to formulate queries in KonText based on syntactic properties of words. For this purpose, each token is assigned several [[en:pojmy:atributy_pozicni|attributes]]. All syntactic attributes are described in a [[en:seznamy:syntakticke_znacky|separate entry]]. The basic syntactic attributes assigned to all tokens are:
   * [[en:seznamy:parent|parent]] (numbered reference to the position of the governing token)
   * [[en:seznamy:afun|afun]] (syntactic function)