AplikaceAplikace
Nastavení

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Next revision
Previous revision
Next revisionBoth sides next revision
en:pojmy:din [2019/09/27 10:03] – created vaclavcvrceken:pojmy:din [2019/09/27 10:16] – [DIN] vaclavcvrcek
Line 1: Line 1:
 ====== DIN ====== ====== DIN ======
  
 +DIN (Difference index) is a so called effect-size metric, i.e. measure designed((see Fidler, M. - Cvrček, V.: {{:pojmy:josl-separat.pdf|A Data-Driven Analysis of Reader Viewpoints: Reconstructing the Historical Reader Using Keyword Analysis}})) for the purpose of quantifying the relevance of a difference between values. DIN is implemented for extracting prominent units from a text (keywords) in the [[en:manualy:kwords|KWords]] tool. 
  
 +===== Significance and relevance =====
  
 +When comparing values (e.g. frequencies of words) we should be interested not only in the statistical significance but also whether the difference under consideration is actualy relevant for the description. Statistical significance can be obtained by several tests (e.g. chi2 test, Fisher's test or log-likelihood test).((It is unimportant for the time being that these test can also be employed as association measures for the extraction of collocastions)) Significance is usualy expressed as a p-value, i.e. the probability that the difference is caused by chance or variation within the data.
 +
 +Even if the difference is signifiacnt it does not necesarily entails that it is relevant for the description. Even a small difference can be significant when there is a lot of results available. That is why the statistical significance information is often combined with the effect-size.
 ===== How it works ===== ===== How it works =====