Differences

This shows you the differences between two versions of the page.

en:cnk:online:gen2 [2022/12/22 14:26] vaclavcvrcek
en:cnk:online:gen2 [2022/12/22 16:13] – [duplicate] vaclavcvrcek
Line 64: Line 64:
   * Stranické weby (party sites)   * Stranické weby (party sites)
   * Web instituce (institution sites)   * Web instituce (institution sites)
 +
 +==== duplicate ====
 +
 +The ''text.duplicate'' attribute (available only in Generation 2) indicates whether a text is a duplicate of another text in the corpus. Duplicates are quite common in online media because news items are often taken over between news agencies and individual portals. To avoid the bias that such duplicates introduce, we can add a ''within'' condition to the query (e.g., ''%%[word="round"] within <text duplicate!="no" />%%''), so that duplicated texts appear in the result only once.
 +
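As an illustration only, the ''within'' restriction described above can be appended to any query programmatically. The following is a minimal sketch in Python; the helper name ''restrict_to_text_attribute'' is hypothetical and not part of any corpus tool. It only builds the query string and does not contact a corpus server. The condition is copied verbatim from the example above.

```python
def restrict_to_text_attribute(query: str, condition: str) -> str:
    """Append a `within <text ... />` restriction to a CQL query string.

    Hypothetical helper for illustration: it performs plain string
    formatting and does not validate the query or the condition.
    """
    return f"{query} within <text {condition} />"


# Condition taken verbatim from the example in the text above.
DUPLICATE_CONDITION = 'duplicate!="no"'

query = restrict_to_text_attribute('[word="round"]', DUPLICATE_CONDITION)
print(query)
```

The resulting string can then be pasted into the query field of the corpus interface; any other structural attribute condition could be passed in the same way.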