AplikaceAplikace
Nastavení

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revisionPrevious revision
Next revision
Previous revision
Last revisionBoth sides next revision
en:seznamy:kontext_verze [2018/10/30 16:41] michalkrenen:seznamy:kontext_verze [2024/02/23 16:26] tomasmachalek
Line 5: Line 5:
 The corpus interface KonText is designed for general interaction with [[en:cnk:uvod|CNC corpora]]. A comprehensive list of KonText’s available functions can be found in the [[en:manualy:kontext:index|manual]]. The corpus interface KonText is designed for general interaction with [[en:cnk:uvod|CNC corpora]]. A comprehensive list of KonText’s available functions can be found in the [[en:manualy:kontext:index|manual]].
  
-KonText is an extended and visually modified version of the original NoSketch Engine application. It is developed by the [[http://ucnk.ff.cuni.cz/en/|Institute of the Czech National Corpus]] (Faculty of Arts, Charles University) and the [[http://ufal.mff.cuni.cz|Institute of Formal and Applied Linguistics]] (Faculty of Mathematics and Physics, Charles University) under the **GNU GPL 2** license (with Tomáš Machálek as the main developer). Just as the NoSketch Engine, KonText uses [[https://nlp.fi.muni.cz/trac/noske|Manatee]] as its backend.+KonText is an extended and visually modified version of the original NoSketch Engine application. It is developed by the [[http://ucnk.ff.cuni.cz/en/|Institute of the Czech National Corpus]] (Faculty of Arts, Charles University) and the [[http://ufal.mff.cuni.cz|Institute of Formal and Applied Linguistics]] (Faculty of Mathematics and Physics, Charles University) under the **GNU GPL 2** license (with Tomáš Machálek and Martin Zimandl as the main developers). Just as the NoSketch Engine, KonText uses [[https://nlp.fi.muni.cz/trac/noske|Manatee]] as its backend.
  
 The version history overview below contains only the most signicant changes as seen from the end-user perspective. **A complete list of all changes and bug fixes can be found on [[https://github.com/czcorpus/kontext/releases|KonText's GitHub page]]** which also hosts a complete source code repository. The version history overview below contains only the most signicant changes as seen from the end-user perspective. **A complete list of all changes and bug fixes can be found on [[https://github.com/czcorpus/kontext/releases|KonText's GitHub page]]** which also hosts a complete source code repository.
 +
 +===== Release 0.18.0 =====
 +
 +//Publication date: 7.2.2024//
 +
 +User changes:
 +
 +    * new module "Keyword Analysis"
 +    * displaying translation equivalents in parallel corpora by clicking on the selected word directly in a concordance (new plug-in ''tokens_linking'')
 +    * possibility to download a list of documents matching selected text types
 +    * JSONL as a new optional format for storing results (concordance, word list, collocations, frequency list, document list), where each document line contains a separate JSON string - the format is particularly suitable for further automated processing
 +    * improved linking from external applications to KonText
 +      * multi-step operations (e.g. query + filter) with the possibility of subsequent editing in respective query forms
 +      * support for non-token filter ranges when linking to KonText (e.g. "from ''-1s'' to ''1s''")
 +    * "Federated Content Search" module supports searching in multiple corpora at the same time
 +
 +Technical changes:
 +
 +    * dropped support for Celery as a calculation backend (Rq remains)
 +    * new internal HTTP client for querying external data sources (authentication, translation equivalents, etc.)
 +    * improved installation script
 +    * KonText uses (optionally) a custom modification of Manatee-open with more statistical measures for keyword analysis
 +
 +
 +===== Release 0.17.0 =====
 +
 +//Publication date: 17.2.2023//
 +
 +User changes:
 +
 +  * **enhanced and refined subcorpora**
 +    * by default, every subcorpus is available to all users, addressing issues with URLs shared between users
 +    * if a user does not provide a description, the subcorpus remains undiscoverable
 +    * a subcorpus can be archived in which case all the URLs are still functional but the subcorpus won't be listed in author's subcorpora (unless explicitly specified in listing filter)
 +    * on the concordance query page, users can create a **subcorpus draft** from selected text types for future use
 +    * easily copy a subcorpus or create a new variant
 +  * a new function displays graphically the **dispersion** of a search term across the corpus data
 +  * highlighted **translation equivalents** (as retrieved from the Treq application) directly in the concordance
 +  * sharing **individual frequency tables through exported URLs**
 +    * when a frequency result page contains multiple tables, users can now easily obtain URLs for each table to share or publish the table
 +  * in the line selection function, users can navigate to the page with the **first selected line**
 +    * for manually categorized lines in extensive concordances where the first selected line starts far beyond the initial page, this feature enables automatic location of the first selection
 +  * customizable "nice" backlinks allow other applications to reference KonText results (available for easier integration with other applications)
 +  * **detection of overly time-consuming queries** for large corpora (typically the ones producing large result sets) and suggestion of an alternative corpus
 +
 +Technical changes:
 +
 +  * core web application framework changed from Gunicorn+Werkzeug to [[https://sanic.dev/en/|Sanic]]
 +  * upgrade to React 18
 +  * server backend rewritten with //async/awit// 
 +  * checking of background tasks from the client side is now by default doe via WebSockets
 +  * support for Manatee 2.2xx
 +  * improved caching of frequency distribution results for faster navigation between result pages
 +  * moved from HTTP sessions stored on server to [[https://jwt.io/|JWT]]
 +  * possibility to apply individual "cutoff" for large concordances
 +
 +
 +===== Release 0.16.0 =====
 +
 +//Publication date: 23. 2. 2022//
 +
 +User changes:
 +    * new query type: **paradigmatic query**
 +    * enhanced "word list" query type
 +        * improved user interface
 +        * optimalization of saved results for faster subsequent access
 +    * query history now supports all query types (concordance, word list, paradigmatic query)
 +    * enhanced frequency distribution
 +        * **graphical mode**
 +            * special support for time-based distributions
 +        * displaying of confidence intervals
 +        * default display option can now be set by the user (tables vs. figures)
 +    * enhanced audio playback
 +        * possibility to shift the playback in time
 +        * waveform display
 +    * option to create a subcorpus directly on the concordance query page
 +    * search suggestion with sublemma support (syn2020, syn_v9) and faster response
 +
 +Technical changes:
 +    * integration of a number of modules (e.g. "liveattrs", query history) with an internal database system
 +    * reorganization of server code
 +    * transition from CSS files to Styled Components
 +    * Docker support
 +    * support for automatic testing of the user interface
 +    * removing unnecessary attributes from the configuration
 +
 +
 +===== Release 0.15.0 =====
 +
 +//Publication date: 18. 12. 2020//
 +
 +User changes:
 +
 +    * number of query types reduced to two:
 +      * advanced (equivalent to the original "CQL")
 +      * simple
 +        * multi-word search
 +        * optional support for regular expressions
 +        * optional (per corpus) default search attributes
 +    * new calendar-based widget for specifying date intervals in the "Restrict search" section of the main query form
 +    * **syntax_viewer** plug-in enhancement -- added support for new features of SYN2020
 +    * new **query_suggest** plug-in providing interactive help with writing a query
 +    * **token_connect** plug-in can be now used also as a source for an alternative KWIC detail view
 +      * added a new module "formatted text"
 +    * **taghelper** plug-in now supports "key-value" tagsets and it is also possible to define multiple tagsets for a corpus
 +    * new option for displaying additional positional attributes (below the main text tokens)
 +    * possibility to set any positional attribute as the main one in the concordance view
 +    * more user-friendly "Corpus-specific settings" module
 +    * redesigned "Specify context" section of the main query form
 +    * possibility to perform more complex queries (billion-word corpora, aligned corpora when querying only the primary language) without the web-server's time limit constraint
 +    * an archived URL of a frequency distribution or a collocation can be now restored even for complex queries, regardless the web server time-out
 +
 +Technical changes:
 +    * server-side rewritten to Python 3
 +    * added support for a new asychronous task processing backend [[https://python-rq.org/|Rq]]; the new backend is now the default one
 +    * client-side rewritten using the same framework as in [[manualy:wag|WaG]]
 +    * synchronization between the web server and the back-end worker queue rewritten in case of concordance calculation
 +    * changes in HTTP API
 +
 +===== Release 0.13.0 =====
 +
 +//Publication date: 9. 12. 2019//
 +
 +  * rewrite of HTML templates to Jinja2
 +  * transition to React.JS framework, which resulted mainly in extensive changes of the code and, to a lesser extent, also in user interface elements (e.g. corpus-specific view settings are now in three tabs)
 +  * preparing future functionality support
  
 ===== Release 0.12.0 ===== ===== Release 0.12.0 =====
Line 13: Line 139:
 //Publication date: 30. 10. 2018// //Publication date: 30. 10. 2018//
  
-  * query results integrated with Treq (set up for InterCorp v10 and v11) +  * translation equivalents based on Treq directly displayed in KonText for parallel corpora (set up for InterCorp v10 and v11) 
-  * CQL editor with syntax highlighting and some basic value validation+  * CQL editor with syntax highlighting and basic value validation
   * mixed mode for attribute displaying (directly in text for KWIC, on mouse-over for other tokens)   * mixed mode for attribute displaying (directly in text for KWIC, on mouse-over for other tokens)
-  * possibility to share a named subcorpus with its description+  * sharing a named subcorpus and its description with other users
   * new filter functions    * new filter functions 
     * nested matches filter     * nested matches filter