Differences
This shows you the differences between two versions of the page.
Both sides previous revisionPrevious revisionNext revision | Previous revision | ||
en:manualy:lists [2019/10/15 20:46] – vaclavcvrcek | en:manualy:lists [2021/02/02 18:28] (current) – michalkren | ||
---|---|---|---|
Line 1: | Line 1: | ||
====== Lists: Frequency list browser ====== | ====== Lists: Frequency list browser ====== | ||
- | The //Lists// application allows the user to browse the frequency lists of various units ([[en: | + | The //Lists// application allows the user to browse the frequency lists of various units ([[en: |
When browsing the list by corpora (first tab), each unit has 4 types of frequency information: | When browsing the list by corpora (first tab), each unit has 4 types of frequency information: | ||
Line 10: | Line 10: | ||
* average reduced frequency normalized per million words (ARFn). | * average reduced frequency normalized per million words (ARFn). | ||
- | The lemma table contains an additional column with word class information ([[en: | + | The lemma table contains an additional column with word class information ([[en: |
The second tab in the browser provides a simple comparison of relative frequencies (IPM) and average reduced frequencies normalized per million words (ARFn) within individual registers (other frequency-related data are dependent on the size of the sub-corpus, rendering the results of such a comparison worthless). The information in this tab is derived from the SYN2015 and Oral v1 corpora. | The second tab in the browser provides a simple comparison of relative frequencies (IPM) and average reduced frequencies normalized per million words (ARFn) within individual registers (other frequency-related data are dependent on the size of the sub-corpus, rendering the results of such a comparison worthless). The information in this tab is derived from the SYN2015 and Oral v1 corpora. | ||
Line 16: | Line 16: | ||
For the purposes of comparison and the use of the newest versions of lemmatization and POS tagging, the data for the SYN2000, SYN2005, SYN2010, and SYN2015 corpora have been taken from the corresponding sub-corpora of the [[en: | For the purposes of comparison and the use of the newest versions of lemmatization and POS tagging, the data for the SYN2000, SYN2005, SYN2010, and SYN2015 corpora have been taken from the corresponding sub-corpora of the [[en: | ||
- | The application is available at: [[https://jupyter.korpus.cz/ | + | **The application is available at [[http:// |
+ | |||
+ | In addition to the //Lists// application, | ||
+ | * registered CNC users can create customized frequency lists using the [[en: | ||
+ | * it is possible to download [[seznamy: | ||
+ | * other frequency data can be obtained upon request sent by e-mail to cnk (at) korpus.cz | ||
+ | |||
+ | ===== How to cite Lists ===== | ||
+ | |||
+ | <WRAP round tip 80%> | ||
+ | Křen, M. - Cvrček, V.: Lists: Frequency list browser. FF UK. Praha 2019. Available at: < | ||
+ | </ |