Differences
This shows you the differences between two versions of the page.
Both sides previous revisionPrevious revisionNext revision | Previous revision | ||
en:cnk:ortofon [2024/06/18 19:03] – Correct stats (main change: number of speakers in v2 by speaker_id, not nickname) vhorky | en:cnk:ortofon [2024/08/05 10:27] (current) – [ORTOFON v3 (2024)] v3 is not balanced vhorky | ||
---|---|---|---|
Line 9: | Line 9: | ||
<WRAP 45%> | <WRAP 45%> | ||
^ <fs medium> | ^ <fs medium> | ||
- | ^ Number of [[en: | + | ^ Number of [[en: |
- | ^ Number of [[en: | + | ^ Number of [[en: |
^ Number of [[en: | ^ Number of [[en: | ||
^ Number of [[en: | ^ Number of [[en: | ||
Line 79: | Line 79: | ||
===== ORTOFON v3 (2024) ===== | ===== ORTOFON v3 (2024) ===== | ||
- | The 3rd version of the ORTOFON corpus was published in 2024. It contains 110 127 words and captures 1 121 speakers from all over the Czech Republic in 697 recordings, made between 2012 and 2020, totaling 243 hours. It also includes data from both previous versions of the corpus. The transcription at the orthographic and phonetic level as well as the corresponding audio recording are available in the KonText corpus interface. For this version, a number of inconsistencies in the transcription have been removed and a number of corrections have been made. | + | The 3rd version of the ORTOFON corpus was published in 2024. It contains 110 127 words and captures 1 121 speakers from all over the Czech Republic in 697 recordings, made between 2012 and 2020, totaling 243 hours. It also includes data from both previous versions of the corpus. Like the second version, this one too is not balanced. The transcription at the orthographic and phonetic level as well as the corresponding audio recording are available in the KonText corpus interface. For this version, a number of inconsistencies in the transcription have been removed and a number of corrections have been made. |
The ORTOFON v3 corpus is automatically **annotated according to the SYN2020 standard**, see [[en: | The ORTOFON v3 corpus is automatically **annotated according to the SYN2020 standard**, see [[en: | ||
Line 91: | Line 91: | ||
<WRAP round tip 70%> | <WRAP round tip 70%> | ||
+ | Lukeš, D. – Kopřivová, | ||
+ | |||
Kopřivová, | Kopřivová, | ||