~~NOTOC~~
====== Jiří Milička ======

  *[[https://scholar.google.com/citations?hl=en&user=71SLjBMAAAAJ|Google scholar]], [[https://www.researchgate.net/profile/Jiri-Milicka|ResearchGate]]
  *[[https://www.scopus.com/authid/detail.uri?authorId=55946511400|Scopus]], ORCID: [[https://orcid.org/0000-0001-8605-1199|0000-0001-8605-1199]]
Členství v radách: [[https://www.tandfonline.com/toc/njql20/current|Journal of Quantitative Linguistics]], [[https://sciendo.com/journal/LF|Linguistic Frontiers]]

Další členství: [[https://www.iqla.org/|The International Quantitative Linguistics Association]]  


===== Zaměření =====


    *korpusová  lingvistika
    *kvantitativní lingvistika
    *čeština, arabština


===== Články =====

==== Preprinty ====
  * Milička, J., & Bednářová, H. (2026). Sydney Telling Fables on AI and Humans: A Corpus Tracing Memetic Transfer of Persona between LLMs. arXiv preprint https://arxiv.org/abs/2602.22481.
  * Milička, J., Marklová, A., & Cvrček, V. (2025). Benchmark of stylistic variation in LLM-generated texts. arXiv preprint http://arxiv.org/abs/2509.10179.
  * Milička, J., Marklová, A., & Cvrček, V. (2025). AI Brown and AI Koditex: LLM-Generated Corpora Comparable to Traditional Corpora of English and Czech Texts. arXiv preprint https://arxiv.org/abs/2509.22996.  
  * Milička, J. (2024). Theoretical and Methodological Framework for Studying Texts Produced by Large Language Models. arXiv [Cs.CL]. Retrieved from https://arxiv.org/abs/2408.16740

==== 2026 ====
  * Marklová, A., Vinš, O., Vokáčová, M., & Milička, J. (2025). The author is dead, but what if they never lived? A reception experiment on Czech AI- and human-authored poetry. Digital Scholarship in the Humanities, https://doi.org/10.1093/llc/fqag067
  *Milička, J., & Machálek T.: AI Corpus Linguist: More than a Year of Experience. SIGHUM (LaTeCH-CLfL) workshop at EACL 2026. [[https://aclanthology.org/2026.latechclfl-1.29/|EACL 2026]]
  *Nádvorníková, O., Milička, J. & Rosen, A.: Exploring crosslinguistic and genre variation in syntactic complexity: insights from a multilingual parallel corpus. Linguistics Vanguard. https://doi.org/10.1515/lingvan-2025-0109
==== 2025 ====
  *Marklová, A., Milička, J., Ryvkin, L., Lacková Bennet, L. U., & Kormaníková, L. (2025). Iconicity in large language models. Digital Scholarship in the Humanities, [[https://academic.oup.com/dsh/advance-article/doi/10.1093/llc/fqaf095/8257789|DSH]]. 
  * Milička, J., Marklová, A., Drobil, O., & Pospíšilová, E. (2025). Learning to detect AI texts and learning the limits. PloS one, 20(10), [[https://doi.org/10.1371/journal.pone.0333007|e0333007]].
  * Milička, J. (2025). Simple stochastic processes behind Menzerath’s Law. In A. Pawłowski, S. Embleton, J. Mačutek, & A. Xanthos (Eds.), //Mathematical Modelling in Linguistics and Text Analysis: Theory and applications// (pp. 43–59). Amsterdam: John Benjamins. [[https://www.benjamins.com/catalog/cilt.370.04mil]] Preprint available at [[http://arxiv.org/abs/2409.00279]].
  * Milička, J., Marklová, A., Láznička, M., Diatka, V., Bednářová, H., Matela, J., & Škrabal, M. (2025). Sources of Intelligibility of Distant Languages: An Empirical Study. //Language and Speech, 0(0)//. [[https://doi.org/10.1177/00238309251345952]]
==== 2024 ====
  * Milička, J., Marklová, A., VanSlambrouck, K., Pospíšilová, E., Šimsová, J., Harvan, S., & Drobil, O. Large language models are able to downplay their cognitive abilities to fit the persona they simulate. //Plos one, 19(3), [[https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0298522|e0298522]]//.
  * Milička, J., & Šebestová, D. (2024). Query a corpus in near-natural language: A human-friendly corpus query language not only for linguists. In S. Buschfeld, P. Ronan, T. Neumaier, A. Weilinghoff, & L. Westermayer (Eds.), //Crossing Boundaries through Corpora: Innovative Approaches to Corpus Linguistics.// John Benjamins.
==== 2023 ====

  * Milička, J. Menzerath’s law: Is it just regression toward the mean? //Glottometrics//, 55. doi:[[https://glottometrics.iqla.org/409-menzeraths-law-is-it-just-regression-toward-the-mean/|10.53482/2023_55_409]]

==== 2022 ====
  * Milička, J., Cvrček, V., & Lukeš, D.: Unpacking lexical intertextuality: Vocabulary shared among texts. Yamazaki, M., Sanada, H., Köhler, R., Embleton, S., Vulanović, R., & Wheeler, E. S. (Eds). //Quantitative Approaches to Universality and Individuality in Language//. Berlin/Boston: De Gruyter Mouton. 101-116. DOI: [[https://doi.org/10.1515/9783110763560-009|10.1515/9783110763560-009]]
   *Zemánek, P. & Milička, J.: Frankové očima Arabů v klasickém a moderním období. In O. Lomová, J. Malečková & K. Šíma (Eds.), //Setkávání kultur. Identity, ideologie, jazyky// (pp. 233-246). Praha:  Univerzita Karlova, Filozofická fakulta. ISBN 978-80-7671-085-6.


==== 2021 ====
   *Milička, J., Cvrček, V., & Lukešová, L.: Modelling crosslinguistic n‑gram correspondence in typologically different languages. //Languages in Contrast 21(2)//, 217-249. DOI: [[http://doi.org/10.1075/lic.19018.mil|10.1075/lic.19018.mil]]. ISSN: 1387-6759.
   *Milička, J., & Houzar, A.: Phonological properties as predictors of text success. In A. Pawłowski, S. Embleton, J. Mačutek and G. Mikros (eds.), //Language and Text: Data, models, information and applications// (pp. 177–194). John Benjamins. ISBN 9789027210104.
   *Matlach, V., Krivochen, D. G., & Milička, J.: A method for the comparison of general sequences via type-token ratio. In A. Pawłowski, S. Embleton, J. Mačutek and G. Mikros (eds.), //Language and Text: Data, models, information and applications// (pp. 37–54). John Benjamins. ISBN 9789027210104.
   *Malá, M., Šebestová, D., & Milička, J.: The expression of time in English and Czech children’s literature. In A. Čermáková, T. Egan, H. Hasselgård &  S. Rørvik (eds.), //Time in Languages, Languages in Time// (pp 283–304). John Benjamins. ISBN 978-90-272-0968-9.
   *Kubát, M., Hůla, J., Chen, X., Čech, R., & Milička, J.: The lexical context in a style analysis: A word embeddings approach. //Corpus Linguistics and Linguistic Theory, 17(2)//, 443-464.

==== 2020 ====
   *Milička, J.: Kolik procent lexikálních výpůjček můžeme očekávat ve slovenském textu?. //[[https://www.juls.savba.sk/ediela/sr/2020/1/sr20-1.pdf|Slovenská reč, 85(1)]]//, 76–81.
   *Kováříková, D., Škrabal, M., Cvrček, V., Lukešová, L., & Milička, J.: Lexicographer’s Lacunas or How to Deal with Missing Representative Dictionary Forms on the Example of Czech. //International Journal of Lexicography, 33(1)//, 90-103.

==== 2019 ====
   *Mačutek, J., Čech, R., & Milička, J.: Length of non-projective sentences: A pilot study using a Czech UD treebank. In //Proceedings of the First Workshop on Quantitative Syntax (Quasy, SyntaxFest 2019)// (pp. 110–117). ISBN 978-1-950737-65-9.
   *Čech, R., Hůla, J., Kubát, M., Chen, X., & Milička, J.: The development of context specificity of lemma. A word embeddings approach. //Journal of Quantitative Linguistics, 26(3)//, 187-204.
   *Hůla, J., Kubát, M., Čech, R., Chen, X., Číž, D., Pelegrinová, K., & Milička, J.: Context Specificity of Lemma. Diachronic Analysis. //Glottometrics 45 2019//, 7.

==== 2018 ====
   *Juola, P., Milička, J., & Zemánek, P.: Authorship and time attribution of Arabic texts using JGAAP. In K. Shaalan, A. E. Hassanien & F. Tolba (eds.),  //Intelligent Natural Language Processing: Trends and Applications// (pp. 325–349). Springer, Cham. ISBN: 978-3-319-67056-0.
   *Milička, J.: Average Word Length from the Diachronic Perspective: The Case of Arabic. //Linguistic Frontiers, 1(2)//, 81-89.
   *Milička, J., & Kalábová, H.: Vowel Disharmony in Czech Words and Stems. In  M. Fidler & V. Cvrček (eds.), //Taming the Corpus: From Inflection and Lexis to Interpretation// (pp. 37–61). Springer, Cham. ISBN: 978-3-319-98017-1.
   *Čech, R., Milička, J., Mačutek, J., Koščová, M., & Lopatková, M.: Quantitative Analysis of Syntactic Dependency in Czech. In J. Jiang & H. Liu (eds.), //Quantitative Analysis of Dependency Structures// (pp 53–70). ISBN: 978-3-11-057356-5.

==== 2017 ====
   *Diatka, V., & Milička, J: The effect of iconicity flash blindness: An empirical study. In A. Zirker, M. Bauer, O. Fisher & C. Ljungberg (eds.), //Dimensions of Iconicity// (pp 3–14). John Benjamins. ISBN 978-90-272-4351-5.
   *Mačutek, J., Čech, R., & Milička, J.: [[https://aclanthology.org/volumes/W17-65/|Menzerath-Altmann Law in Syntactic Dependency Structure]]. In S. Montemagni & J. Nivre (eds.), //Proceedings of the Fourth International Conference on Dependency Linguistics (Depling 2017), September 18-20, 2017, Università di Pisa, Italy (No. 139, pp. 100–107).// Linköping University Electronic Press. ISBN: 978-91-7685-467-9.

==== 2016 ====
   *Milička, J.: Key Length Motifs in Czech and Arabic Texts. In E. Kelih, R. Knight, J. Mačutek & A. Wilson (eds.), //Studies in Quantitative Linguisitcs 23//. (pp. 27–42). RAM – Verlag. ISBN: 978-3-942303-44-6.
   *Čéplö, S., Bátora, J., Benkato, A., Milička, J., Pereira, C., & Zemánek, P.: Mutual intelligibility of spoken Maltese, Libyan Arabic, and Tunisian Arabic functionally tested: A pilot study. //Folia Linguistica, 50(2)//, 583-628.
   *Zemánek, P., & Milička, J.: Restricted collocability and its use in Arabic Corpus Linguistics. In G. C. Pastor (ed.), //Computerised and Corpus-based Approaches to Phraseology: Monolingual and Multilingual Perspectives.// (pp. 67–78). Tradulex. ISBN: 978-2-9700736-5-9.
==== 2015 ====
   *Milička, J.: Synergetic Linguistics: Do We Need Better Explanatory Mechanism?. //Glottotheory, 6(2)//, 291-298.
   *Milička, J.: Is the Distribution of L-Motifs Inherited from the Word Lengths Distribution?. In G. K. Mikros & J. Mačutek (eds.) //Sequences in Language and Text// (pp 133–146). De Gruyter. ISBN: 978-3-11-036273-2.
   *Milička, J.: Is Menzerath’s Law a consequence of segment inventory inhomogeneity?. //Czech and Slovak Linguistic Review, 2015(2)//, 62-71.
==== 2014 ====
   *Milička, J.: Menzerath’s law: the whole is greater than the sum of its parts. //Journal of Quantitative Linguistics, 21(2)//, 85-99.
   *Mikros, G., & Milička, J.: Distribution of the Menzerath’s law on the syllable level in Greek texts. In G. Altmann, R. Čech, J. Mačutek & L. Uhlířová (eds). //Empirical approaches to text and language analysis// (pp 180–189). RAM - Verlag. ISBN 978-3-942303-24-8.
   *Zemánek, P., & Milička, J.: [[https://aclanthology.org/volumes/W14-09/|Quotations, relevance and time depth: Medieval Arabic literature in grids and networks]]. In A. Feldman, A. Kazantseva, & S. Szpakowicz (eds.) //Proceedings of the 3rd Workshop on Computational Linguistics for Literature (CLFL)// (pp. 17–24). ISBN 978-1-937284-88-6.
   *Zemánek, P., & Milička, J.: Ranking Search Results for Arabic Diachronic Corpora. Google-like search engine for (non) linguists. In A. Lakhouaja (ed.), //Proceedings of the 5th International Conference on Arabic Language Processing (CITALA 2014)// (pp. 73–78). Oujda.  

==== 2013 ====
   *Kubát, M., & Milička, J.: Vocabulary richness measure in genres. //Journal of Quantitative Linguistics, 20(4)//, 339-349.
   *Milička, J.: Rank-frequency relation & type-token relation: Two sides of the same coin. In I. Obradović, E. Kelih & R. Köhler (eds.), //Methods and Applications of Quantitative Linguistics: Selected papers of the 8th International Conference on Quantitative Linguistics (QUALICO)// (pp. 163–171). ISBN 978-86-7466-465-0.
==== 2012 ====
   *Milička, J.: Minimal ratio: an exact metric for keywords, collocations etc. //Czech and Slovak Linguistic Review, 2012(1)//, 62-70.
   *Chromý, J., & Milička, J.: Experimentální zkoumání stylotvorných faktorů: první výstupy. //Naše řeč (Our Speech), 95(4)//, 181-187.

==== 2011 ====
   *Milička, J.: A Combinatorial Method for a Context Comparison. In: E. Kelih, V. Levickij &Y. Matskulyak (eds.), //Issues in Quantitative Linguistics 2// (pp 104–109). RAM – Verlag. ISBN 978-3-942303-07-1.
   *Milička, J.: [[https://www.birmingham.ac.uk/Documents/college-artslaw/corpus/conference-archives/2011/Paper193.pdf|Valency and Information Structure: A quantitative approach to from–to juxtaposition in Arabic]]. In //Proceedings of CLC 2011//. 

==== 2010 ====
  *Milička, J.: Budování česko-arabského paralelního korpusu. In F. Čermák & J. Kocek (eds.), //Mnohojazyčný korpus Intercorp: Možnosti studia// (pp 221–225). Nakladatelství Lidových novin. ISBN 978-80-7422-058-6.

==== 2009 ====
   *Milička, J.: Type-token & hapax-token relation: A combinatorial model. //Glottotheory, 2(1)//, 99-110.
   *Milička, J.: [[http://milicka.ff.cuni.cz/kestazeni/clanek_Novy_Orient.pdf|Knihtisk v dějinách islámské kultury]]. //Nový orient, 64(2)//, 42–46.

===== Nástroje =====


    *[[https://chatgpt.com/g/g-pFqRCNeHu-corpus-linguist|Mechanický korpusový lingvista]]: asistent který dokáže sehnat data z korpusu a dále s nimi pracovat (viz [[en:manualy:alpha|info]])
    *[[http://alpha.korpus.cz|Alpha]]: Překladač z přirozeného jazyka do CQL (viz [[en:manualy:alpha|info]])
    *[[http://milicka.cz/en/engrammer|Engrammer]]: Nástroj pro explorativní analýzu kolokací.
    *[[http://www.milicka.cz/keyworder|KeyWorder]]: Program pro rozpoznávání klíčových slov v textu pomocí minimálního poměru.
    *[[http://www.milicka.cz/typetokener|TypeTokener]]: Program, který měří type-token relation, hapax-token relation atd. zvoleného textu a následně pomocí změřené distribuce typů tyto veličiny zpětně modeluje.
    *[[http://www.milicka.cz/lexicographerscalculator|Lexicographers' Calculator]]: Program pro plánování rozsahu korpusu.
    *[[http://milicka.cz/tinfi|Tinfi]]: Program, který označuje části textu, jež z něj vyčnívají.
    *[[http://milicka.cz/blacksquare|BlackSquare]]: Program pro jednoduché (nejen) lingvistické experimenty.
    *[[http://zumky.com/|Zumky]]: Komunikační nástroj pro všechny, kteří si váží svého času, klidu a soukromí.


===== Monografie =====
   *Zemánek, P., & Milička, J. (2017): Words Lost and Found: The Diachronic Dynamics of the Arabic Lexicon. RAM-Verlag. 234 p. ISBN: 978-3-942303-45-3.
   *Zemánek, P., Milička, J., & Ondráš, F. (2017): Al-haraka baraka. Strukturně-variační pohled na středověká arabská přísloví a rčení. Univerzita Karlova, Filozofická fakulta. 167 p. ISBN 978-80-7308-749-4.

===== Kvalifikační práce =====
  *Milička, J. (2022): //[[http://milicka.cz/habilitace.pdf|Lexikální diverzita]]// (Habilitační spis).
  *Milička, J. (2015): //[[http://milicka.cz/disertace.pdf|Teorie komunikace jakožto explanatorní princip přirozené víceúrovňové segmentace textů]]// (Disertace).
===== Recenze =====
   *Milička, J. (2014): Kontroverzní hranice jazykovědy aneb O syntagmatických očích Hany Karadžičové [Recenze knihy Kvantitativní analýza kontextu, V. Cvrček]. //Naše řeč, (4-5)//, 300-304.
   *Milička, J. (2018): Kapitoly z korpusové versologie — cesta správným směrem [recenze knihy Kapitoly z korpusové versologie, P. Plecháč & R. Kolár]. //Česká Literatura, 66(2)//, 286–289.

===== Konference =====
    *2026/6 (JM) hands on workshop //Využítí AI při empirickém lingvistickém výzkumu// Bohemistický kongres 2026, Praha (zvaná).
    *2026/5 (JM, Laura Janda, Dominika Kováříková): prezentace //Grammatical and Syntactic bias in Large Language Models// ICAME 47, Koblenz, Germany.
    *2026/5 (Ondřej Tichý, Barbora Bulantová, Magdalena Titlbachová, AnnaMarklová, JM) prezentace //Benchmarking Large Language Models for Linguistic Research// ICAME 47, Koblenz, Germany.
    *2026/3 (JM, Tomáš Machálek): poster //AI Corpus Linguist: More than a Year of Experience// SIGHUM (LaTeCH-CLfL) 2026 workshop at EACL, Rabat, Morocco. 
    *6/2025 (JM, Anna Marklová, Václav Cvrček) Prezentace //AI-Brown// na konferenci Corpus Linguistics (Birmingham, Velká Británie).
    *6/2025 Prezentace //Comparison of Human-Written and LLM-Generated Texts: An incomplete QL Package// na konferenci QUALICO (Brno, Česká republika).
    *6/2025 (Anna Marklová, JM, Eva Pospíšilová, Ondřej Drobil) Prezentace //The Future of Corpus Linguistics in the World of Large Language Models// na konferenci ICAME (Vilnius, Litva).
    *2/2025 (JM, Anna Marklová) Prezentace //Register variability in machine-generated texts// na workshopu ISCA/ITG (Berlin, Německo).
    *10/2024 (Dominika Kováříková, JM, Václav Cvrček, Michal Láznička) Prezentace //Unlocking Lexical Meaning 
through Grammatical Profiling.// na konferenci EURALEX (Cavtat, Chorvatsko).
    *6/2024 (JM, Anna Marklová, Václav Cvrček) Prezentace //Exploring register variation in human and machine-generated texts: A comparative analysis.// na konferenci ICAME (Vigo, Španělsko).
    *6/2024 Prezentace //Mechanical Corpus Linguist// na konferenci 4EU+ AI Days (Praha, ČR).
    *5/2024 Prezentace //Let’s Delve into the Intricate Tapestry of the Chatgptese// na International Workshop on Corpus and Computational Linguistics (Ostrava, ČR, zvaná).
    *5/2024 Prezentace //Exploring Habibi Corpus: Mapping latent space to real geographic space// na konferenci AIDA (Valletta, Malta).
    *2/2024 Prezentace //Not Your Training Data – Not Your Culture: Exploring Variations in Gender Bias in Large Language Models// na Gender, Technology, and Digital Cultures in the Middle East Conference a účast na panelové diskusi (Dauha, Qatar, zvaná).
    *11/2023 Prezentace //Hledání v korpusech pomocí velkých jazykových modelů: příklady z lingvistiky a dalších oborů// na workshopu Humanitní a společenské vědy perspektivou Digital Humanities (Olomouc, zvaná).
    *9/2023 Prezentace //Our Timelines// na workshopu [[http://milicka.ff.cuni.cz/AIAL2023|AIAL2023]] (Towards AI-Aided Human-Supervised Linguistics, Prague, organizer)
    *6/2023 Prezentace //Modelling Menzerath’s Law with Gaussian Copula// na konferenci QUALICO 2023 (Lausanne).
    *6/2023 Prezentace //A Guided Tour through the Labyrinth of Lexical Diversity// na workshopu International Workshop on Corpus Stylistics and Stylometrics (Ostrava, zvaná).
    *6/2023 (JM a Petr Zemánek) Poster //Principal Component Analysis of Written Arabic Dialects// na konferenci Olinco 2023 (Olomouc, Best Poster Award).
    *11/2022 (JM a Dominika Kováříková) Prezentace //Jak vytěžit textová data Českého národního korpusu pomocí KonTextu// na konferenci Digitální data perspektivou humanitního vědce (Brno, hybridní, zvaná).
    *11/2022 Prezentace //Engrammer, nástroj na automatickou extrakci frazeologie// na workshopu Vývoj elektronické lexikální databáze indoíránských jazyků a podpora zavádění moderních technologií do výuky jazyků (Praha, zvaná).
    *5/2022 Prezentace //The Menzerath-Altmann Law: Time to move on// na workshopu III. Summer Workshop for Statistics in Linguistics (Trojanovice, zvaná).
    *5/2022 Prezentace //Measuring lexical diversity: The influence of lemmatization// na kolokviu SlavLingColl (Berlin, zvaná).
    *9/2021 (JM, Václav Cvrček a David Lukeš) Prezentace //Unpacking Lexical Intertextuality – Number of Types Shared Among Texts// na konferenci QUALICO (Tokyo, online).
    *8/2021 (JM a Denisa Šebestová) Prezentace //Human Freindly Corpus Querry language// na konferenci ICAME (Dortmund).
    *11/2019 Prezentace //Engrammer — On the borders between language and other cultural phenomena that can be quantitatively analyzed via corpus// na workshopu Corpus Driven Quantitative Linguistics Workshop (v Ostravě; zvaná).
   *9/2019 (JM a Denisa Šebestová) Prezentace //Engrammer: Introducing a new tool for the identification of phraseological patterning. Demo and case study on Czech, English and Arabic// na konferenci EUROPHRAS (Málaga).
    *8/2019 (Ján Mačutek, Radek Čech a JM) Prezentace //Length of non-projective sentences: A pilot study using a Czech UD treebank// na konferenci Quasy konané v rámci SyntaxFestu 2019, Paris.
    *7/2019 (JM, Václav Cvrček a Lucie Lukešová) Prezentace //N-gram Length Correspondence in Typologically Different Languages// na konferenci CL2019 Cardiff.
    *6/2019 (Denisa Šebestová a Markéta Malá a JM) Prezentace //The expression of time in English and Czech children’s literature:A contrastive phraseological perspective// na konferenci ICAME (Neuchatel).
    *3/2019 Prezentace //Analysis of Liberal Translations and Cross-Language Plagiarism// na setkání Lingvistické odpoledne 2019 (Olomouc, zvaná).
    *9/2018 (JM a Alžběta Růžičková) Prezentace //Slovak Vowel Phonotactics: Slavic Origins vs. Hungarian Influences// na konferenci SlaviCorp (Praha).
    *7/2018 (JM a Alžběta Růžičková) Prezentace //Demand and Supply in the Communication Process: The Case of Lexical Richness and Phonological Features// na konferenci QUALICO (Wroclaw).
    *9/2017 (Jan Mačutek, Radek Čech a JM) Prezentace a poster //Menzerath-Altmann Law in Syntactic Dependency Structure// na konferenci Depling (Pisa).
    *5/2017 (JM a Hana Kalábová) Prezentace //Vowel Disharmony in Czech: Description and Explanation// na konferenci Linguistics Prague.
    *3/2017 Prezentace //From – To Construction in Arabic and Czech// na konferenci Word Order and Information Structure: a Cross- and Intra-Linguistic Perspective (Olomouc; zvaná).
    *2/2017 Prezentace //Menzerathův-Altmannův zákon: adorovaný model podivného vztahu// na kolokviu Kritické pohledy na Menzerathův-Altmannův zákon (Ostrava; zvaná).
    *8/2016 (JM a Karolína Vyskočilová) Prezentace //Models of noisy channels that speech gets over// na konferenci QUALICO (Trier).
    *12/2015 (JM a Petr Zemánek) Prezentace //Tolerant algorithm for quotation extraction// na Digital Arabic and Persian Research Workshop (Leipzig; zvaná).
    *11/2015 Poster //From Linguistic Theory to an Effective Quotation Extraction Algorithm// na symposiu Methods and Linguistic Theories (MaLT 2015) (Bamberg).
    *10/2015 (Vojtěch Diatka a JM) Prezentace //Můžou se neikonická slova někdy chovat jako ikonická?// na konferenci Lingvistika Praha.
    *7/2015 (JM a Petr Zemánek) Poster //Hypertextualizer. Quotation Extraction Software// na konferenci Corpus Linguistics 2015 (Lancaster).
    *7/2015 (Vojtěch Diatka a JM) Poster //The Iconicity of the "Non-Iconic Words" and its Effects on the Language Processing// na 12th International Symposium of Psycholinguistics (Valencia).
    *6/2015 (JM a Petr Zemánek) Prezentace //Restricted Collocability and its Use in Arabic Corpus Linguistics// na konferenci EUROPHRAS 2015 (Malaga).
    *3/2015 (Vojtěch Diatka, Jiří Milička) Prezentace //Are Iconic Words Statistically more Iconic than Non-Iconic Ones? A New Method of Testing// na 10th International Symposium on Iconicity in Language and Literature (Tübingen).
    *6/2014 Prezentace //Three Models for the Menzerath's Law// na konferenci QUALICO (organizováno IQLA).
    *5/2014 Prezentace //Konfidenční intervaly v empirické lingvistice// na konferenci Lingvistika Praha.
    *4/2014 (JM a Petr Zemánek) Prezentace //Quotations, Relevance and Time Depth: Medieval Arabic Literature in Grids and Networks// na konferenci EACL v Göteborgu (organizovaná Association for Computational Linguistics).
    *7/2012 Prezentace //Rank-frequency Relation & Type-token Relation: Two Sides of the Same Coin// na konferenci QUALICO.
    *7/2011 Prezentace //Valency and the Information Structure. A Quantitative Approach// na Corpus Linguistics Conference v Birminghamu.
    *4/2011 (Petr Zemánek a JM) Prezentace //Arabic Plurals in Context. A Corpus Study// na Workshop on Arabic Corpus Linguistics v Lancasteru.
    *9/2009 Prezentace //Budování česko-arabského paralelního korpusu// na konferenci Intercorp v Praze.


===== Překlady =====


  * Muntasir al-Qaffáš: On. In //Antologie moderních arabských povídek.// Praha 2011, pp 93-97.
  * (Přeloženo s Annou Humlovou) Alí ad-Du'áží: //Po hospodách kolem Středozemního moře.// Praha, Malvern 2013, 76 s.


===== Výuka =====

  *Dříve vyučované
    * Arabština a korpus
    * Úvod do kvantitativní lingvistiky
    * Článek na korpusově-lingvistické téma 
  *Aktuálně vyučované
    * Obecné zákonitosti v textech
    * Užití velkých jazykových modelů
  *Také se aktuálně podílím na předmětech
    * Práce s korpusy: případové studie
    * Úvod do práce s jazykovými korpusy


===== Stáže =====

   * 4/2013–6/2013 Stáž na Univerzitě v Trieru.
   * 10/2013–11/2013 Stáž na Národní a Kapodistrijské univerzitě v Athénách.