~~NOTOC~~

====== Corpora Aranea ======

**Aranea** is a family of comparable web corpora prepared by [[http://www.juls.savba.sk/%7Evladob|Vladimír Benko]]. More information is available [[http://ucts.uniba.sk/aranea_about/index.html|here]].

===== Corpora available as of December 2014 =====

  * **Araneum Anglicum** Maius & Minus 14.12
  * **Araneum Anglicum Asiaticum** Maius & Minus 14.10
  * **Araneum Finnicum** Maius & Minus 14.08
  * **Araneum Francogallicum** Maius & Minus 14.04
  * **Araneum Germanicum** Maius & Minus 14.04
  * **Araneum Hungaricum** Maius & Minus 14.12
  * **Araneum Hispanicum** Maius & Minus 14.07
  * **Araneum Italicum** Maius & Minus 14.12
  * **Araneum Nederlandicum** Maius & Minus 14.04
  * **Araneum Polonicum** Maius & Minus 14.04
  * **Araneum Russicum** Maius & Minus 14.04
  * **Araneum Slovacum** Maius & Minus 14.08

===== Citing Aranea =====

Benko, V.: Aranea - comparable web corpora. Ústav Českého národního korpusu FF UK, Praha 2015. Available on-line: .

Benko, V. (2014): Aranea: Yet Another Family of (Comparable) Web Corpora. In: Sojka, P. – Horák, A. – Kopeček, I. – Pala, K. (eds): //TSD 2014//, LNAI 8655, 257–264. Springer International Publishing. ([[http://www.tsdconference.org/tsd2014/download/preprints/672.pdf|PDF to download]])

Benko, V. (2024): The Aranea Corpora Family: Ten+ Years of Processing Web-Crawled Data. In: Nöth, E. – Horák, A. – Sojka, P. (eds): //Text, Speech, and Dialogue. TSD 2024.// Lecture Notes in Computer Science, vol 15048. Springer, Cham. https://doi.org/10.1007/978-3-031-70563-2_5