Differences

This shows you the differences between two versions of the page.

en:cnk:eebo [2016/04/04 13:57] – created veronikapojarova
en:cnk:eebo [2025/05/28 14:36] (current) – [How to cite] michalkren
Line 1: Line 1:
 ====== EEBO (Early English Books Online) ======
  
-The EEBO corpus contains more than 25 000 English texts from the period 1475--1700, which were digitalized by the[[http://www.textcreationpartnership.org/|Text Creation Partnership]] organization as part of the [[http://www.textcreationpartnership.org/tcp-eebo/|Early English Books Online]] project; a detailed description of the digitalizstion process is available [[http://www.textcreationpartnership.org/docs/|here]]. The size of the corpus is approximately 730 million words.
+The **EEBO version 1** corpus contains more than 25 000 English texts from the period 1475--1700, which were digitalized by the [[http://www.textcreationpartnership.org/|Text Creation Partnership]] organization during Phase 1 of the [[https://textcreationpartnership.org/tcp-texts/eebo-tcp-early-english-books-online/|Early English Books Online]] project; a detailed description of the digitalization process is available [[http://www.textcreationpartnership.org/docs/|here]]. The overall size of the EEBO v1 corpus is **730 million running words**.
  
 +The **EEBO version 2** corpus is composed of the 25,363 texts created during Phase 1 and the 28,462 texts created during Phase 2 of the [[https://ota.bodleian.ox.ac.uk/repository/xmlui/handle/20.500.12024/6|EEBO-TCP Partnership]]. The texts have been processed by the [[https://earlyprint.org/intros/|EarlyPrint]] collaborative effort to facilitate linguistic research: they have been tokenized, regularized, lemmatized and [[https://earlyprint.org/intros/nupos_tag_set.html|part-of-speech tagged]]. The overall size of the EEBO v2 corpus is **1,300 million running words**. Both the addition of the Phase 2 texts and the **linguistic processing and annotation** constitute the update over EEBO v1.
 + 
 Metadata and structuring of texts were treated for use in the KonText interface in such a way that their basic structural information was preserved (text highlights, its division etc.) including links to the on-line version. The meanings of the individual structures and their attributes are based on the [[http://www.tei-c.org/Vault/P5/current/doc/tei-p5-doc/en/html/|TEI P5]], and are also described in the following chart:
  
Line 31: Line 33:
 | ''<q>'' |  | citation |
 | ''<bibl>'' |  | bibliographic citation |
 +
 +===== Wiki course =====
 +
 +For a basic overview of how to use the //**EEBO version 1**// corpus and how to input data into the search interface, see our wiki course in eight lessons:
 +
 +  * [[en:eebo:first_query|Lesson 1 (First query)]]
 +  * [[en:eebo:orthography_spelling|Lesson 2 (Orthography and Spelling)]]
 +  * [[en:eebo:competing_forms|Lesson 3 (Competing forms)]]
 +  * [[en:eebo:specify_query|Lesson 4 (Specify query)]]
 +  * [[en:eebo:collocations|Lesson 5 (Collocations)]]
 +  * [[en:eebo:morphology1|Lesson 6 (Morphology I)]]
 +  * [[en:eebo:morphology2|Lesson 7 (Morphology II)]]
 +  * [[en:eebo:multiword|Lesson 8 (Multiword expressions)]]
  
 ===== How to cite =====
 
 <WRAP round tip 70%>
-EEBO Early English Books Online. Ústav Českého národního korpusu FF UK, Prague 2014. Available from WWW: http://www.korpus.cz
+//EEBO: Early English Books Online, version 1, 1. 12. 2015//. Ústav Českého národního korpusu FF UK, Prague 2014. Available from WWW: http://www.korpus.cz
 +
 +//EEBO: Early English Books Online, version 2, 14. 3. 2025//. Ústav lingvistiky FF UK, Prague 2025. Available from WWW: http://www.korpus.cz
 </WRAP>
 +