
Differences

This shows you the differences between two versions of the page.

en:cnk:eebo [2025/03/17 13:47] – [Wiki course] michalkren
en:cnk:eebo [2025/05/28 14:36] (current) – [How to cite] michalkren
Line 1: Line 1:
 ====== EEBO (Early English Books Online) ======
  
-The EEBO version 1 corpus contains more than 25 000 English texts from the period 1475--1700, which were digitalized by the [[http://www.textcreationpartnership.org/|Text Creation Partnership]] organization during Phase 1 of the [[http://www.textcreationpartnership.org/tcp-eebo/|Early English Books Online]] project; a detailed description of the digitalization process is available [[http://www.textcreationpartnership.org/docs/|here]].
+The **EEBO version 1** corpus contains more than 25 000 English texts from the period 1475--1700, which were digitalized by the [[http://www.textcreationpartnership.org/|Text Creation Partnership]] organization during Phase 1 of the [[https://textcreationpartnership.org/tcp-texts/eebo-tcp-early-english-books-online/|Early English Books Online]] project; a detailed description of the digitalization process is available [[http://www.textcreationpartnership.org/docs/|here]]. Overall size of the EEBO v1 corpus is **730 million running words**.
  
-The EEBO version 2 corpus is composed of the 25,363 texts created during Phase 1 and the 28,462 texts created during Phase 2 of the [[https://ota.bodleian.ox.ac.uk/repository/xmlui/handle/20.500.12024/6|EEBO-TCP Partnership]]. The texts have been processed to facilitate linguistic research by the [[https://earlyprint.org/intros/|EarlyPrint]] collaborative effort, namely they have been tokenized, regularized, lemmatized and [[https://earlyprint.org/intros/nupos_tag_set.html|part of speech tagged]].
-
-Both the addition of the Phase 2 texts as well as the linguistic processing and annotation constitute the update over EEBO v1.
+The **EEBO version 2** corpus is composed of the 25,363 texts created during Phase 1 and the 28,462 texts created during Phase 2 of the [[https://ota.bodleian.ox.ac.uk/repository/xmlui/handle/20.500.12024/6|EEBO-TCP Partnership]]. The texts have been processed to facilitate linguistic research by the [[https://earlyprint.org/intros/|EarlyPrint]] collaborative effort, namely they have been tokenized, regularized, lemmatized and [[https://earlyprint.org/intros/nupos_tag_set.html|part of speech tagged]]. Overall size of the EEBO v2 corpus is **1,300 million running words**. Both the addition of the Phase 2 texts as well as the **linguistic processing and annotation** constitute the update over EEBO v1.
    
 Metadata and structuring of texts were treated for use in the KonText interface in such a way that their basic structural information was preserved (text highlights, its division etc.) including links to the on-line version. The meanings of the individual structures and their attributes are based on the [[http://www.tei-c.org/Vault/P5/current/doc/tei-p5-doc/en/html/|TEI P5]], and are also described in the following chart:
Line 38: Line 36:
 ===== Wiki course =====
  
-For a basic overview of how to use the EEBO corpus and how to input the data into the search interface check our wiki-course in eight lessons:
+For a basic overview of how to use the //**EEBO version 1**// corpus and how to input the data into the search interface check our wiki-course in eight lessons:
  
   * [[en:eebo:first_query|Lesson 1 (First query)]]
Line 48: Line 46:
   * [[en:eebo:morphology2|Lesson 7 (Morphology II)]]
   * [[en:eebo:multiword|Lesson 8 (Multiword expressions)]]
- 
-**Please note that the exercises above were created for EEBO v1.** 
  
 ===== How to cite =====
  
 <WRAP round tip 70%>
-//EEBO: Early English Books Online, version 1//. Ústav Českého národního korpusu FF UK, Prague 2014. Available from WWW: http://www.korpus.cz
+//EEBO: Early English Books Online, version 1, 1. 12. 2015//. Ústav Českého národního korpusu FF UK, Prague 2014. Available from WWW: http://www.korpus.cz
 
-//EEBO: Early English Books Online, version 2, 14. 3. 2025//. Ústav Českého národního korpusu FF UK, Prague 2025. Available from WWW: http://www.korpus.cz
+//EEBO: Early English Books Online, version 2, 14. 3. 2025//. Ústav lingvistiky FF UK, Prague 2025. Available from WWW: http://www.korpus.cz
 </WRAP>