| Both sides previous revisionPrevious revisionNext revision | Previous revision |
| en:cnk:eebo [2025/03/17 15:30] – [Wiki course] michalkren | en:cnk:eebo [2025/05/28 14:36] (current) – [How to cite] michalkren |
|---|
| ====== EEBO (Early English Books Online) ====== | ====== EEBO (Early English Books Online) ====== |
| |
| The **EEBO version 1** corpus contains more than 25 000 English texts from the period 1475--1700, which were digitalized by the [[http://www.textcreationpartnership.org/|Text Creation Partnership]] organization during Phase 1 of the [[http://quod.lib.umich.edu/e/eebo|Early English Books Online]] project; a detailed description of the digitalization process is available [[http://www.textcreationpartnership.org/docs/|here]]. Overall size of the EEBO v1 corpus is 730 million running words. | The **EEBO version 1** corpus contains more than 25 000 English texts from the period 1475--1700, which were digitalized by the [[http://www.textcreationpartnership.org/|Text Creation Partnership]] organization during Phase 1 of the [[https://textcreationpartnership.org/tcp-texts/eebo-tcp-early-english-books-online/|Early English Books Online]] project; a detailed description of the digitalization process is available [[http://www.textcreationpartnership.org/docs/|here]]. Overall size of the EEBO v1 corpus is **730 million running words**. |
| |
| The **EEBO version 2** corpus is composed of the 25,363 texts created during Phase 1 and the 28,462 texts created during Phase 2 of the [[https://ota.bodleian.ox.ac.uk/repository/xmlui/handle/20.500.12024/6|EEBO-TCP Partnership]]. The texts have been processed to facilitate linguistic research by the [[https://earlyprint.org/intros/|EarlyPrint]] collaborative effort, namely they have been tokenized, regularized, lemmatized and [[https://earlyprint.org/intros/nupos_tag_set.html|part of speech tagged]]. Overall size of the EEBO v2 corpus is 1,300 million running words. Both the addition of the Phase 2 texts as well as the linguistic processing and annotation constitute the update over EEBO v1. | The **EEBO version 2** corpus is composed of the 25,363 texts created during Phase 1 and the 28,462 texts created during Phase 2 of the [[https://ota.bodleian.ox.ac.uk/repository/xmlui/handle/20.500.12024/6|EEBO-TCP Partnership]]. The texts have been processed to facilitate linguistic research by the [[https://earlyprint.org/intros/|EarlyPrint]] collaborative effort, namely they have been tokenized, regularized, lemmatized and [[https://earlyprint.org/intros/nupos_tag_set.html|part of speech tagged]]. Overall size of the EEBO v2 corpus is **1,300 million running words**. Both the addition of the Phase 2 texts as well as the **linguistic processing and annotation** constitute the update over EEBO v1. |
| | |
| Metadata and structuring of texts were treated for use in the KonText interface in such a way that their basic structural information was preserved (text highlights, its division etc.) including links to the on-line version. The meanings of the individual structures and their attributes are based on the [[http://www.tei-c.org/Vault/P5/current/doc/tei-p5-doc/en/html/|TEI P5]], and are also described in the following chart: | Metadata and structuring of texts were treated for use in the KonText interface in such a way that their basic structural information was preserved (text highlights, its division etc.) including links to the on-line version. The meanings of the individual structures and their attributes are based on the [[http://www.tei-c.org/Vault/P5/current/doc/tei-p5-doc/en/html/|TEI P5]], and are also described in the following chart: |
| |
| <WRAP round tip 70%> | <WRAP round tip 70%> |
| //EEBO: Early English Books Online, version 1//. Ústav Českého národního korpusu FF UK, Prague 2014. Available from WWW: http://www.korpus.cz | //EEBO: Early English Books Online, version 1, 1. 12. 2015//. Ústav Českého národního korpusu FF UK, Prague 2014. Available from WWW: http://www.korpus.cz |
| |
| //EEBO: Early English Books Online, version 2, 14. 3. 2025//. Ústav Českého národního korpusu FF UK, Prague 2025. Available from WWW: http://www.korpus.cz | //EEBO: Early English Books Online, version 2, 14. 3. 2025//. Ústav lingvistiky FF UK, Prague 2025. Available from WWW: http://www.korpus.cz |
| </WRAP> | </WRAP> |
| |