====== EEBO (Early English Books Online) ======

The **EEBO version 1** corpus contains more than 25,000 English texts from the period 1475--1700, which were digitized by the [[http://www.textcreationpartnership.org/|Text Creation Partnership]] during Phase 1 of the [[https://textcreationpartnership.org/tcp-texts/eebo-tcp-early-english-books-online/|Early English Books Online]] project; a detailed description of the digitization process is available [[http://www.textcreationpartnership.org/docs/|here]]. The overall size of the EEBO v1 corpus is 730 million running words.

The **EEBO version 2** corpus is composed of the 25,363 texts created during Phase 1 and the 28,462 texts created during Phase 2 of the [[https://ota.bodleian.ox.ac.uk/repository/xmlui/handle/20.500.12024/6|EEBO-TCP Partnership]]. To facilitate linguistic research, the texts have been processed by the [[https://earlyprint.org/intros/|EarlyPrint]] collaborative effort: they have been tokenized, regularized, lemmatized and [[https://earlyprint.org/intros/nupos_tag_set.html|part-of-speech tagged]]. The overall size of the EEBO v2 corpus is 1,300 million running words. The update over EEBO v1 consists of the added Phase 2 texts and the linguistic processing and annotation.

The metadata and structure of the texts were adapted for use in the KonText interface so that basic structural information (text highlights, text division, etc.) is preserved, including links to the online version.
The meanings of the individual structures and their attributes are based on [[http://www.tei-c.org/Vault/P5/current/doc/tei-p5-doc/en/html/|TEI P5]] and are also described in the following table:

^ structure ^ attribute ^ description ^
| '''' | title | document title |
| '''' | author | document author |
| '''' | year | publication year (may be an interval) |
| '''' | decade | decade containing the publication year |
| '''' | period | period containing the publication year |
| '''' | biblio | bibliographic information |
| '''' | webSource | full text link in HTML format |
| '''' | ePubSource | full text link in ePUB format |
| '''' | id | document identifier |
| '''' | type | part of text and its type |
| '''' | | heading |
| '''' | | paragraph |
| '''' | rend | highlight and type (typefaces etc.) |
| '''' | facs | link to a page containing a scan (limited access) |
| '''' | | verse |
| '''' | | line |
| '''' | | utterance (esp. in plays) |
| '''' | | speaker (esp. in plays) |
| '''' | | stage note (esp. in plays) |
| '''' | | list |
| ''