AplikaceAplikace
Nastavení

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revisionPrevious revision
Next revision
Previous revision
en:obc:start [2020/02/18 12:41] – [The digitalization process] michalskrabalen:obc:start [2021/02/04 18:59] (current) – [How to cite] michalkren
Line 12: Line 12:
 The original pages of the Proceedings were scanned and the scans are now available at [[https://www.oldbaileyonline.org/index.jsp|Old Bailey Online]]; you can access individual scans by clicking on the “see original” link on the right of the text of any trial (e.g. [[https://www.oldbaileyonline.org/browse.jsp?div=t18020217-3|here]]). The texts were then manually transcribed by multiple typists and an optical character recognition (OCR) software was employed to create transcriptions for comparison so any differences or inaccuracies could be resolved. However, as the original pages are often faded or otherwise damaged (see, for example, [[https://www.oldbaileyonline.org/images.jsp?doc=OA174005070012|here]]), it is not always possible to guarantee a 100% accuracy of the transcriptions. Users are therefore advised to consult the scanned pages when a very precise reading is required. More on the digitalization process [[https://www.oldbaileyonline.org/static/Project.jsp#methods|here]]. The original pages of the Proceedings were scanned and the scans are now available at [[https://www.oldbaileyonline.org/index.jsp|Old Bailey Online]]; you can access individual scans by clicking on the “see original” link on the right of the text of any trial (e.g. [[https://www.oldbaileyonline.org/browse.jsp?div=t18020217-3|here]]). The texts were then manually transcribed by multiple typists and an optical character recognition (OCR) software was employed to create transcriptions for comparison so any differences or inaccuracies could be resolved. However, as the original pages are often faded or otherwise damaged (see, for example, [[https://www.oldbaileyonline.org/images.jsp?doc=OA174005070012|here]]), it is not always possible to guarantee a 100% accuracy of the transcriptions. Users are therefore advised to consult the scanned pages when a very precise reading is required. More on the digitalization process [[https://www.oldbaileyonline.org/static/Project.jsp#methods|here]].
  
-The texts were marked-up in XML (Extensible Markup Language) according to the [https://tei-c.org/|TEI]] (Text Encoding Initiative) guidelines.+The texts were marked-up in XML (Extensible Markup Language) according to the [[https://tei-c.org/|TEI]] (Text Encoding Initiative) guidelines.
  
 Every single //doc// structure represents one proceeding and consists of multiple //text// structures, the first of which is usually the front matter (or else according to the //type// attribute) and the following contain the trial account itself.  Every single //doc// structure represents one proceeding and consists of multiple //text// structures, the first of which is usually the front matter (or else according to the //type// attribute) and the following contain the trial account itself. 
Line 48: Line 48:
  
 <WRAP round tip 70%> <WRAP round tip 70%>
-//OBC: The Old Bailey Corpus 2.0//. Ústav Českého národního korpusu FF UK, Prague 2020. Available from WWW: http://www.korpus.cz+//OBC: The Old Bailey Corpus 2.0//. Ústav Českého národního korpusu FF UK, Prague 2021. Available from WWW: http://www.korpus.cz
  
 **The original Old Bailey Corpus**: Huber, M. - Nissel, M. - Puga, K. (2016): //Old Bailey Corpus 2.0//. [[http://hdl.handle.net/11858/00-246C-0000-0023-8CFB-2|hdl:11858/00-246C-0000-0023-8CFB-2]] **The original Old Bailey Corpus**: Huber, M. - Nissel, M. - Puga, K. (2016): //Old Bailey Corpus 2.0//. [[http://hdl.handle.net/11858/00-246C-0000-0023-8CFB-2|hdl:11858/00-246C-0000-0023-8CFB-2]]