AplikaceAplikace
Nastavení

This is an old revision of the document!


Corpus lEstRepublicain

Corpus consists of 3 volumes (1999, 2002, 2003; not all of them complete) of French regional newspaper L'Est Républicain. It contains almost 120 million words and it was built from CNRTL data. The corpus is lemmatised and POS-tagged by TreeTagger.

For technical reasons, corpus lEstRepublicain is not included in the standard corpus list for Bonito 1; it is only available via the web interface.