Obsah

Park – the InterCorp User Interface: HOWTO in brief

InterCorp is accessible by the same login as for other CNK corpora - you can get it for free by registering on the Registration page or by using the top right link on the login page.

After entering your user ID and password a page opens with a link to enter a new query and a switch between the current and the previous version of the corpus. If you return to this page later from the navigation menu by clicking Home, a list of still active queries will be shown. Clicking on a query recalls it.

Restricting the search scope:

Clicking on New query opens a page with a list of currently available languages and texts. At first you need to left-click on the check boxes next to at least two languages.

If you wish to search all texts in the core while ignoring both the collections of automatically processed texts and the option to select specific texts to be searched, you can proceed straight to the specification of your query.

If you wish to search the collections in addition to the core, you can specify Include for each individual collection, as long as the collection is available for your choice of languages. The collections are added to the whole core and you can proceed to entering the query.

If you wish to restrict the set of searched texts by selection criteria applied to known parameters, you can use a filter in the following way:

The steps must be followed in that sequence becouse change in any preceding step usually resets the setting of all subsequent steps.

Making a query:

Displaying parallel concordances:

An option to go back to previous queries and results

Straightforward queries including contracted forms into tagged or lemmatized texts may fail. This includes forms such as can't or I'm, which are split by the tagger into two parts (ca+n't and I+'m) with corresponding lemmas and tags. Similarly with Polish forms byłam or gdybyś (była+m and gdyby+ś). Tokenization may even introduce errors: gdzie ś za Wisłą. A query intended to find the whole contracted form should be typed in as a Phrase, with the split parts separated by a space. Only the individual parts of the contracted form are assigned a tag and a lemma.

Morphological tags including characters with a special meaning in regular expressions, e.g. „$“ in the English tag „wp$“, must be preceded in queries by a backslash: tag=„wp\$“.

See description of the corpus for more details on morphosyntactic tags.

Last update: 10 April 2013