Multilingual Resources
of the University of Helsinki Language Corpus Server

Metadata Descriptions


The machine-readable linguistic data located at the University of Helsinki Language Corpus Server (UHLCS) are described and catalogued with the metadata descriptions, which are publicly open. The metadata descriptions are located at the sub-directory /corpus-metadata of the directory metadata. The metadata descriptions are first arranged (1) according to the names of the language families, then (2) according to the names of languages, and, finally, (3) according to the names of the corpora the metadata descriptions concern. The metadata descriptions of the word lists are located at a separate directory.

The metadata descriptions for the corpora located at the UHLCS are originally prepared with the metadata editor (IMDI-editor) (the Technical Department, Max Planck Institute for Psycholinguistics, Nijmegen). The metadata descriptions are adapted to concern written linguistic data. The principles followed in preparing the metadata descriptions are defined within the framework of the ISLE-Project (International Standard for Language Engineering).

On the tools available for the use of the corpora at the UHLCS