University homepage Suomeksi
University of Helsinki
Language Technology

Contact information

Department of Modern Languages
P.O.Box 24 (Unioninkatu 40)
FI-00014 University of Helsinki

Phone +358 (09) 1911
Fax +358 (09) 191 28313

PLEASE NOTE: The FinnWordNet Web pages have moved to the Portal of the Language Bank of Finland, and these outdated pages are no longer updated. (2019-09-19)

FinnWordNet: Download files

The FinnWordNet data files can be downloaded in three formats. If you want to compile the database yourself, you need a modified version of the WordNet Grind program. The FinnWordNet (and Priceton WordNet) lexical data is also used as a basis of HFST transducers working as thesauri or bilingual translation dictionaries.

Data files

The FinnWordNet data can be downloaded in three formats as ZIP packages. For more information, please see the README file common to all the packages.

  • (11.2 MiB) (README):
    FinnWordNet data as relations: synonym sets, word senses, semantic and lexical relations, and translations
  • (8.3 MiB) (README):
    Lists generated from the FinnWordNet data, supplementing the relational data
  • (10.4 MiB) (README):
    WordNet database files (in the format used by the wn search program)
  • (5.1 MiB) (README):
    Lexicographer files (source format for the data files)

Searching the FinnWordNet data files with the wn search program requires a version patched by Debian.

Modified Grind program

Compiling the FinnWordNet lexicographer files to the WordNet database format requires a modified version of the Princeton WordNet 3.0 Grind program. Since the compiled database is downloadable above, you probably do not need the modified Grind unless you modify the FinnWordNet data (lexicographer files).

The source code of the modified Grind is downloadable as either an independent package or a patch file, which requires the source code of the original WordNet Grind program. In both cases, compiling the program requires a Unix, Linux or similar environment and a C compiler. The README file contains more information about the modified Grind.

HFST thesaurus and translation dictionary transducers based on FinnWordNet

The FinnWordNet (and Princeton WordNet) data is also used in the HFST finite-state transducers that work as Finnish or English thesauri or Finnish–English or English–Finnish translation dictionaries. The transducers recognize inflected forms of words, and the thesauri have variants generating synonyms in the same inflected form as the input word. More information about the transducers is available in the README file.

To use the transducers, you need either the full HFST library and tools (version 3.2.0 or later), the stand-alone HFST optimized lookup (version 1.3 or later) or the Java implementation of the optimized lookup (2011-05-23 or later).

Search the department webpages:

Department homepage and contact information | Faculty homepage | University homepage

Copyright 2003-2008 University of Helsinki. All rights reserved.