DATABANK FOR
ENDANGERED FINNO-UGRIC LANGUAGES
Report



I. ORGANIZATION OF THE PROJECT

* Goals of the Project:

  1. Providing a text corpus collection of endangered Finno-Ugric languages in machine-readable form;
  2. Basic linguistic research.

* Funding:

  1. Academy of Finland (SA)
  2. University of Helsinki: Department of Finno-Ugric Studies
  3. Department of General Linguistics
  4. Joint Committee of the Nordic Research Councils for the Humanities (NOS-H)
  5. The Nordic Research Council

* Length of the Research Period:

  1. Academy of Finland: 1996-1998
  2. Joint Committee of the Nordic Research Councils for the Humanities: 1996-1997

* Project personnel:

  1. Seppo Suhonen, University of Helsinki, Department of Finno-Ugric Studies, chief of the project.
  2. Researchers and research-assistants:

    Finland

    1. University of Helsinki, Department of Finno-Ugric Studies:
      1. NOS-H: Jelena Adel (part-time researcher), Jarmo Alatalo (full-time researcher), Miikul Pahomov (part-time researcher), and Merja Salo (full-time researcher)
      2. Academy of Finland: Erja Kujala (short-term research-assistant), Jack Rueter, and Tapani Salminen (part-time researchers).

    2. Department of General Linguistics:
      1. Academy of Finland: Pirkko Suihkonen (full-time researcher).

    Sweden

    1. University of Uppsala, Department of Finno-Ugric Studies
      1. NOS-H: André Hesselbäck and Manja Lehto (full-time researchers)

    2. University of Umeå, Institute for Saami
      1. NOS-H: Olavi Korhonen (part-time researcher).

    Norway

    1. Nord-Trøndelag College, Department of Education:
      • NOS-H: Nora Bransfjell (part-time researcher)

    2. Norwegian Computing Centre for the Humanities:
      • NOS-H: Sjur Moshagen (full-time researcher)

    3. Norwegian University for Science and Technology, Department of Linguistics:
      • NOS-H: Sagka Renander (short-term research-assistant).


II. COMPUTER CORPORA

* Languages from which the computer corpora will be created:

  1. Finland: Komi and Erzya (Jack Rueter), Khanty (Merja Salo), Nenets (Tapani Salminen), Selkup and Kamassian (Jarmo Alatalo), Livonian (Seppo Suhonen);
  2. Sweden: Ingrian (Manja Lehto), Hill Mari (André Hesselbäck), and Ume Saami (Olavi Korhonen);
  3. Norway: Southern Saami (Nora Bransfjell, Sjur Moshagen, Sagka Renander).

* Samples of the following languages were adapted for use in the University of Helsinki Language Corpus Server (UHLCS) in 1996-1999:


Pirkko Suihkonen, Aug. 9, 1998. Updated in Aug. 2002.