Methods of text representation for minority/endangered languages.
The last ten
to fifteen years have seen a significant theoretical effort resulting in
cumulative methodological developments in the area of minority language
documentation, including methods of interlinear glossing. The results have been
exhaustively presented in papers by Christian Lehmann as well as in the so
The issues to be considered at the Round Table include the following: the inventory of morphological glosses; inventory and rules of using the delimiters; standard (IPA) transcription and connected issues; representation of phenomena typical of some special kinds of texts (e.g. representing falstarts and self-repairs in spontaneously produced narratives and dialogs), and technical problems of representing the glossed texts by means of various pieces of software accessible worldwide. Some of these issues simply demand a discussion. One of the most important objectives of the discussion could be a search for a balance between representational standardization and efficiency of the standard for representing the data of various given languages. Specifically, a unified glossing standard must be both strict enough to make the representation of structurally different data comparable and loose enough to provide the researcher with a certain degree of flexibility regarding the choices necessary for an adequate and efficient representation of linguistic data. (It should be noted that not all choices of this sort are equally essential/obligatory, because the researcherís knowledge of the language is often limited). Some specific problems and questions are:
∑ Is it necessary or redundant to represent prosodic data (stress, phrasal prosody etc.)?
∑ The difficulties of choosing one or other delimiter because of the lack of the data or scalar opposition (such as dot or dash with fusion / cumulation);
∑ The absence of a robust theoretical approach or empirical data to suggest that a specific segment is a separate phonetic word or a clitic.
Just as is the case of the meta-language of theoretical linguistics, it is quite obvious that a search for a universal meta-language of text representation needs to be based on data from typologically diverse languages and requires a joint effort of experts in these languages. LENCA-3 is a forum that suits both these requirements very well.
Language: Russian, English.
Duration: one section.
A.E. Kibrik (
M.A. Daniel (