SOTAVerified

Natural Language Generation for Polysynthetic Languages: Language Teaching and Learning Software for Kanyen'k\'eha (Mohawk)

2018-08-01COLING 2018Unverified0· sign in to hype

Greg Lessard, Nathan Brinklow, Michael Levison

Unverified — Be the first to reproduce this paper.

Reproduce

Abstract

Kanyen'k\'eha (in English, Mohawk) is an Iroquoian language spoken primarily in Eastern Canada (Ontario, Qu\'ebec). Classified as endangered, it has only a small number of speakers and very few younger native speakers. Consequently, teachers and courses, teaching materials and software are urgently needed. In the case of software, the polysynthetic nature of Kanyen'k\'eha means that the number of possible combinations grows exponentially and soon surpasses attempts to capture variant forms by hand. It is in this context that we describe an attempt to produce language teaching materials based on a generative approach. A natural language generation environment (ivi/Vinci) embedded in a web environment (VinciLingua) makes it possible to produce, by rule, variant forms of indefinite complexity. These may be used as models to explore, or as materials to which learners respond. Generated materials may take the form of written text, oral utterances, or images; responses may be typed on a keyboard, gestural (using a mouse) or, to a limited extent, oral. The software also provides complex orthographic, morphological and syntactic analysis of learner productions. We describe the trajectory of development of materials for a suite of four courses on Kanyen'k\'eha, the first of which will be taught in the fall of 2018.

Tasks

Reproductions