Corpus and Models for Lemmatisation and POS-tagging of Old French
2021-09-23Code Available0· sign in to hype
Jean-Baptiste Camps, Thibault Clérice, Frédéric Duval, Lucence Ing, Naomi Kanaoka, Ariane Pinche
Code Available — Be the first to reproduce this paper.
ReproduceCode
- github.com/chartes/OF3Cnone★ 2
Abstract
Old French is a typical example of an under-resourced historic languages, that furtherly displays animportant amount of linguistic variation. In this paper, we present the current results of a long going project (2015-...) and describe how we broached the difficult question of providing lemmatisation andPOS models for Old French with the help of neural taggers and the progressive constitution of dedicated corpora.