SOTAVerified

Automatic Reconstruction of Missing Romanian Cognates and Unattested Latin Words

2020-05-01LREC 2020Unverified0· sign in to hype

Alina Maria Ciobanu, Liviu P. Dinu, Laurentiu Zoicas

Unverified — Be the first to reproduce this paper.

Reproduce

Abstract

Producing related words is a key concern in historical linguistics. Given an input word, the task is to automatically produce either its proto-word, a cognate pair or a modern word derived from it. In this paper, we apply a method for producing related words based on sequence labeling, aiming to fill in the gaps in incomplete cognate sets in Romance languages with Latin etymology (producing Romanian cognates that are missing) and to reconstruct uncertified Latin words. We further investigate an ensemble-based aggregation for combining and re-ranking the word productions of multiple languages.

Tasks

Reproductions