SOTAVerified

Alignment Data base for a Sign Language Concordancer

2020-05-01LREC 2020Unverified0· sign in to hype

Marion Kaczmarek, Michael Filhol

Unverified — Be the first to reproduce this paper.

Reproduce

Abstract

This article deals with elaborating a data base of alignments of parallel Franch-LSF segments. This data base is meant to be searched using a concordancer which we are also designing. We wish to equip Sign Language translators with tools similar to those used in text-to-text translation. To do so, we need language resources to feed them. Already existing Sign Language corpora can be found, but do not match our needs: working around a Sign Language concordancer, the corpus must be a parallel one and provide various examples of vocabulary and grammatical construction. We started with a parallel corpus of 40 short news and 120 SL videos , which we aligned manually by segments of various length. We described the methodology we used, how we define our segments and alignments. The last part concerns how we hope to allow the data base to keep growing in a near future.

Tasks

Reproductions