SOTAVerified

Automatic Speech Recognition and Query By Example for Creole Languages Documentation

2022-05-01Findings (ACL) 2022Code Available0· sign in to hype

Cécile Macaire, Didier Schwab, Benjamin Lecouteux, Emmanuel Schang

Code Available — Be the first to reproduce this paper.

Reproduce

Code

Abstract

We investigate the exploitation of self-supervised models for two Creole languages with few resources: Gwadloupéyen and Morisien. Automatic language processing tools are almost non-existent for these two languages. We propose to use about one hour of annotated data to design an automatic speech recognition system for each language. We evaluate how much data is needed to obtain a query-by-example system that is usable by linguists. Moreover, our experiments show that multilingual self-supervised models are not necessarily the most efficient for Creole languages.

Tasks

Reproductions