Developing multilingual speech synthesis system for Ojibwe, Mi'kmaq, and Maliseet

2025-02-04

Shenran Wang, Changbing Yang, Mike Parkhill, Chad Quinn, Christopher Hammerly, Jian Zhu

Code

github.com/ShenranTomWang/TTS
Official · PyTorch · ★ 11

Abstract

We present lightweight flow matching multilingual text-to-speech (TTS) systems for Ojibwe, Mi'kmaq, and Maliseet, three Indigenous languages in North America. Our results show that training a multilingual TTS model on three typologically similar languages can improve performance over monolingual models, especially when data are scarce. Attention-free architectures are highly competitive with self-attention architectures while offering higher memory efficiency. Our research not only advances technical development for the revitalization of low-resource languages but also highlights the cultural gap in human evaluation protocols, calling for a more community-centered approach to human evaluation.
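For readers unfamiliar with flow matching, the sketch below illustrates the general idea in plain Python. It is not taken from the paper or its repository. It assumes the common straight-line (linear interpolation) probability path between a Gaussian noise sample and a data sample, and the names `interpolate`, `target_velocity`, `flow_matching_loss`, `euler_sample`, and `model` are hypothetical. A real TTS system would use a neural network over acoustic features conditioned on text, which this toy example omits.

```python
import random

def interpolate(x0, x1, t):
    """Point at time t on the straight line from noise x0 (t=0) to data x1 (t=1)."""
    return [(1.0 - t) * a + t * b for a, b in zip(x0, x1)]

def target_velocity(x0, x1):
    """Velocity of the straight-line path, constant in t: x1 - x0."""
    return [b - a for a, b in zip(x0, x1)]

def flow_matching_loss(model, x1, rng):
    """One-sample estimate of the flow matching regression loss.

    `model(x, t)` is a hypothetical velocity predictor returning a list
    with the same length as x.
    """
    x0 = [rng.gauss(0.0, 1.0) for _ in x1]  # Gaussian noise sample
    t = rng.random()                         # time drawn from [0, 1)
    xt = interpolate(x0, x1, t)
    pred = model(xt, t)
    target = target_velocity(x0, x1)
    # Mean squared error between predicted and target velocity
    return sum((p - u) ** 2 for p, u in zip(pred, target)) / len(x1)

def euler_sample(model, x0, steps=10):
    """Generate a sample by integrating dx/dt = model(x, t) from t=0 to t=1."""
    x = list(x0)
    dt = 1.0 / steps
    for i in range(steps):
        v = model(x, i * dt)
        x = [xi + dt * vi for xi, vi in zip(x, v)]
    return x
```

Training would minimize `flow_matching_loss` over many data samples, and synthesis would call `euler_sample` starting from fresh noise. The number of integration steps is a free parameter here, not a value reported by the authors.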

Tasks

Speech Synthesis, Text-to-Speech
