Open Source German Distant Speech Recognition: Corpus and Acoustic Model

2015-12-11International Conference on Text, Speech, and Dialogue 2015Code Available0· sign in to hype

Stephan Radeck-Arneth, Benjamin Milde, Arvid Lange, Evandro Gouvea, Stefan Radomski, Max Mühlhäuser, and Chris Biemann

Code Available — Be the first to reproduce this paper.

Code

github.com/tudarmstadt-lt/kaldi-tuda-de
OfficialIn papernone★ 0

Abstract

We present a new freely available corpus for German distant speech recognition and report speaker-independent word error rate (WER) results for two open source speech recognizers trained on this corpus. The corpus has been recorded in a controlled environment with three different microphones at a distance of one meter. It comprises 180 different speakers with a total of 36 hours of audio recordings. We show recognition results with the open source toolkit Kaldi (20.5% WER) and PocketSphinx (39.6% WER) and make a complete open source solution for German distant speech recognition possible.

Tasks

Distant Speech Recognition speech-recognition Speech Recognition

Open Source German Distant Speech Recognition: Corpus and Acoustic Model

Code

Abstract

Tasks

Reproductions