The RWTH Aachen LVCSR system for IWSLT-2016 German Skype conversation recognition task

2016-12-01 · IWSLT 2016

Wilfried Michel, Zoltán Tüske, M. Ali Basha Shaik, Ralf Schlüter, Hermann Ney

Abstract

This paper describes the RWTH large vocabulary continuous speech recognition (LVCSR) systems developed for the IWSLT-2016 evaluation campaign. This evaluation campaign focuses on transcribing spontaneous speech from Skype recordings. State-of-the-art bidirectional long short-term memory (LSTM) and deep, multilingually boosted feed-forward neural network (FFNN) acoustic models are trained on narrowband and broadband features. An open vocabulary approach using subword units is also considered. LSTM and count-based full-word and hybrid backoff language modeling methods are used to model the morphological richness of the German language. All these approaches are combined using confusion network combination (CNC) to yield a competitive word error rate (WER).

Tasks

Language Modeling, Speech Recognition
