Mixed-Precision Training for NLP and Speech Recognition with OpenSeq2Seq

2018-05-25

Oleksii Kuchaiev, Boris Ginsburg, Igor Gitman, Vitaly Lavrukhin, Jason Li, Huyen Nguyen, Carl Case, Paulius Micikevicius

Code

github.com/NVIDIA/OpenSeq2Seq (official, linked in paper; TensorFlow)
github.com/rickyHong/OpenSeq2Seq-repl (TensorFlow)
github.com/FazedAI/OpenSeq2Seq (TensorFlow)

Abstract

We present OpenSeq2Seq, a TensorFlow-based toolkit for training sequence-to-sequence models that features distributed and mixed-precision training. Benchmarks on machine translation and speech recognition tasks show that models built with OpenSeq2Seq reach state-of-the-art performance in 1.5-3x less training time. OpenSeq2Seq currently provides building blocks for models that solve a wide range of tasks, including neural machine translation, automatic speech recognition, and speech synthesis.
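
The abstract names mixed-precision training but does not describe how it works. As a rough illustration only, the sketch below shows loss scaling, a technique commonly paired with float16 training. It is an assumption that this applies here, and nothing in it is OpenSeq2Seq's API: the function names, the loss scale of 1024, and the use of Python floats as stand-ins for float32 master weights are all hypothetical choices made for the example.

```python
import math
import struct


def to_fp16(value):
    """Round a Python float to the nearest IEEE 754 half-precision value."""
    try:
        return struct.unpack("<e", struct.pack("<e", value))[0]
    except OverflowError:
        # struct refuses values beyond the float16 range; model that as infinity.
        return math.copysign(math.inf, value)


def scaled_fp16_gradient(grad, loss_scale):
    """Simulate a float16 backward pass whose loss was multiplied by loss_scale."""
    return to_fp16(grad * loss_scale)


def update_master_weight(master_w, grad_fp16, lr, loss_scale):
    """Unscale the gradient at higher precision and update the master weight.

    Returns (new_weight, applied). The step is skipped when the scaled
    gradient overflowed float16, so the caller can lower the loss scale.
    """
    if not math.isfinite(grad_fp16):
        return master_w, False
    return master_w - lr * (grad_fp16 / loss_scale), True


# Hypothetical numbers: a gradient too small for float16 survives once scaled.
tiny_grad = 1e-8
unscaled = to_fp16(tiny_grad)                    # underflows to zero
scaled = scaled_fp16_gradient(tiny_grad, 1024.0)  # stays representable
new_w, applied = update_master_weight(0.0, scaled, lr=1.0, loss_scale=1024.0)
```

The design point is that the weight update happens at higher precision than the forward and backward passes, so small gradients that would vanish in float16 still move the weights.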

Tasks

Automatic Speech Recognition (ASR), Machine Translation, Speech Recognition, Speech Synthesis, Translation
