SOTAVerified

Snips Voice Platform: an embedded Spoken Language Understanding system for private-by-design voice interfaces

2018-05-25Code Available1· sign in to hype

Alice Coucke, Alaa Saade, Adrien Ball, Théodore Bluche, Alexandre Caulier, David Leroy, Clément Doumouro, Thibault Gisselbrecht, Francesco Caltagirone, Thibaut Lavril, Maël Primet, Joseph Dureau

Code Available — Be the first to reproduce this paper.

Reproduce

Code

Abstract

This paper presents the machine learning architecture of the Snips Voice Platform, a software solution to perform Spoken Language Understanding on microprocessors typical of IoT devices. The embedded inference is fast and accurate while enforcing privacy by design, as no personal user data is ever collected. Focusing on Automatic Speech Recognition and Natural Language Understanding, we detail our approach to training high-performance Machine Learning models that are small enough to run in real-time on small devices. Additionally, we describe a data generation procedure that provides sufficient, high-quality training data without compromising user privacy.

Tasks

Benchmark Results

DatasetModelMetricClaimedVerifiedStatus
LibriSpeech test-cleanSnipsWord Error Rate (WER)6.4Unverified
LibriSpeech test-otherSnipsWord Error Rate (WER)16.5Unverified

Reproductions