Vibravox: A Dataset of French Speech Captured with Body-conduction Audio Sensors
Julien Hauret, Malo Olivier, Thomas Joubaud, Christophe Langrenne, Sarah Poirée, Véronique Zimpfer, Éric Bavu
Code Available — Be the first to reproduce this paper.
ReproduceCode
- github.com/jhauret/vibravoxOfficialIn paperpytorch★ 48
Abstract
Vibravox is a dataset compliant with the General Data Protection Regulation (GDPR) containing audio recordings using five different body-conduction audio sensors: two in-ear microphones, two bone conduction vibration pickups, and a laryngophone. The dataset also includes audio data from an airborne microphone used as a reference. The Vibravox corpus contains 45 hours per sensor of speech samples and physiological sounds recorded by 188 participants under different acoustic conditions imposed by a high order ambisonics 3D spatializer. Annotations about the recording conditions and linguistic transcriptions are also included in the corpus. We conducted a series of experiments on various speech-related tasks, including speech recognition, speech enhancement, and speaker verification. These experiments were carried out using state-of-the-art models to evaluate and compare their performances on signals captured by the different audio sensors offered by the Vibravox dataset, with the aim of gaining a better grasp of their individual characteristics.
Tasks
Benchmark Results
| Dataset | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| VibraVox (forehead accelerometer) | ECAPA2 | Test EER | 0.01 | — | Unverified |
| VibraVox (headset microphone) | ECAPA2 | Test EER | 0 | — | Unverified |
| VibraVox (rigid in-ear microphone) | ECAPA2 | Test EER | 0.03 | — | Unverified |
| VibraVox (soft in-ear microphone) | ECAPA2 | Test EER | 0.02 | — | Unverified |
| VibraVox (temple vibration pickup) | ECAPA2 | Test EER | 0.08 | — | Unverified |
| VibraVox (throat microphone) | ECAPA2 | Test EER | 0.04 | — | Unverified |