A voice and speech corpus of patients who underwent upper airway surgery in pre- and post-operative states

2024-07-09Scientific Data 2024Code Available0· sign in to hype

Estefanía Hernández-García, Alejandro Guerrero-López, Julián D. Arias-Londoño, Juan I. Godino-Llorente

Code Available — Be the first to reproduce this paper.

Code

github.com/BYO-UPM/CUCO_Database
none★ 2

Abstract

Many research articles have explored the impact of surgical interventions on voice and speech evaluations, but advances are limited by the lack of publicly accessible datasets. To address this, a comprehensive corpus of 107 Spanish Castilian speakers was recorded, including control speakers and patients who underwent upper airway surgeries such as Tonsillectomy, Functional Endoscopic Sinus Surgery, and Septoplasty. The dataset contains 3,800 audio files, averaging 35.51 ± 5.91 recordings per patient. This resource enables systematic investigation of the effects of upper respiratory tract surgery on voice and speech. Previous studies using this corpus have shown no relevant changes in key acoustic parameters for sustained vowel phonation, consistent with initial hypotheses. However, the analysis of speech recordings, particularly nasalised segments, remains open for further research. Additionally, this dataset facilitates the study of the impact of upper airway surgery on speaker recognition and identification methods, and testing of anti-spoofing methodologies for improved robustness. Moreover, regardless of the specific application domain presented, the model introduced has potential downstream utility in assessing the manner of articulation, whether influenced by other medical conditions or certain dialectal variations.

Tasks

Articles Classification Speaker Recognition Speech Recognition

A voice and speech corpus of patients who underwent upper airway surgery in pre- and post-operative states

Code

Abstract

Tasks

Reproductions