Microphone Array Signal Processing and Deep Learning for Speech Enhancement

2025-01-13Unverified0· sign in to hype

Reinhold Haeb-Umbach, Tomohiro Nakatani, Marc Delcroix, Christoph Boeddeker, Tsubasa Ochiai

Unverified — Be the first to reproduce this paper.

Abstract

Multi-channel acoustic signal processing is a well-established and powerful tool to exploit the spatial diversity between a target signal and non-target or noise sources for signal enhancement. However, the textbook solutions for optimal data-dependent spatial filtering rest on the knowledge of second-order statistical moments of the signals, which have traditionally been difficult to acquire. In this contribution, we compare model-based, purely data-driven, and hybrid approaches to parameter estimation and filtering, where the latter tries to combine the benefits of model-based signal processing and data-driven deep learning to overcome their individual deficiencies. We illustrate the underlying design principles with examples from noise reduction, source separation, and dereverberation.

Tasks

Deep Learning Diversity parameter estimation Speech Enhancement

Microphone Array Signal Processing and Deep Learning for Speech Enhancement

Abstract

Tasks

Reproductions