DeepFilterNet: Perceptually Motivated Real-Time Speech Enhancement

2023-05-14

Hendrik Schröter, Tobias Rosenkranz, Alberto N. Escalante-B., Andreas Maier

Code

github.com/rikorose/deepfilternet
Official · In paper · PyTorch · ★ 3,975

Abstract

Multi-frame algorithms for single-channel speech enhancement are able to take advantage of short-time correlations within the speech signal. Deep Filtering (DF) was proposed to directly estimate a complex filter in the frequency domain to exploit these correlations. In this work, we present a real-time speech enhancement demo using DeepFilterNet. DeepFilterNet's efficiency is enabled by exploiting domain knowledge of speech production and psychoacoustic perception. Our model is able to match state-of-the-art speech enhancement benchmarks while achieving a real-time factor of 0.19 on a single-threaded notebook CPU. The framework and pretrained weights have been published under an open-source license.
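
The real-time factor quoted in the abstract is commonly defined as processing time divided by the duration of the audio processed, so a value below 1 means the model runs faster than real time. The sketch below illustrates that definition only. The `enhance` callable and both function names are hypothetical placeholders and are not part of the DeepFilterNet API.

```python
import time


def real_time_factor(processing_seconds: float, audio_seconds: float) -> float:
    """Return processing time divided by audio duration.

    Values below 1.0 mean the audio is processed faster than it plays back.
    """
    if audio_seconds <= 0:
        raise ValueError("audio_seconds must be positive")
    return processing_seconds / audio_seconds


def measure_rtf(enhance, samples, sample_rate: int) -> float:
    """Time a hypothetical `enhance(samples)` callable and return its RTF."""
    start = time.perf_counter()
    enhance(samples)  # placeholder for any enhancement function
    elapsed = time.perf_counter() - start
    return real_time_factor(elapsed, len(samples) / sample_rate)


# Illustrative numbers (not measurements): 1.9 s of compute for 10 s of audio
# corresponds to the real-time factor of 0.19 quoted in the abstract.
rtf = real_time_factor(1.9, 10.0)
```

Under this definition, a real-time factor of 0.19 means that one second of audio takes roughly 0.19 seconds to enhance on the stated hardware.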

Tasks

CPU Speech Enhancement

Benchmark Results

Dataset	Model	Metric	Claimed	Verified	Status
VoiceBank + DEMAND	DeepFilterNet3	PESQ (wb)	3.17	—	Unverified
