SOTAVerified

Speech Separation

The task of extracting all overlapping speech sources in a given mixed speech signal refers to the Speech Separation. Speech Separation is a special scenario of source separation problem, where the focus is only on the overlapping speech signal sources and other interferences such as music or noise signals are not the main concern of the study. A recent representative Github project can be referred to ClearerVoice-Studio.

Source: A Unified Framework for Speech Separation

Image credit: Speech Separation of A Target Speaker Based on Deep Neural Networks

Papers

Showing 110 of 359 papers

TitleStatusHype
Dynamic Slimmable Networks for Efficient Speech Separation0
Improving Practical Aspects of End-to-End Multi-Talker Speech Recognition for Online and Offline Scenarios0
SoloSpeech: Enhancing Intelligibility and Quality in Target Speech Extraction through a Cascaded Generative PipelineCode3
Attractor-Based Speech Separation of Multiple Utterances by Unknown Number of Speakers0
Single-Channel Target Speech Extraction Utilizing Distance and Room Clues0
Time-Frequency-Based Attention Cache Memory Model for Real-Time Speech Separation0
SepPrune: Structured Pruning for Efficient Deep Speech SeparationCode1
A Survey of Deep Learning for Complex Speech Spectrograms0
ArrayDPS: Unsupervised Blind Speech Separation with a Diffusion PriorCode1
SwinLip: An Efficient Visual Speech Encoder for Lip Reading Using Swin Transformer0
Show:102550
← PrevPage 1 of 36Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SepTDASI-SDRi23.7Unverified
2MossFormer2SI-SDRi22.2Unverified
3MossFormer (L) + DMSI-SDRi21.2Unverified
4Separate And DiffuseSI-SDRi20.9Unverified
5MossFormer (M) + DMSI-SDRi20.8Unverified
6SepItSI-SDRi20.1Unverified
7SepFormerSI-SDRi19.5Unverified
8SandglassetSI-SDRi17.1Unverified
9Gated DualPathRNNSI-SDRi16.85Unverified