SOTAVerified

Streaming Target Sound Extraction

This task is a variant of the Target Sound Extraction task, with the constraint of causal streaming inference. Aiming for an algorithmic latency of less than 20 ms, at each time step, streaming audio models operate on an input audio chunk of length less than 20 ms. The causal constraint means that the model only has the knowledge of past chunks and no future chunks.

Papers

Showing 11 of 1 papers

TitleStatusHype
Real-Time Target Sound ExtractionCode2
Show:102550

No leaderboard results yet.