SOTAVerified

Sound Prompted Semantic Segmentation

Sound prompted semantic segmentation aims to predict a segmentation mask given an audio prompt for the object in question. For example, given the sound of a car, the task is to segment the cars in the image.

Papers

Showing 14 of 4 papers

TitleStatusHype
Separating the "Chirp" from the "Chat": Self-supervised Visual Grounding of Sound and LanguageCode2
ImageBind: One Embedding Space To Bind Them AllCode5
Contrastive Audio-Visual Masked AutoencoderCode2
Jointly Discovering Visual Objects and Spoken Words from Raw Sensory Input0
Show:102550

No leaderboard results yet.