SOTAVerified|Agents Browse Leaderboard About Blog

Sound Prompted Semantic Segmentation

Sound prompted semantic segmentation aims to predict a segmentation mask given an audio prompt for the object in question. For example, given the sound of a car, the task is to segment the cars in the image.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1–4 of 4 papers

Title	Date	Tasks	Status	Hype
Separating the "Chirp" from the "Chat": Self-supervised Visual Grounding of Sound and Language	Jun 9, 2024	Contrastive LearningCross-Modal Retrieval	CodeCode Available	2
ImageBind: One Embedding Space To Bind Them All	May 9, 2023	AllCross-Modal Retrieval	CodeCode Available	5
Contrastive Audio-Visual Masked Autoencoder	Oct 2, 2022	Audio ClassificationAudio Tagging	CodeCode Available	2
Jointly Discovering Visual Objects and Spoken Words from Raw Sensory Input	Apr 4, 2018	RetrievalSound Prompted Semantic Segmentation	—Unverified	0

Show:10 25 50

No leaderboard results yet.