Speech Prompted Semantic Segmentation

Speech prompted semantic segmentation aims to predict the semantic segments of an image from audio of a speaker saying the class or segment name.
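To make the task definition concrete, here is a minimal sketch of one way such a prediction could be formed. It assumes per-pixel visual features and a single embedding of the spoken word already live in a shared space; that setup is an assumption for illustration, not a method stated on this page. The names `cosine`, `speech_prompted_mask`, and the `threshold` parameter are hypothetical, and the feature values are toy numbers rather than real model outputs.

```python
import math

def cosine(u, v):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm > 0 else 0.0

def speech_prompted_mask(pixel_features, speech_embedding, threshold=0.5):
    """Return a binary mask: 1 where a pixel's feature aligns with the speech embedding.

    pixel_features: H x W grid of feature vectors (assumed to come from an image encoder).
    speech_embedding: one vector for the spoken class name (assumed to come from an audio encoder).
    """
    return [
        [1 if cosine(feat, speech_embedding) >= threshold else 0 for feat in row]
        for row in pixel_features
    ]

# Toy 2x2 "image" with 2-D features (hypothetical values).
pixel_features = [
    [[1.0, 0.0], [0.9, 0.1]],
    [[0.0, 1.0], [0.1, 0.9]],
]
speech_embedding = [1.0, 0.0]  # stand-in for the encoded spoken word

mask = speech_prompted_mask(pixel_features, speech_embedding)
print(mask)
```

In this toy case the top row of pixels is selected because its features point in the same direction as the speech embedding. A real system would replace the hand-written vectors with encoder outputs and would likely tune or learn the threshold.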

Papers

Title	Date	Tasks	Status	Hype
Separating the "Chirp" from the "Chat": Self-supervised Visual Grounding of Sound and Language	Jun 9, 2024	Contrastive Learning, Cross-Modal Retrieval	Code Available	2
ImageBind: One Embedding Space To Bind Them All	May 9, 2023	All, Cross-Modal Retrieval	Code Available	5
Contrastive Audio-Visual Masked Autoencoder	Oct 2, 2022	Audio Classification, Audio Tagging	Code Available	2
Jointly Discovering Visual Objects and Spoken Words from Raw Sensory Input	Apr 4, 2018	Retrieval, Sound Prompted Semantic Segmentation	Unverified	0

No leaderboard results yet.