SOTAVerified

GZSL Video Classification

Audio-visual zero-shot learning aims to recognize unseen categories based on paired audio-visual sequences.

Papers

Showing 17 of 7 papers

TitleStatusHype
Temporal and cross-modal attention for audio-visual zero-shot learningCode1
Audio-visual Generalised Zero-shot Learning with Cross-modal Attention and LanguageCode1
Boosting Audio-visual Zero-shot Learning with Large Language ModelsCode0
Attribute Prototype Network for Any-Shot Learning0
Coordinated Joint Multimodal Embeddings for Generalized Audio-Visual Zeroshot Classification and Retrieval of Videos0
Hyperbolic Audio-visual Zero-shot Learning0
AVGZSLNet: Audio-Visual Generalized Zero-Shot Learning by Reconstructing Label Features from Multi-Modal Embeddings0
Show:102550

No leaderboard results yet.