SOTAVerified

audio-visual event localization

Papers

Showing 110 of 26 papers

TitleStatusHype
Audio-visual Event Localization on Portrait Mode Short Videos0
Audio-Visual Semantic Graph Network for Audio-Visual Event Localization0
Dense Audio-Visual Event Localization under Cross-Modal Consistency and Multi-Temporal Granularity CollaborationCode1
Pilot-guided Multimodal Semantic Communication for Audio-Visual Event Localization0
Towards Open-Vocabulary Audio-Visual Event LocalizationCode1
Multimodal Trustworthy Semantic Communication for Audio-Visual Event Localization0
CACE-Net: Co-guidance Attention and Contrastive Enhancement for Effective Audio-Visual Event LocalizationCode0
Label-anticipated Event Disentanglement for Audio-Visual Video Parsing0
Advancing Weakly-Supervised Audio-Visual Video Parsing via Segment-wise Pseudo Labeling0
UniAV: Unified Audio-Visual Perception for Multi-Task Video Event LocalizationCode1
Show:102550
← PrevPage 1 of 3Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1UnAV mAP47.8Unverified
2ActionFormer mAP42.2Unverified