SOTAVerified

audio-visual event localization

Papers

Showing 11-20 of 26 papers

| Title | Status | Hype |
| --- | --- | --- |
| Dual-modality seq2seq network for audio-visual event localization | Code | 1 |
| Pilot-guided Multimodal Semantic Communication for Audio-Visual Event Localization | | 0 |
| Temporal Label-Refinement for Weakly-Supervised Audio-Visual Event Localization | | 0 |
| Label-anticipated Event Disentanglement for Audio-Visual Video Parsing | | 0 |
| Advancing Weakly-Supervised Audio-Visual Video Parsing via Segment-wise Pseudo Labeling | | 0 |
| Audio-visual Event Localization on Portrait Mode Short Videos | | 0 |
| Audio-Visual Event Localization via Recursive Fusion by Joint Co-Attention | | 0 |
| Audio-Visual Semantic Graph Network for Audio-Visual Event Localization | | 0 |
| AVE-CLIP: AudioCLIP-based Multi-window Temporal Transformer for Audio Visual Event Localization | | 0 |
| Dual Attention Matching for Audio-Visual Event Localization | | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | UnAV | mAP | 47.8 | | Unverified |
| 2 | ActionFormer | mAP | 42.2 | | Unverified |