SOTAVerified

audio-visual event localization

Papers

Showing 125 of 26 papers

TitleStatusHype
ActionFormer: Localizing Moments of Actions with TransformersCode2
UniAV: Unified Audio-Visual Perception for Multi-Task Video Event LocalizationCode1
Audio-Visual Event Localization in Unconstrained VideosCode1
Towards Open-Vocabulary Audio-Visual Event LocalizationCode1
Positive Sample Propagation along the Audio-Visual Event LineCode1
Modality-Independent Teachers Meet Weakly-Supervised Audio-Visual Event ParserCode1
MM-Pyramid: Multimodal Pyramid Attentional Network for Audio-Visual Event Localization and Video ParsingCode1
Cross-Modal Background Suppression for Audio-Visual Event LocalizationCode1
Dense Audio-Visual Event Localization under Cross-Modal Consistency and Multi-Temporal Granularity CollaborationCode1
Dual-modality seq2seq network for audio-visual event localizationCode1
Dense-Localizing Audio-Visual Events in Untrimmed Videos: A Large-Scale Benchmark and BaselineCode1
CACE-Net: Co-guidance Attention and Contrastive Enhancement for Effective Audio-Visual Event LocalizationCode0
Leveraging the Video-level Semantic Consistency of Event for Audio-visual Event LocalizationCode0
Label-anticipated Event Disentanglement for Audio-Visual Video Parsing0
Advancing Weakly-Supervised Audio-Visual Video Parsing via Segment-wise Pseudo Labeling0
AVE-CLIP: AudioCLIP-based Multi-window Temporal Transformer for Audio Visual Event Localization0
Audio-Visual Semantic Graph Network for Audio-Visual Event Localization0
MPN: Multimodal Parallel Network for Audio-Visual Event Localization0
Multimodal Trustworthy Semantic Communication for Audio-Visual Event Localization0
Multi-Modulation Network for Audio-Visual Event Localization0
Past and Future Motion Guided Network for Audio Visual Event Localization0
Pilot-guided Multimodal Semantic Communication for Audio-Visual Event Localization0
Audio-Visual Event Localization via Recursive Fusion by Joint Co-Attention0
Temporal Label-Refinement for Weakly-Supervised Audio-Visual Event Localization0
Audio-visual Event Localization on Portrait Mode Short Videos0
Show:102550
← PrevPage 1 of 2Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1UnAV mAP47.8Unverified
2ActionFormer mAP42.2Unverified