| Audio-visual Event Localization on Portrait Mode Short Videos | Apr 9, 2025 | audio-visual event localizationScene Understanding | —Unverified | 0 |
| Audio-Visual Semantic Graph Network for Audio-Visual Event Localization | Jan 1, 2025 | audio-visual event localizationcross-modal alignment | —Unverified | 0 |
| Dense Audio-Visual Event Localization under Cross-Modal Consistency and Multi-Temporal Granularity Collaboration | Dec 17, 2024 | audio-visual event localizationaudio-visual learning | CodeCode Available | 1 |
| Pilot-guided Multimodal Semantic Communication for Audio-Visual Event Localization | Dec 9, 2024 | audio-visual event localizationAutonomous Driving | —Unverified | 0 |
| Towards Open-Vocabulary Audio-Visual Event Localization | Nov 18, 2024 | audio-visual event localization | CodeCode Available | 1 |
| Multimodal Trustworthy Semantic Communication for Audio-Visual Event Localization | Nov 4, 2024 | audio-visual event localizationSemantic Communication | —Unverified | 0 |
| CACE-Net: Co-guidance Attention and Contrastive Enhancement for Effective Audio-Visual Event Localization | Aug 4, 2024 | audio-visual event localization | CodeCode Available | 0 |
| Label-anticipated Event Disentanglement for Audio-Visual Video Parsing | Jul 11, 2024 | audio-visual event localizationDisentanglement | —Unverified | 0 |
| Advancing Weakly-Supervised Audio-Visual Video Parsing via Segment-wise Pseudo Labeling | Jun 3, 2024 | audio-visual event localizationDenoising | —Unverified | 0 |
| UniAV: Unified Audio-Visual Perception for Multi-Task Video Event Localization | Apr 4, 2024 | Action Localizationaudio-visual event localization | CodeCode Available | 1 |
| Temporal Label-Refinement for Weakly-Supervised Audio-Visual Event Localization | Jul 12, 2023 | audio-visual event localization | —Unverified | 0 |
| Modality-Independent Teachers Meet Weakly-Supervised Audio-Visual Event Parser | May 27, 2023 | audio-visual event localizationaudio-visual learning | CodeCode Available | 1 |
| Dense-Localizing Audio-Visual Events in Untrimmed Videos: A Large-Scale Benchmark and Baseline | Mar 22, 2023 | audio-visual event localization | CodeCode Available | 1 |
| AVE-CLIP: AudioCLIP-based Multi-window Temporal Transformer for Audio Visual Event Localization | Oct 11, 2022 | audio-visual event localization | —Unverified | 0 |
| Leveraging the Video-level Semantic Consistency of Event for Audio-visual Event Localization | Oct 11, 2022 | audio-visual event localization | CodeCode Available | 0 |
| Past and Future Motion Guided Network for Audio Visual Event Localization | May 8, 2022 | audio-visual event localization | —Unverified | 0 |
| ActionFormer: Localizing Moments of Actions with Transformers | Feb 16, 2022 | Action LocalizationAction Recognition | CodeCode Available | 2 |
| Cross-Modal Background Suppression for Audio-Visual Event Localization | Jan 1, 2022 | audio-visual event localization | CodeCode Available | 1 |
| MM-Pyramid: Multimodal Pyramid Attentional Network for Audio-Visual Event Localization and Video Parsing | Nov 24, 2021 | audio-visual event localizationVideo Understanding | CodeCode Available | 1 |
| Multi-Modulation Network for Audio-Visual Event Localization | Aug 26, 2021 | audio-visual event localization | —Unverified | 0 |
| MPN: Multimodal Parallel Network for Audio-Visual Event Localization | Apr 7, 2021 | audio-visual event localizationGeneral Classification | —Unverified | 0 |
| Positive Sample Propagation along the Audio-Visual Event Line | Apr 1, 2021 | audio-visual event localization | CodeCode Available | 1 |
| Audio-Visual Event Localization via Recursive Fusion by Joint Co-Attention | Aug 14, 2020 | audio-visual event localizationvalid | —Unverified | 0 |
| Dual Attention Matching for Audio-Visual Event Localization | Oct 1, 2019 | audio-visual event localization | —Unverified | 0 |
| Dual-modality seq2seq network for audio-visual event localization | Feb 20, 2019 | audio-visual event localization | CodeCode Available | 1 |