| Dense-Localizing Audio-Visual Events in Untrimmed Videos: A Large-Scale Benchmark and Baseline | Mar 22, 2023 | audio-visual event localization | CodeCode Available | 1 | 5 |
| CACE-Net: Co-guidance Attention and Contrastive Enhancement for Effective Audio-Visual Event Localization | Aug 4, 2024 | audio-visual event localization | CodeCode Available | 0 | 5 |
| Leveraging the Video-level Semantic Consistency of Event for Audio-visual Event Localization | Oct 11, 2022 | audio-visual event localization | CodeCode Available | 0 | 5 |
| Label-anticipated Event Disentanglement for Audio-Visual Video Parsing | Jul 11, 2024 | audio-visual event localizationDisentanglement | —Unverified | 0 | 0 |
| Advancing Weakly-Supervised Audio-Visual Video Parsing via Segment-wise Pseudo Labeling | Jun 3, 2024 | audio-visual event localizationDenoising | —Unverified | 0 | 0 |
| AVE-CLIP: AudioCLIP-based Multi-window Temporal Transformer for Audio Visual Event Localization | Oct 11, 2022 | audio-visual event localization | —Unverified | 0 | 0 |
| Audio-Visual Semantic Graph Network for Audio-Visual Event Localization | Jan 1, 2025 | audio-visual event localizationcross-modal alignment | —Unverified | 0 | 0 |
| MPN: Multimodal Parallel Network for Audio-Visual Event Localization | Apr 7, 2021 | audio-visual event localizationGeneral Classification | —Unverified | 0 | 0 |
| Multimodal Trustworthy Semantic Communication for Audio-Visual Event Localization | Nov 4, 2024 | audio-visual event localizationSemantic Communication | —Unverified | 0 | 0 |
| Multi-Modulation Network for Audio-Visual Event Localization | Aug 26, 2021 | audio-visual event localization | —Unverified | 0 | 0 |