| Temporal Label-Refinement for Weakly-Supervised Audio-Visual Event Localization | Jul 12, 2023 | audio-visual event localization | Unverified | 0 |
| Modality-Independent Teachers Meet Weakly-Supervised Audio-Visual Event Parser | May 27, 2023 | audio-visual event localization, audio-visual learning | Code Available | 1 |
| Dense-Localizing Audio-Visual Events in Untrimmed Videos: A Large-Scale Benchmark and Baseline | Mar 22, 2023 | audio-visual event localization | Code Available | 1 |
| AVE-CLIP: AudioCLIP-based Multi-window Temporal Transformer for Audio Visual Event Localization | Oct 11, 2022 | audio-visual event localization | Unverified | 0 |
| Leveraging the Video-level Semantic Consistency of Event for Audio-visual Event Localization | Oct 11, 2022 | audio-visual event localization | Code Available | 0 |
| Past and Future Motion Guided Network for Audio Visual Event Localization | May 8, 2022 | audio-visual event localization | Unverified | 0 |
| ActionFormer: Localizing Moments of Actions with Transformers | Feb 16, 2022 | Action Localization, Action Recognition | Code Available | 2 |
| Cross-Modal Background Suppression for Audio-Visual Event Localization | Jan 1, 2022 | audio-visual event localization | Code Available | 1 |
| MM-Pyramid: Multimodal Pyramid Attentional Network for Audio-Visual Event Localization and Video Parsing | Nov 24, 2021 | audio-visual event localization, Video Understanding | Code Available | 1 |
| Multi-Modulation Network for Audio-Visual Event Localization | Aug 26, 2021 | audio-visual event localization | Unverified | 0 |