| Dual-modality seq2seq network for audio-visual event localization | Feb 20, 2019 | audio-visual event localization | Code Available | 1 |
| Pilot-guided Multimodal Semantic Communication for Audio-Visual Event Localization | Dec 9, 2024 | audio-visual event localization, Autonomous Driving | — Unverified | 0 |
| Temporal Label-Refinement for Weakly-Supervised Audio-Visual Event Localization | Jul 12, 2023 | audio-visual event localization | — Unverified | 0 |
| Label-anticipated Event Disentanglement for Audio-Visual Video Parsing | Jul 11, 2024 | audio-visual event localization, Disentanglement | — Unverified | 0 |
| Advancing Weakly-Supervised Audio-Visual Video Parsing via Segment-wise Pseudo Labeling | Jun 3, 2024 | audio-visual event localization, Denoising | — Unverified | 0 |
| Audio-visual Event Localization on Portrait Mode Short Videos | Apr 9, 2025 | audio-visual event localization, Scene Understanding | — Unverified | 0 |
| Audio-Visual Event Localization via Recursive Fusion by Joint Co-Attention | Aug 14, 2020 | audio-visual event localization, valid | — Unverified | 0 |
| Audio-Visual Semantic Graph Network for Audio-Visual Event Localization | Jan 1, 2025 | audio-visual event localization, cross-modal alignment | — Unverified | 0 |
| AVE-CLIP: AudioCLIP-based Multi-window Temporal Transformer for Audio Visual Event Localization | Oct 11, 2022 | audio-visual event localization | — Unverified | 0 |
| Dual Attention Matching for Audio-Visual Event Localization | Oct 1, 2019 | audio-visual event localization | — Unverified | 0 |