| Audio-visual Event Localization on Portrait Mode Short Videos | Apr 9, 2025 | audio-visual event localization, Scene Understanding | Unverified | 0 |
| Audio-Visual Semantic Graph Network for Audio-Visual Event Localization | Jan 1, 2025 | audio-visual event localization, cross-modal alignment | Unverified | 0 |
| Dense Audio-Visual Event Localization under Cross-Modal Consistency and Multi-Temporal Granularity Collaboration | Dec 17, 2024 | audio-visual event localization, audio-visual learning | Code Available | 1 |
| Pilot-guided Multimodal Semantic Communication for Audio-Visual Event Localization | Dec 9, 2024 | audio-visual event localization, Autonomous Driving | Unverified | 0 |
| Towards Open-Vocabulary Audio-Visual Event Localization | Nov 18, 2024 | audio-visual event localization | Code Available | 1 |
| Multimodal Trustworthy Semantic Communication for Audio-Visual Event Localization | Nov 4, 2024 | audio-visual event localization, Semantic Communication | Unverified | 0 |
| CACE-Net: Co-guidance Attention and Contrastive Enhancement for Effective Audio-Visual Event Localization | Aug 4, 2024 | audio-visual event localization | Code Available | 0 |
| Label-anticipated Event Disentanglement for Audio-Visual Video Parsing | Jul 11, 2024 | audio-visual event localization, Disentanglement | Unverified | 0 |
| Advancing Weakly-Supervised Audio-Visual Video Parsing via Segment-wise Pseudo Labeling | Jun 3, 2024 | audio-visual event localization, Denoising | Unverified | 0 |
| UniAV: Unified Audio-Visual Perception for Multi-Task Video Event Localization | Apr 4, 2024 | Action Localization, audio-visual event localization | Code Available | 1 |