| Chaotic World: A Large and Challenging Benchmark for Human Behavior Understanding in Chaotic Events | Jan 1, 2023 | Action LocalizationPathfinder | CodeCode Available | 1 | 5 |
| Hear The Flow: Optical Flow-Based Self-Supervised Visual Sound Source Localization | Nov 6, 2022 | Optical Flow EstimationSound Source Localization | CodeCode Available | 1 | 5 |
| Audio-Visual Grouping Network for Sound Localization from Mixtures | Mar 29, 2023 | Object LocalizationSound Source Localization | CodeCode Available | 1 | 5 |
| Aligning Sight and Sound: Advanced Sound Source Localization Through Audio-Visual Alignment | Jul 18, 2024 | cross-modal alignmentCross-Modal Retrieval | CodeCode Available | 1 | 5 |
| Audio-Visual Instance Segmentation | Oct 28, 2023 | Instance SegmentationSegmentation | CodeCode Available | 1 | 5 |
| Joint Learning of Visual-Audio Saliency Prediction and Sound Source Localization on Multi-face Videos | Nov 5, 2021 | PredictionSaliency Prediction | CodeCode Available | 1 | 5 |
| Deep Neural Networks for Multiple Speaker Detection and Localization | Nov 30, 2017 | Sound Source Localization | CodeCode Available | 0 | 5 |
| Audio-Visual Scene Analysis with Self-Supervised Multisensory Features | Apr 10, 2018 | Action RecognitionAudio Source Separation | CodeCode Available | 0 | 5 |
| Learning to Localize Sound Sources in Visual Scenes: Analysis and Applications | Nov 20, 2019 | Sound Source Localization | CodeCode Available | 0 | 5 |
| Iterative Sound Source Localization for Unknown Number of Sources | Jun 24, 2022 | Sound Source Localization | CodeCode Available | 0 | 5 |