| Deep Learning Based Audio-Visual Multi-Speaker DOA Estimation Using Permutation-Free Loss Function | Oct 26, 2022 | Active Speaker Detection, Sound Source Localization | —Unverified | 0 |
| Detection and Analysis of Content Creator Collaborations in YouTube Videos using Face- and Speaker-Recognition | Jul 5, 2018 | Active Speaker Detection, Face Recognition | —Unverified | 0 |
| Egocentric Deep Multi-Channel Audio-Visual Active Speaker Localization | Jan 6, 2022 | Action Detection, Active Speaker Detection | —Unverified | 0 |
| End-To-End Audiovisual Feature Fusion for Active Speaker Detection | Jul 27, 2022 | Active Speaker Detection | —Unverified | 0 |
| Enhancing Real-World Active Speaker Detection with Multi-Modal Extraction Pre-Training | Apr 1, 2024 | Active Speaker Detection, Audio-Visual Active Speaker Detection | —Unverified | 0 |
| FaVoA: Face-Voice Association Favours Ambiguous Speaker Detection | Sep 1, 2021 | Active Speaker Detection | —Unverified | 0 |
| How to Squeeze An Explanation Out of Your Model | Dec 6, 2024 | Active Speaker Detection | —Unverified | 0 |
| ICTCAS-UCAS-TAL Submission to the AVA-ActiveSpeaker Task at ActivityNet Challenge 2021 | Jun 1, 2021 | Active Speaker Detection, Audio-Visual Active Speaker Detection | —Unverified | 0 |
| Intel Labs at Ego4D Challenge 2022: A Better Baseline for Audio-Visual Diarization | Oct 14, 2022 | Action Detection, Active Speaker Detection | —Unverified | 0 |
| Learning Spatial Features from Audio-Visual Correspondence in Egocentric Videos | Jul 10, 2023 | Active Speaker Detection, Audio Denoising | —Unverified | 0 |