| AnnoTheia: A Semi-Automatic Annotation Toolkit for Audio-Visual Speech Technologies | Feb 20, 2024 | Active Speaker Detection | CodeCode Available | 1 |
| A Real-Time Active Speaker Detection System Integrating an Audio-Visual Signal with a Spatial Querying Mechanism | Sep 15, 2023 | Active Speaker DetectionEdge-computing | —Unverified | 0 |
| Best of Both Worlds: Multi-task Audio-Visual Automatic Speech Recognition and Active Speaker Detection | May 10, 2022 | Active Speaker DetectionAutomatic Speech Recognition | —Unverified | 0 |
| Active Speaker Detection as a Multi-Objective Optimization with Uncertainty-based Multimodal Fusion | Jun 7, 2021 | Active Speaker DetectionAudio-Visual Active Speaker Detection | —Unverified | 0 |
| FaVoA: Face-Voice Association Favours Ambiguous Speaker Detection | Sep 1, 2021 | Active Speaker Detection | —Unverified | 0 |
| Enhancing Real-World Active Speaker Detection with Multi-Modal Extraction Pre-Training | Apr 1, 2024 | Active Speaker DetectionAudio-Visual Active Speaker Detection | —Unverified | 0 |
| Audio-Visual Talker Localization in Video for Spatial Sound Reproduction | Jun 1, 2024 | Active Speaker Detection | —Unverified | 0 |
| How to Squeeze An Explanation Out of Your Model | Dec 6, 2024 | Active Speaker Detection | —Unverified | 0 |
| ICTCAS-UCAS-TAL Submission to the AVA-ActiveSpeaker Task at ActivityNet Challenge 2021 | Jun 1, 2021 | Active Speaker DetectionAudio-Visual Active Speaker Detection | —Unverified | 0 |
| End-To-End Audiovisual Feature Fusion for Active Speaker Detection | Jul 27, 2022 | Active Speaker Detection | —Unverified | 0 |