| Audio-Visual Talker Localization in Video for Spatial Sound Reproduction | Jun 1, 2024 | Active Speaker Detection | —Unverified | 0 |
| Enhancing Real-World Active Speaker Detection with Multi-Modal Extraction Pre-Training | Apr 1, 2024 | Active Speaker DetectionAudio-Visual Active Speaker Detection | —Unverified | 0 |
| Robust Active Speaker Detection in Noisy Environments | Mar 27, 2024 | Active Speaker DetectionSpeech Separation | —Unverified | 0 |
| AnnoTheia: A Semi-Automatic Annotation Toolkit for Audio-Visual Speech Technologies | Feb 20, 2024 | Active Speaker Detection | CodeCode Available | 1 |
| Leveraging Visual Supervision for Array-based Active Speaker Detection and Localization | Dec 21, 2023 | Active Speaker DetectionSelf-Supervised Learning | CodeCode Available | 0 |
| GestSync: Determining who is speaking without a talking head | Oct 8, 2023 | Active Speaker DetectionGesture Synchronization | CodeCode Available | 1 |
| Audio-visual child-adult speaker classification in dyadic interactions | Oct 3, 2023 | Active Speaker DetectionClassification | —Unverified | 0 |
| TalkNCE: Improving Active Speaker Detection with Talk-Aware Contrastive Learning | Sep 21, 2023 | Active Speaker DetectionAudio-Visual Active Speaker Detection | CodeCode Available | 1 |
| A Real-Time Active Speaker Detection System Integrating an Audio-Visual Signal with a Spatial Querying Mechanism | Sep 15, 2023 | Active Speaker DetectionEdge-computing | —Unverified | 0 |
| Audio Inputs for Active Speaker Detection and Localization via Microphone Array | Jul 27, 2023 | Active Speaker Detection | —Unverified | 0 |