| CoGenAV: Versatile Audio-Visual Representation Learning via Contrastive-Generative Synchronization | May 6, 2025 | Active Speaker Detection, Audio-Visual Speech Recognition | Code Available | 2 |
| UniTalk: Towards Universal Active Speaker Detection in Real World Scenarios | May 28, 2025 | Active Speaker Detection | Code Available | 1 |
| LASER: Lip Landmark Assisted Speaker Detection for Robustness | Jan 21, 2025 | Active Speaker Detection, Audio-Visual Active Speaker Detection | Code Available | 1 |
| AnnoTheia: A Semi-Automatic Annotation Toolkit for Audio-Visual Speech Technologies | Feb 20, 2024 | Active Speaker Detection | Code Available | 1 |
| GestSync: Determining who is speaking without a talking head | Oct 8, 2023 | Active Speaker Detection, Gesture Synchronization | Code Available | 1 |
| TalkNCE: Improving Active Speaker Detection with Talk-Aware Contrastive Learning | Sep 21, 2023 | Active Speaker Detection, Audio-Visual Active Speaker Detection | Code Available | 1 |
| Target Active Speaker Detection with Audio-visual Cues | May 22, 2023 | Active Speaker Detection, Audio-Visual Synchronization | Code Available | 1 |
| WASD: A Wilder Active Speaker Detection Dataset | Mar 9, 2023 | Active Speaker Detection | Code Available | 1 |
| A Light Weight Model for Active Speaker Detection | Mar 8, 2023 | Active Speaker Detection, Audio-Visual Active Speaker Detection | Code Available | 1 |
| LoCoNet: Long-Short Context Network for Active Speaker Detection | Jan 19, 2023 | Active Speaker Detection, Audio-Visual Active Speaker Detection | Code Available | 1 |