| UniTalk: Towards Universal Active Speaker Detection in Real World Scenarios | May 28, 2025 | Active Speaker Detection | Code Available | 1 |
| CoGenAV: Versatile Audio-Visual Representation Learning via Contrastive-Generative Synchronization | May 6, 2025 | Active Speaker Detection, Audio-Visual Speech Recognition | Code Available | 2 |
| Understanding Co-speech Gestures in-the-wild | Mar 28, 2025 | Active Speaker Detection | —Unverified | 0 |
| LASER: Lip Landmark Assisted Speaker Detection for Robustness | Jan 21, 2025 | Active Speaker Detection, Audio-Visual Active Speaker Detection | Code Available | 1 |
| ASDnB: Merging Face with Body Cues For Robust Active Speaker Detection | Dec 11, 2024 | Active Speaker Detection, Feature Importance | Code Available | 0 |
| How to Squeeze An Explanation Out of Your Model | Dec 6, 2024 | Active Speaker Detection | —Unverified | 0 |
| BIAS: A Body-based Interpretable Active Speaker Approach | Dec 6, 2024 | Active Speaker Detection, Feature Importance | Code Available | 0 |
| FabuLight-ASD: Unveiling Speech Activity via Body Language | Nov 20, 2024 | Active Speaker Detection | Code Available | 0 |
| An Efficient and Streaming Audio Visual Active Speaker Detection System | Sep 13, 2024 | Active Speaker Detection, Audio-Visual Active Speaker Detection | —Unverified | 0 |
| Imitation of human motion achieves natural head movements for humanoid robots in an active-speaker detection task | Jul 16, 2024 | Active Speaker Detection | Code Available | 0 |