| Robust Active Speaker Detection in Noisy Environments | Mar 27, 2024 | Active Speaker DetectionSpeech Separation | —Unverified | 0 |
| Leveraging Visual Supervision for Array-based Active Speaker Detection and Localization | Dec 21, 2023 | Active Speaker DetectionSelf-Supervised Learning | CodeCode Available | 0 |
| Audio-visual child-adult speaker classification in dyadic interactions | Oct 3, 2023 | Active Speaker DetectionClassification | —Unverified | 0 |
| A Real-Time Active Speaker Detection System Integrating an Audio-Visual Signal with a Spatial Querying Mechanism | Sep 15, 2023 | Active Speaker DetectionEdge-computing | —Unverified | 0 |
| Audio Inputs for Active Speaker Detection and Localization via Microphone Array | Jul 27, 2023 | Active Speaker Detection | —Unverified | 0 |
| Learning Spatial Features from Audio-Visual Correspondence in Egocentric Videos | Jul 10, 2023 | Active Speaker DetectionAudio Denoising | —Unverified | 0 |
| Whose Emotion Matters? Speaking Activity Localisation without Prior Knowledge | Nov 23, 2022 | Active Speaker DetectionAutomatic Speech Recognition | CodeCode Available | 0 |
| Deep Learning Based Audio-Visual Multi-Speaker DOA Estimation Using Permutation-Free Loss Function | Oct 26, 2022 | Active Speaker DetectionSound Source Localization | —Unverified | 0 |
| Intel Labs at Ego4D Challenge 2022: A Better Baseline for Audio-Visual Diarization | Oct 14, 2022 | Action DetectionActive Speaker Detection | —Unverified | 0 |
| Push-Pull: Characterizing the Adversarial Robustness for Audio-Visual Active Speaker Detection | Oct 3, 2022 | Active Speaker DetectionAdversarial Robustness | —Unverified | 0 |