| Learning Spatial Features from Audio-Visual Correspondence in Egocentric Videos | Jul 10, 2023 | Active Speaker DetectionAudio Denoising | —Unverified | 0 |
| Target Active Speaker Detection with Audio-visual Cues | May 22, 2023 | Active Speaker DetectionAudio-Visual Synchronization | CodeCode Available | 1 |
| WASD: A Wilder Active Speaker Detection Dataset | Mar 9, 2023 | Active Speaker Detection | CodeCode Available | 1 |
| A Light Weight Model for Active Speaker Detection | Mar 8, 2023 | Active Speaker DetectionAudio-Visual Active Speaker Detection | CodeCode Available | 1 |
| LoCoNet: Long-Short Context Network for Active Speaker Detection | Jan 19, 2023 | Active Speaker DetectionAudio-Visual Active Speaker Detection | CodeCode Available | 1 |
| Audio-Visual Activity Guided Cross-Modal Identity Association for Active Speaker Detection | Dec 1, 2022 | Active Speaker DetectionAudio-Visual Active Speaker Detection | CodeCode Available | 1 |
| Whose Emotion Matters? Speaking Activity Localisation without Prior Knowledge | Nov 23, 2022 | Active Speaker DetectionAutomatic Speech Recognition | CodeCode Available | 0 |
| Deep Learning Based Audio-Visual Multi-Speaker DOA Estimation Using Permutation-Free Loss Function | Oct 26, 2022 | Active Speaker DetectionSound Source Localization | —Unverified | 0 |
| Intel Labs at Ego4D Challenge 2022: A Better Baseline for Audio-Visual Diarization | Oct 14, 2022 | Action DetectionActive Speaker Detection | —Unverified | 0 |
| Push-Pull: Characterizing the Adversarial Robustness for Audio-Visual Active Speaker Detection | Oct 3, 2022 | Active Speaker DetectionAdversarial Robustness | —Unverified | 0 |