| Audio-Visual Activity Guided Cross-Modal Identity Association for Active Speaker Detection | Dec 1, 2022 | Active Speaker Detection, Audio-Visual Active Speaker Detection | Code Available | 1 |
| Unsupervised active speaker detection in media content using cross-modal information | Sep 24, 2022 | Active Speaker Detection | Code Available | 1 |
| Learning Long-Term Spatial-Temporal Graphs for Active Speaker Detection | Jul 15, 2022 | Active Speaker Detection, Audio-Visual Active Speaker Detection | Code Available | 1 |
| Look&Listen: Multi-Modal Correlation Learning for Active Speaker Detection and Speech Enhancement | Mar 4, 2022 | Active Speaker Detection, Multi-Task Learning | Code Available | 1 |
| Look Who's Talking: Active Speaker Detection in the Wild | Aug 17, 2021 | Active Speaker Detection | Code Available | 1 |
| Is Someone Speaking? Exploring Long-term Temporal Features for Audio-visual Active Speaker Detection | Jul 14, 2021 | Active Speaker Detection, Audio-Visual Active Speaker Detection | Code Available | 1 |
| How to Design a Three-Stage Architecture for Audio-Visual Active Speaker Detection in the Wild | Jun 7, 2021 | Active Speaker Detection, Audio-Visual Active Speaker Detection | Code Available | 1 |
| NUS-HLT Report for ActivityNet Challenge 2021 AVA (Speaker) | Jun 1, 2021 | Active Speaker Detection, Audio-Visual Active Speaker Detection | Code Available | 1 |
| Self-Supervised Learning of Audio-Visual Objects from Video | Aug 10, 2020 | Active Speaker Detection, Face Detection | Code Available | 1 |
| Active Speakers in Context | May 20, 2020 | Active Speaker Detection, Audio-Visual Active Speaker Detection | Code Available | 1 |