SOTAVerified

Audio-Visual Active Speaker Detection

Determine if and when each visible person in the video is speaking.

Papers

Showing 110 of 25 papers

TitleStatusHype
LASER: Lip Landmark Assisted Speaker Detection for RobustnessCode1
An Efficient and Streaming Audio Visual Active Speaker Detection System0
Enhancing Real-World Active Speaker Detection with Multi-Modal Extraction Pre-Training0
TalkNCE: Improving Active Speaker Detection with Talk-Aware Contrastive LearningCode1
A Light Weight Model for Active Speaker DetectionCode1
LoCoNet: Long-Short Context Network for Active Speaker DetectionCode1
Audio-Visual Activity Guided Cross-Modal Identity Association for Active Speaker DetectionCode1
Push-Pull: Characterizing the Adversarial Robustness for Audio-Visual Active Speaker Detection0
Learning Long-Term Spatial-Temporal Graphs for Active Speaker DetectionCode1
UniCon+: ICTCAS-UCAS Submission to the AVA-ActiveSpeaker Task at ActivityNet Challenge 20220
Show:102550
← PrevPage 1 of 3Next →

No leaderboard results yet.