SOTAVerified

Active Speaker Detection

Papers

Showing 1-25 of 63 papers

Title | Status | Hype
CoGenAV: Versatile Audio-Visual Representation Learning via Contrastive-Generative Synchronization | Code | 2
UniTalk: Towards Universal Active Speaker Detection in Real World Scenarios | Code | 1
LASER: Lip Landmark Assisted Speaker Detection for Robustness | Code | 1
AnnoTheia: A Semi-Automatic Annotation Toolkit for Audio-Visual Speech Technologies | Code | 1
GestSync: Determining who is speaking without a talking head | Code | 1
TalkNCE: Improving Active Speaker Detection with Talk-Aware Contrastive Learning | Code | 1
Target Active Speaker Detection with Audio-visual Cues | Code | 1
WASD: A Wilder Active Speaker Detection Dataset | Code | 1
A Light Weight Model for Active Speaker Detection | Code | 1
LoCoNet: Long-Short Context Network for Active Speaker Detection | Code | 1
Audio-Visual Activity Guided Cross-Modal Identity Association for Active Speaker Detection | Code | 1
Unsupervised active speaker detection in media content using cross-modal information | Code | 1
Learning Long-Term Spatial-Temporal Graphs for Active Speaker Detection | Code | 1
Look&Listen: Multi-Modal Correlation Learning for Active Speaker Detection and Speech Enhancement | Code | 1
Look Who's Talking: Active Speaker Detection in the Wild | Code | 1
Is Someone Speaking? Exploring Long-term Temporal Features for Audio-visual Active Speaker Detection | Code | 1
How to Design a Three-Stage Architecture for Audio-Visual Active Speaker Detection in the Wild | Code | 1
NUS-HLT Report for ActivityNet Challenge 2021 AVA (Speaker) | Code | 1
Self-Supervised Learning of Audio-Visual Objects from Video | Code | 1
Active Speakers in Context | Code | 1
AVA-ActiveSpeaker: An Audio-Visual Dataset for Active Speaker Detection | Code | 1
Understanding Co-speech Gestures in-the-wild | - | 0
ASDnB: Merging Face with Body Cues For Robust Active Speaker Detection | Code | 0
BIAS: A Body-based Interpretable Active Speaker Approach | Code | 0
How to Squeeze An Explanation Out of Your Model | - | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | GestSync | Accuracy | 87 | - | Unverified