SOTAVerified

Active Speaker Detection

Papers

Showing 150 of 63 papers

TitleStatusHype
CoGenAV: Versatile Audio-Visual Representation Learning via Contrastive-Generative SynchronizationCode2
Look Who's Talking: Active Speaker Detection in the WildCode1
AnnoTheia: A Semi-Automatic Annotation Toolkit for Audio-Visual Speech TechnologiesCode1
UniTalk: Towards Universal Active Speaker Detection in Real World ScenariosCode1
Unsupervised active speaker detection in media content using cross-modal informationCode1
Is Someone Speaking? Exploring Long-term Temporal Features for Audio-visual Active Speaker DetectionCode1
WASD: A Wilder Active Speaker Detection DatasetCode1
LASER: Lip Landmark Assisted Speaker Detection for RobustnessCode1
A Light Weight Model for Active Speaker DetectionCode1
Learning Long-Term Spatial-Temporal Graphs for Active Speaker DetectionCode1
AVA-ActiveSpeaker: An Audio-Visual Dataset for Active Speaker DetectionCode1
Active Speakers in ContextCode1
NUS-HLT Report for ActivityNet Challenge 2021 AVA (Speaker)Code1
Audio-Visual Activity Guided Cross-Modal Identity Association for Active Speaker DetectionCode1
LoCoNet: Long-Short Context Network for Active Speaker DetectionCode1
GestSync: Determining who is speaking without a talking headCode1
Self-Supervised Learning of Audio-Visual Objects from VideoCode1
How to Design a Three-Stage Architecture for Audio-Visual Active Speaker Detection in the WildCode1
Look\&Listen: Multi-Modal Correlation Learning for Active Speaker Detection and Speech EnhancementCode1
Target Active Speaker Detection with Audio-visual CuesCode1
TalkNCE: Improving Active Speaker Detection with Talk-Aware Contrastive LearningCode1
Whose Emotion Matters? Speaking Activity Localisation without Prior KnowledgeCode0
Leveraging Visual Supervision for Array-based Active Speaker Detection and LocalizationCode0
ASDnB: Merging Face with Body Cues For Robust Active Speaker DetectionCode0
MAAS: Multi-modal Assignation for Active Speaker DetectionCode0
End-to-End Active Speaker DetectionCode0
FabuLight-ASD: Unveiling Speech Activity via Body LanguageCode0
BIAS: A Body-based Interpretable Active Speaker ApproachCode0
Bio-Inspired Modality Fusion for Active Speaker DetectionCode0
Imitation of human motion achieves natural head movements for humanoid robots in an active-speaker detection taskCode0
UniCon+: ICTCAS-UCAS Submission to the AVA-ActiveSpeaker Task at ActivityNet Challenge 20220
UniCon: Unified Context Network for Robust Active Speaker Detection0
Visually Supervised Speaker Detection and Localization via Microphone Array0
Understanding Co-speech Gestures in-the-wild0
An Efficient and Streaming Audio Visual Active Speaker Detection System0
A Real-Time Active Speaker Detection System Integrating an Audio-Visual Signal with a Spatial Querying Mechanism0
Audio Inputs for Active Speaker Detection and Localization via Microphone Array0
Audio-video fusion strategies for active speaker detection in meetings0
Audio-visual child-adult speaker classification in dyadic interactions0
Audio-Visual Talker Localization in Video for Spatial Sound Reproduction0
Best of Both Worlds: Multi-task Audio-Visual Automatic Speech Recognition and Active Speaker Detection0
Cross-modal Supervision for Learning Active Speaker Detection in Video0
Deep Learning Based Audio-Visual Multi-Speaker DOA Estimation Using Permutation-Free Loss Function0
Detection and Analysis of Content Creator Collaborations in YouTube Videos using Face- and Speaker-Recognition0
Egocentric Deep Multi-Channel Audio-Visual Active Speaker Localization0
End-To-End Audiovisual Feature Fusion for Active Speaker Detection0
Enhancing Real-World Active Speaker Detection with Multi-Modal Extraction Pre-Training0
FaVoA: Face-Voice Association Favours Ambiguous Speaker Detection0
How to Squeeze An Explanation Out of Your Model0
ICTCAS-UCAS-TAL Submission to the AVA-ActiveSpeaker Task at ActivityNet Challenge 20210
Show:102550
← PrevPage 1 of 2Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GestSyncAccuracy87Unverified