SOTAVerified

Activity Detection

Detecting activities in extended videos.

Papers

Showing 326350 of 380 papers

TitleStatusHype
Target-speaker Voice Activity Detection with Improved I-Vector Estimation for Unknown Number of Speaker0
Target Speaker Voice Activity Detection with Transformers and Its Integration with End-to-End Neural Diarization0
TCG CREST System Description for the Second DISPLACE Challenge0
Spatio-Temporal Event Segmentation and Localization for Wildlife Extended Videos0
Temporarily-Aware Context Modelling using Generative Adversarial Networks for Speech Activity Detection0
Tensor vs Matrix Methods: Robust Tensor Decomposition under Block Sparse Perturbations0
The AFRL IWSLT 2020 Systems: Work-From-Home Edition0
The CUHK-TENCENT speaker diarization system for the ICASSP 2022 multi-channel multi-party meeting transcription challenge0
The DKU-DukeECE Diarization System for the VoxCeleb Speaker Recognition Challenge 20220
The DKU-DukeECE-Lenovo System for the Diarization Task of the 2021 VoxCeleb Speaker Recognition Challenge0
The DKU-MSXF Diarization System for the VoxCeleb Speaker Recognition Challenge 20230
The HUAWEI Speaker Diarisation System for the VoxCeleb Speaker Diarisation Challenge0
The Impact of Silence on Speech Anti-Spoofing0
The JHU Multi-Microphone Multi-Speaker ASR System for the CHiME-6 Challenge0
The Kriston AI System for the VoxCeleb Speaker Recognition Challenge 20220
The Newsbridge -Telecom SudParis VoxCeleb Speaker Recognition Challenge 2022 System Description0
The RATS Collection: Supporting HLT Research with Degraded Audio Data0
Long-term Conversation Analysis: Exploring Utility and PrivacyCode0
rVAD: An Unsupervised Segment-Based Robust Voice Activity Detection MethodCode0
S3D: Single Shot multi-Span Detector via Fully 3D Convolutional NetworksCode0
TokenVerse: Towards Unifying Speech and NLP Tasks via Transducer-based ASRCode0
Personalized Activity Recognition with Deep Triplet EmbeddingsCode0
A Convolutional Neural Network Smartphone App for Real-Time Voice Activity DetectionCode0
Personal VAD: Speaker-Conditioned Voice Activity DetectionCode0
Automating Feedback Analysis in Surgical Training: Detection, Categorization, and AssessmentCode0
Show:102550
← PrevPage 14 of 16Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1CNN-BiLSTM_bestROC-AUC95.14Unverified
2CNN-BiLSTM_smallROC-AUC95.13Unverified
3SG-VAD (ours)ROC-AUC94.3Unverified
4ADA-VADROC-AUC79.1Unverified