SOTAVerified

Activity Detection

Detecting activities in extended videos.

Papers

Showing 251275 of 380 papers

TitleStatusHype
Tackling the Cocktail Fork Problem for Separation and Transcription of Real-World Soundtracks0
Tandem Multitask Training of Speaker Diarisation and Speech Recognition for Meeting Transcription0
Target-Speaker Voice Activity Detection: a Novel Approach for Multi-Speaker Diarization in a Dinner Party Scenario0
Target-Speaker Voice Activity Detection via Sequence-to-Sequence Prediction0
Target-speaker Voice Activity Detection with Improved I-Vector Estimation for Unknown Number of Speaker0
Target Speaker Voice Activity Detection with Transformers and Its Integration with End-to-End Neural Diarization0
TCG CREST System Description for the Second DISPLACE Challenge0
Spatio-Temporal Event Segmentation and Localization for Wildlife Extended Videos0
Temporarily-Aware Context Modelling using Generative Adversarial Networks for Speech Activity Detection0
Tensor vs Matrix Methods: Robust Tensor Decomposition under Block Sparse Perturbations0
The AFRL IWSLT 2020 Systems: Work-From-Home Edition0
The CUHK-TENCENT speaker diarization system for the ICASSP 2022 multi-channel multi-party meeting transcription challenge0
The DKU-DukeECE Diarization System for the VoxCeleb Speaker Recognition Challenge 20220
The DKU-DukeECE-Lenovo System for the Diarization Task of the 2021 VoxCeleb Speaker Recognition Challenge0
The DKU-MSXF Diarization System for the VoxCeleb Speaker Recognition Challenge 20230
The HUAWEI Speaker Diarisation System for the VoxCeleb Speaker Diarisation Challenge0
The Impact of Silence on Speech Anti-Spoofing0
The JHU Multi-Microphone Multi-Speaker ASR System for the CHiME-6 Challenge0
The Kriston AI System for the VoxCeleb Speaker Recognition Challenge 20220
The Newsbridge -Telecom SudParis VoxCeleb Speaker Recognition Challenge 2022 System Description0
The RATS Collection: Supporting HLT Research with Degraded Audio Data0
The SAFE-T Corpus: A New Resource for Simulated Public Safety Communications0
The "Sound of Silence" in EEG -- Cognitive voice activity detection0
The Speed Submission to DIHARD II: Contributions & Lessons Learned0
The USTC-Ximalaya system for the ICASSP 2022 multi-channel multi-party meeting transcription (M2MeT) challenge0
Show:102550
← PrevPage 11 of 16Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1CNN-BiLSTM_bestROC-AUC95.14Unverified
2CNN-BiLSTM_smallROC-AUC95.13Unverified
3SG-VAD (ours)ROC-AUC94.3Unverified
4ADA-VADROC-AUC79.1Unverified