SOTAVerified

Activity Detection

Detecting activities in extended videos.

Papers

Showing 251–300 of 380 papers

Title | Status | Hype
Signed Latent Factors for Spamming Activity Detection | | 0
Similarity R-C3D for Few-shot Temporal Activity Detection | | 0
Simultaneous Speech Extraction for Multiple Target Speakers under the Meeting Scenarios | | 0
Single-Microphone Speaker Separation and Voice Activity Detection in Noisy and Reverberant Environments | | 0
Sparse Activity Discovery in Energy Constrained Multi-Cluster IoT Networks Using Group Testing | | 0
Sparse Signal Processing for Massive Connectivity via Mixed-Integer Programming | | 0
Spatial Correlation Aware Compressed Sensing for User Activity Detection and Channel Estimation in Massive MTC | | 0
Spatial Morphing Kernel Regression For Feature Interpolation | | 0
Spatiotemporal Deformable Scene Graphs for Complex Activity Detection | | 0
Spatiotemporal Event Graphs for Dynamic Scene Understanding | | 0
Speaker and Posture Classification using Instantaneous Intraspeech Breathing Features | | 0
An Acoustic Emission Activity Detection Method based on Short-Term Waveform Features: Application to Metallic Components under Uniaxial Tensile Test | | 0
An AIoT-enabled Autonomous Dementia Monitoring System | | 0
Speaker Embeddings With Weakly Supervised Voice Activity Detection For Efficient Speaker Diarization | | 0
Speaker Independent Continuous Speech to Text Converter for Mobile Application | | 0
Speech enhancement aided end-to-end multi-task learning for voice activity detection | | 0
Spiking neural networks trained with backpropagation for low power neuromorphic implementation of voice activity detection | | 0
SPIRE-SIES: A Spontaneous Indian English Speech Corpus | | 0
Statistical and Neural Network Based Speech Activity Detection in Non-Stationary Acoustic Environments | | 0
An Efficient Active Set Algorithm for Covariance Based Joint Data and Activity Detection for Massive Random Access with Massive MIMO | | 0
Supporting More Active Users for Massive Access via Data-assisted Activity Detection | | 0
sVAD: A Robust, Low-Power, and Light-Weight Voice Activity Detection with Spiking Neural Networks | | 0
SVVAD: Personal Voice Activity Detection for Speaker Verification | | 0
Tackling the Cocktail Fork Problem for Separation and Transcription of Real-World Soundtracks | | 0
Tandem Multitask Training of Speaker Diarisation and Speech Recognition for Meeting Transcription | | 0
Target-Speaker Voice Activity Detection: a Novel Approach for Multi-Speaker Diarization in a Dinner Party Scenario | | 0
Target-Speaker Voice Activity Detection via Sequence-to-Sequence Prediction | | 0
Target-speaker Voice Activity Detection with Improved I-Vector Estimation for Unknown Number of Speaker | | 0
Target Speaker Voice Activity Detection with Transformers and Its Integration with End-to-End Neural Diarization | | 0
TCG CREST System Description for the Second DISPLACE Challenge | | 0
An Efficient Algorithm for Device Detection and Channel Estimation in Asynchronous IoT Systems | | 0
Spatio-Temporal Event Segmentation and Localization for Wildlife Extended Videos | | 0
Temporarily-Aware Context Modelling using Generative Adversarial Networks for Speech Activity Detection | | 0
Tensor vs Matrix Methods: Robust Tensor Decomposition under Block Sparse Perturbations | | 0
The AFRL IWSLT 2020 Systems: Work-From-Home Edition | | 0
The CUHK-TENCENT speaker diarization system for the ICASSP 2022 multi-channel multi-party meeting transcription challenge | | 0
The DKU-DukeECE Diarization System for the VoxCeleb Speaker Recognition Challenge 2022 | | 0
The DKU-DukeECE-Lenovo System for the Diarization Task of the 2021 VoxCeleb Speaker Recognition Challenge | | 0
The DKU-MSXF Diarization System for the VoxCeleb Speaker Recognition Challenge 2023 | | 0
The HUAWEI Speaker Diarisation System for the VoxCeleb Speaker Diarisation Challenge | | 0
The Impact of Silence on Speech Anti-Spoofing | | 0
The JHU Multi-Microphone Multi-Speaker ASR System for the CHiME-6 Challenge | | 0
The Kriston AI System for the VoxCeleb Speaker Recognition Challenge 2022 | | 0
The Newsbridge -Telecom SudParis VoxCeleb Speaker Recognition Challenge 2022 System Description | | 0
The RATS Collection: Supporting HLT Research with Degraded Audio Data | | 0
The SAFE-T Corpus: A New Resource for Simulated Public Safety Communications | | 0
An enhanced system for the detection and active cancellation of snoring signals | | 0
The "Sound of Silence" in EEG -- Cognitive voice activity detection | | 0
The Speed Submission to DIHARD II: Contributions & Lessons Learned | | 0
The USTC-Ximalaya system for the ICASSP 2022 multi-channel multi-party meeting transcription (M2MeT) challenge | | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | CNN-BiLSTM_best | ROC-AUC | 95.14 | | Unverified
2 | CNN-BiLSTM_small | ROC-AUC | 95.13 | | Unverified
3 | SG-VAD (ours) | ROC-AUC | 94.3 | | Unverified
4 | ADA-VAD | ROC-AUC | 79.1 | | Unverified