SOTAVerified

Activity Detection

Detecting activities in extended videos.

Papers

Showing 251275 of 380 papers

TitleStatusHype
Multi-channel end-to-end neural network for speech enhancement, source localization, and voice activity detection0
Multi-Input Multi-Output Target-Speaker Voice Activity Detection For Unified, Flexible, and Robust Audio-Visual Speaker Diarization0
Multi-microphone Automatic Speech Segmentation in Meetings Based on Circular Harmonics Features0
Multimodal Learning For Classroom Activity Detection0
Multi-Task Network for Noise-Robust Keyword Spotting and Speaker Verification using CTC-based Soft VAD and Global Query Attention0
Multi-task Self-Supervised Learning for Human Activity Detection0
Multi-Task Sub-Band Network For Deep Residual Echo Suppression0
Multi-timescale Event Detection in Nonintrusive Load Monitoring based on MDL Principle0
Multi-timescale Trajectory Prediction for Abnormal Human Activity Detection0
Noise-Robust Target-Speaker Voice Activity Detection Through Self-Supervised Pretraining0
NTT Multi-Speaker ASR System for the DASR Task of CHiME-8 Challenge0
Nudge: Accelerating Overdue Pull Requests Towards Completion0
OFDM-Based Massive Connectivity for LEO Satellite Internet of Things0
Online Anomaly Detection via Class-Imbalance Learning0
Online Target Speaker Voice Activity Detection for Speaker Diarization0
On Multitask Loss Function for Audio Event Detection and Localization0
On the Detection of Non-Cooperative RISs: Scan B-Testing via Deep Support Vector Data Description0
On training targets for noise-robust voice activity detection0
On using the UA-Speech and TORGO databases to validate automatic dysarthric speech classification approaches0
Overcomplete Frame Thresholding for Acoustic Scene Analysis0
PAMI-AD: An Activity Detector Exploiting Part-attention and Motion Information in Surveillance Videos0
Parallel Neurosymbolic Integration with Concordia0
Personal VAD 2.0: Optimizing Personal Voice Activity Detection for On-Device Speech Recognition0
PM-GANs: Discriminative Representation Learning for Action Recognition Using Partial-modalities0
Polish Read Speech Corpus for Speech Tools and Services0
Show:102550
← PrevPage 11 of 16Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1CNN-BiLSTM_bestROC-AUC95.14Unverified
2CNN-BiLSTM_smallROC-AUC95.13Unverified
3SG-VAD (ours)ROC-AUC94.3Unverified
4ADA-VADROC-AUC79.1Unverified