SOTAVerified

Activity Detection

Detecting activities in extended videos.

Papers

Showing 5175 of 380 papers

TitleStatusHype
Unveiling ECC Vulnerabilities: LSTM Networks for Operation Recognition in Side-Channel Attacks0
FlexDuo: A Pluggable System for Enabling Full-Duplex Capabilities in Speech Dialogue Systems0
LLM-Enhanced Dialogue Management for Full-Duplex Spoken Dialogue Systems0
Unveiling the Power of Complex-Valued Transformers in Wireless Communications0
DT4ECG: A Dual-Task Learning Framework for ECG-Based Human Identity Recognition and Human Activity Detection0
Microphone Array Geometry Independent Multi-Talker Distant ASR: NTT System for the DASR Task of the CHiME-8 Challenge0
When do they StOP?: A First Step Towards Automatically Identifying Team Communication in the Operating RoomCode0
Pre-Equalization Aided Grant-Free Massive Access in Massive MIMO SystemCode0
Automatic detection and prediction of nAMD activity change in retinal OCT using Siamese networks and Wasserstein Distance for ordinalityCode0
Universal Speaker Embedding Free Target Speaker Extraction and Personal Voice Activity Detection0
Noise-Robust Target-Speaker Voice Activity Detection Through Self-Supervised Pretraining0
Fotheidil: an Automatic Transcription System for the Irish Language0
Comparative Analysis of Deep Learning Approaches for Harmful Brain Activity Detection Using EEG0
Asynchronous Random Access in Massive MIMO Systems Facilitated by the Delay-Angle Domain0
Automating Feedback Analysis in Surgical Training: Detection, Categorization, and AssessmentCode0
Sequence-to-Sequence Neural Diarization with Automatic Speaker Detection and Representation0
A Flexible Framework for Grant-Free Random Access in Cell-Free Massive MIMO Systems0
Transferable Adversarial Attacks against ASR0
On the Detection of Non-Cooperative RISs: Scan B-Testing via Deep Support Vector Data Description0
Intelligent Video Recording Optimization using Activity Detection for Surveillance Systems0
User Activity Detection with Delay-Calibration for Asynchronous Massive Random Access0
Joint Training of Speaker Embedding Extractor, Speech and Overlap Detection for Diarization0
CLIP-VAD: Exploiting Vision-Language Models for Voice Activity Detection0
A Framework for Adapting Human-Robot Interaction to Diverse User GroupsCode0
Investigation of Speaker Representation for Target-Speaker Speech Processing0
Show:102550
← PrevPage 3 of 16Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1CNN-BiLSTM_bestROC-AUC95.14Unverified
2CNN-BiLSTM_smallROC-AUC95.13Unverified
3SG-VAD (ours)ROC-AUC94.3Unverified
4ADA-VADROC-AUC79.1Unverified