SOTAVerified

Activity Detection

Detecting activities in extended videos.

Papers

Showing 51-100 of 380 papers

Title | Status | Hype
Unveiling ECC Vulnerabilities: LSTM Networks for Operation Recognition in Side-Channel Attacks | - | 0
LLM-Enhanced Dialogue Management for Full-Duplex Spoken Dialogue Systems | - | 0
FlexDuo: A Pluggable System for Enabling Full-Duplex Capabilities in Speech Dialogue Systems | - | 0
DT4ECG: A Dual-Task Learning Framework for ECG-Based Human Identity Recognition and Human Activity Detection | - | 0
Unveiling the Power of Complex-Valued Transformers in Wireless Communications | - | 0
Microphone Array Geometry Independent Multi-Talker Distant ASR: NTT System for the DASR Task of the CHiME-8 Challenge | - | 0
When do they StOP?: A First Step Towards Automatically Identifying Team Communication in the Operating Room | Code | 0
Pre-Equalization Aided Grant-Free Massive Access in Massive MIMO System | Code | 0
Automatic detection and prediction of nAMD activity change in retinal OCT using Siamese networks and Wasserstein Distance for ordinality | Code | 0
Universal Speaker Embedding Free Target Speaker Extraction and Personal Voice Activity Detection | - | 0
Noise-Robust Target-Speaker Voice Activity Detection Through Self-Supervised Pretraining | - | 0
Fotheidil: an Automatic Transcription System for the Irish Language | - | 0
Comparative Analysis of Deep Learning Approaches for Harmful Brain Activity Detection Using EEG | - | 0
Asynchronous Random Access in Massive MIMO Systems Facilitated by the Delay-Angle Domain | - | 0
Automating Feedback Analysis in Surgical Training: Detection, Categorization, and Assessment | Code | 0
Sequence-to-Sequence Neural Diarization with Automatic Speaker Detection and Representation | - | 0
A Flexible Framework for Grant-Free Random Access in Cell-Free Massive MIMO Systems | - | 0
Transferable Adversarial Attacks against ASR | - | 0
On the Detection of Non-Cooperative RISs: Scan B-Testing via Deep Support Vector Data Description | - | 0
Joint Training of Speaker Embedding Extractor, Speech and Overlap Detection for Diarization | - | 0
User Activity Detection with Delay-Calibration for Asynchronous Massive Random Access | - | 0
Intelligent Video Recording Optimization using Activity Detection for Surveillance Systems | - | 0
CLIP-VAD: Exploiting Vision-Language Models for Voice Activity Detection | - | 0
A Framework for Adapting Human-Robot Interaction to Diverse User Groups | Code | 0
Investigation of Speaker Representation for Target-Speaker Speech Processing | - | 0
Raising the Bar(ometer): Identifying a User's Stair and Lift Usage Through Wearable Sensor Data Analysis | - | 0
M-BEST-RQ: A Multi-Channel Speech Foundation Model for Smart Glasses | - | 0
TCG CREST System Description for the Second DISPLACE Challenge | - | 0
A Comprehensive Methodological Survey of Human Activity Recognition Across Divers Data Modalities | - | 0
Evaluation of real-time transcriptions using end-to-end ASR models | - | 0
NTT Multi-Speaker ASR System for the DASR Task of CHiME-8 Challenge | - | 0
Blind User Activity Detection for Grant-Free Random Access in Cell-Free mMIMO Networks | - | 0
Long-Term Conversation Analysis: Privacy-Utility Trade-off under Noise and Reverberation | - | 0
TokenVerse: Towards Unifying Speech and NLP Tasks via Transducer-based ASR | Code | 0
Automatic Speech Recognition for Hindi | - | 0
Blending LLMs into Cascaded Speech Translation: KIT's Offline Speech Translation System for IWSLT 2024 | - | 0
AnimalFormer: Multimodal Vision Framework for Behavior-based Precision Livestock Farming | - | 0
Comparative Analysis of Personalized Voice Activity Detection Systems: Assessing Real-World Effectiveness | - | 0
Vessel Re-identification and Activity Detection in Thermal Domain for Maritime Surveillance | - | 0
Deep Learning-Based Approach for User Activity Detection with Grant-Free Random Access in Cell-Free Massive MIMO | - | 0
Precise Analysis of Covariance Identifiability for Activity Detection in Grant-Free Random Access | - | 0
A Real-Time Voice Activity Detection Based On Lightweight Neural | - | 0
Speaker Embeddings With Weakly Supervised Voice Activity Detection For Efficient Speaker Diarization | - | 0
Whispy: Adapting STT Whisper Models to Real-Time Environments | - | 0
Activity Detection for Massive Random Access using Covariance-based Matching Pursuit | - | 0
FAD-SAR: A Novel Fishing Activity Detection System via Synthetic Aperture Radar Images Based on Deep Learning Method | - | 0
A Customer Level Fraudulent Activity Detection Benchmark for Enhancing Machine Learning Model Research and Evaluation | - | 0
Leveraging 3D LiDAR Sensors to Enable Enhanced Urban Safety and Public Health: Pedestrian Monitoring and Abnormal Activity Detection | - | 0
Deep Learning-Assisted Parallel Interference Cancellation for Grant-Free NOMA in Machine-Type Communication | - | 0
Improving Speaker Assignment in Speaker-Attributed ASR for Real Meeting Applications | - | 0
Page 2 of 8

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | CNN-BiLSTM_best | ROC-AUC | 95.14 | - | Unverified
2 | CNN-BiLSTM_small | ROC-AUC | 95.13 | - | Unverified
3 | SG-VAD (ours) | ROC-AUC | 94.3 | - | Unverified
4 | ADA-VAD | ROC-AUC | 79.1 | - | Unverified