SOTAVerified

Activity Detection

Detecting activities in extended videos.

Papers

Showing 201250 of 380 papers

TitleStatusHype
Incorporating VAD into ASR System by Multi-task Learning0
Unveiling ECC Vulnerabilities: LSTM Networks for Operation Recognition in Side-Channel Attacks0
Long-Term Conversation Analysis: Privacy-Utility Trade-off under Noise and Reverberation0
Low Resource Species Agnostic Bird Activity Detection0
MarbleNet: Deep 1D Time-Channel Separable Convolutional Neural Network for Voice Activity Detection0
Massive Machine Type Communication Pilot-Hopping Sequence Detection Architectures Based on Non-Negative Least Squares for Grant-Free Random Access0
M-BEST-RQ: A Multi-Channel Speech Foundation Model for Smart Glasses0
Merry Go Round: Rotate a Frame and Fool a DNN0
MEVA: A Large-Scale Multiview, Multimodal Video Dataset for Activity Detection0
MicroNAS: An Automated Framework for Developing a Fall Detection System0
Microphone Array Geometry Independent Multi-Talker Distant ASR: NTT System for the DASR Task of the CHiME-8 Challenge0
Mixture of Experts-augmented Deep Unfolding for Activity Detection in IRS-aided Systems0
MLNET: An Adaptive Multiple Receptive-field Attention Neural Network for Voice Activity Detection0
A Customer Level Fraudulent Activity Detection Benchmark for Enhancing Machine Learning Model Research and Evaluation0
Activity Recognition with Moving Cameras and Few Training Examples: Applications for Detection of Autism-Related Headbanging0
Multi-channel end-to-end neural network for speech enhancement, source localization, and voice activity detection0
Multi-Input Multi-Output Target-Speaker Voice Activity Detection For Unified, Flexible, and Robust Audio-Visual Speaker Diarization0
Multi-microphone Automatic Speech Segmentation in Meetings Based on Circular Harmonics Features0
Multimodal Learning For Classroom Activity Detection0
An Empirical Study on Activity Recognition in Long Surgical Videos0
Activity Detection from Wearable Electromyogram Sensors using Hidden Markov Model0
Multi-Task Network for Noise-Robust Keyword Spotting and Speaker Verification using CTC-based Soft VAD and Global Query Attention0
Multi-task Self-Supervised Learning for Human Activity Detection0
Multi-Task Sub-Band Network For Deep Residual Echo Suppression0
Multi-timescale Event Detection in Nonintrusive Load Monitoring based on MDL Principle0
Multi-timescale Trajectory Prediction for Abnormal Human Activity Detection0
Activity Detection for Massive Random Access using Covariance-based Matching Pursuit0
Noise-Robust Target-Speaker Voice Activity Detection Through Self-Supervised Pretraining0
NTT Multi-Speaker ASR System for the DASR Task of CHiME-8 Challenge0
Nudge: Accelerating Overdue Pull Requests Towards Completion0
OFDM-Based Massive Connectivity for LEO Satellite Internet of Things0
Online Anomaly Detection via Class-Imbalance Learning0
Activity Detection for Grant-Free NOMA in Massive IoT Networks0
Online Target Speaker Voice Activity Detection for Speaker Diarization0
On Multitask Loss Function for Audio Event Detection and Localization0
On the Detection of Non-Cooperative RISs: Scan B-Testing via Deep Support Vector Data Description0
On training targets for noise-robust voice activity detection0
On using the UA-Speech and TORGO databases to validate automatic dysarthric speech classification approaches0
Unveiling the Power of Complex-Valued Transformers in Wireless Communications0
Overcomplete Frame Thresholding for Acoustic Scene Analysis0
PAMI-AD: An Activity Detector Exploiting Part-attention and Motion Information in Surveillance Videos0
Parallel Neurosymbolic Integration with Concordia0
User Activity Detection and Channel Estimation of Spatially Correlated Channels via AMP in Massive MTC0
Personal VAD 2.0: Optimizing Personal Voice Activity Detection for On-Device Speech Recognition0
User Activity Detection for Irregular Repetition Slotted Aloha based MMTC0
PM-GANs: Discriminative Representation Learning for Action Recognition Using Partial-modalities0
Polish Read Speech Corpus for Speech Tools and Services0
PEAF: Learnable Power Efficient Analog Acoustic Features for Audio Recognition0
PP-MeT: a Real-world Personalized Prompt based Meeting Transcription System0
Precise Analysis of Covariance Identifiability for Activity Detection in Grant-Free Random Access0
Show:102550
← PrevPage 5 of 8Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1CNN-BiLSTM_bestROC-AUC95.14Unverified
2CNN-BiLSTM_smallROC-AUC95.13Unverified
3SG-VAD (ours)ROC-AUC94.3Unverified
4ADA-VADROC-AUC79.1Unverified