SOTAVerified

Activity Detection

Detecting activities in extended videos.

Papers

Showing 201-250 of 380 papers

| Title | Status | Hype |
|---|---|---|
| Noise-Robust Target-Speaker Voice Activity Detection Through Self-Supervised Pretraining | | 0 |
| NTT Multi-Speaker ASR System for the DASR Task of CHiME-8 Challenge | | 0 |
| Nudge: Accelerating Overdue Pull Requests Towards Completion | | 0 |
| OFDM-Based Massive Connectivity for LEO Satellite Internet of Things | | 0 |
| Online Anomaly Detection via Class-Imbalance Learning | | 0 |
| Online Target Speaker Voice Activity Detection for Speaker Diarization | | 0 |
| On Multitask Loss Function for Audio Event Detection and Localization | | 0 |
| On the Detection of Non-Cooperative RISs: Scan B-Testing via Deep Support Vector Data Description | | 0 |
| On training targets for noise-robust voice activity detection | | 0 |
| On using the UA-Speech and TORGO databases to validate automatic dysarthric speech classification approaches | | 0 |
| Activity Recognition with Moving Cameras and Few Training Examples: Applications for Detection of Autism-Related Headbanging | | 0 |
| Overcomplete Frame Thresholding for Acoustic Scene Analysis | | 0 |
| PAMI-AD: An Activity Detector Exploiting Part-attention and Motion Information in Surveillance Videos | | 0 |
| Parallel Neurosymbolic Integration with Concordia | | 0 |
| A Customer Level Fraudulent Activity Detection Benchmark for Enhancing Machine Learning Model Research and Evaluation | | 0 |
| Personal VAD 2.0: Optimizing Personal Voice Activity Detection for On-Device Speech Recognition | | 0 |
| ADA-VAD: Unpaired Adversarial Domain Adaptation for Noise-Robust Voice Activity Detection | | 0 |
| PM-GANs: Discriminative Representation Learning for Action Recognition Using Partial-modalities | | 0 |
| Polish Read Speech Corpus for Speech Tools and Services | | 0 |
| PEAF: Learnable Power Efficient Analog Acoustic Features for Audio Recognition | | 0 |
| PP-MeT: a Real-world Personalized Prompt based Meeting Transcription System | | 0 |
| Precise Analysis of Covariance Identifiability for Activity Detection in Grant-Free Random Access | | 0 |
| Advanced Image Segmentation Techniques for Neural Activity Detection via C-fos Immediate Early Gene Expression | | 0 |
| Prompt-driven Target Speech Diarization | | 0 |
| Property-Aware Multi-Speaker Data Simulation: A Probabilistic Modelling Technique for Synthetic Data Generation | | 0 |
| A Boosting Algorithm for Positive-Unlabeled Learning | | 0 |
| Proximal Gradient-Based Unfolding for Massive Random Access in IoT Networks | | 0 |
| Quantum Learning Based Nonrandom Superimposed Coding for Secure Wireless Access in 5G URLLC | | 0 |
| Raising the Bar(ometer): Identifying a User's Stair and Lift Usage Through Wearable Sensor Data Analysis | | 0 |
| Random Access with Massive MIMO-OTFS in LEO Satellite Communications | | 0 |
| A Flexible Framework for Grant-Free Random Access in Cell-Free Massive MIMO Systems | | 0 |
| Real-Time Radar-Based Gesture Detection and Recognition Built in an Edge-Computing Platform | | 0 |
| Recursive Binary Neural Network Learning Model with 2-bit/weight Storage Requirement | | 0 |
| Reformulating Zero-shot Action Recognition for Multi-label Actions | | 0 |
| Accelerating Coordinate Descent via Active Set Selection for Device Activity Detection for Multi-Cell Massive Random Access | | 0 |
| Revisiting Few-shot Activity Detection with Class Similarity Control | | 0 |
| RIS Assisted Device Activity Detection with Statistical Channel State Information | | 0 |
| Robust Activity Detection for Massive Random Access | | 0 |
| Robust Learning-Based Sparse Recovery for Device Activity Detection in Grant-Free Random Access Cell-Free Massive MIMO: Enhancing Resilience to Impairments | | 0 |
| A Hybrid Graph Network for Complex Activity Detection in Video | | 0 |
| SegCodeNet: Color-Coded Segmentation Masks for Activity Detection from Wearable Cameras | | 0 |
| Self-Adaptive Soft Voice Activity Detection using Deep Neural Networks for Robust Speaker Verification | | 0 |
| Self-supervised Pretraining for Robust Personalized Voice Activity Detection in Adverse Conditions | | 0 |
| Algorithm Unrolling for Massive Access via Deep Neural Network with Theoretical Guarantee | | 0 |
| Semantic VAD: Low-Latency Voice Activity Detection for Speech Interaction | | 0 |
| Semi-supervised acoustic modelling for five-lingual code-switched ASR using automatically-segmented soap opera speech | | 0 |
| Semi-supervised Acoustic Modelling for Five-lingual Code-switched ASR using Automatically-segmented Soap Opera Speech | | 0 |
| Sequence Block based Compressed Sensing Multiuser Detection for 5G | | 0 |
| Sequence-to-Sequence Neural Diarization with Automatic Speaker Detection and Representation | | 0 |
| Siamese Neural Networks for Class Activity Detection | | 0 |

Benchmark Results

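| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | CNN-BiLSTM_best | ROC-AUC | 95.14 | | Unverified |
| 2 | CNN-BiLSTM_small | ROC-AUC | 95.13 | | Unverified |
| 3 | SG-VAD (ours) | ROC-AUC | 94.3 | | Unverified |
| 4 | ADA-VAD | ROC-AUC | 79.1 | | Unverified |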