SOTAVerified

Activity Detection

Detecting activities in extended videos.

Papers

Showing 101150 of 380 papers

TitleStatusHype
sVAD: A Robust, Low-Power, and Light-Weight Voice Activity Detection with Spiking Neural Networks0
Fast Low-parameter Video Activity Localization in Collaborative Learning Environments0
Joint Activity-Delay Detection and Channel Estimation for Asynchronous Massive Random Access: A Free Probability Theory Approach0
Channel-Combination Algorithms for Robust Distant Voice Activity and Overlapped Speech Detection0
Device Activity Detection and Channel Estimation for Millimeter-Wave Massive MIMO0
A Computer Vision Based Approach for Stalking Detection Using a CNN-LSTM-MLP Hybrid Fusion Model0
Joint User Detection and Localization in Near-Field Using Reconfigurable Intelligent Surfaces0
Activity Detection for Massive Connectivity in Cell-free Networks with Unknown Large-scale Fading, Channel Statistics, Noise Variance, and Activity Probability: A Bayesian ApproachCode0
Self-supervised New Activity Detection in Sensor-based Smart Environments0
Multi-Input Multi-Output Target-Speaker Voice Activity Detection For Unified, Flexible, and Robust Audio-Visual Speaker Diarization0
Single-Microphone Speaker Separation and Voice Activity Detection in Noisy and Reverberant Environments0
Self-supervised Pretraining for Robust Personalized Voice Activity Detection in Adverse Conditions0
Advanced Image Segmentation Techniques for Neural Activity Detection via C-fos Immediate Early Gene Expression0
Spatiotemporal Event Graphs for Dynamic Scene Understanding0
Towards More Practical Group Activity Detection: A New Benchmark and Model0
SPIRE-SIES: A Spontaneous Indian English Speech Corpus0
Combatting Human Trafficking in the Cyberspace: A Natural Language Processing-Based Methodology to Analyze the Language in Online Advertisements0
A Hybrid Graph Network for Complex Activity Detection in Video0
Device Detection and Channel Estimation in MTC with Correlated Activity Pattern0
Prompt-driven Target Speech Diarization0
Enhancing Illicit Activity Detection using XAI: A Multimodal Graph-LLM Framework0
Property-Aware Multi-Speaker Data Simulation: A Probabilistic Modelling Technique for Synthetic Data Generation0
Hierarchical MTC User Activity Detection and Channel Estimation with Unknown Spatial Covariance0
End-to-end Online Speaker Diarization with Target Speaker Tracking0
VSANet: Real-time Speech Enhancement Based on Voice Activity Detection and Causal Spatial Attention0
PP-MeT: a Real-world Personalized Prompt based Meeting Transcription System0
The Impact of Silence on Speech Anti-Spoofing0
Effective Abnormal Activity Detection on Multivariate Time Series Healthcare Data0
In-Ear-Voice: Towards Milli-Watt Audio Enhancement With Bone-Conduction Microphones for In-Ear Sensing Platforms0
The DKU-MSXF Diarization System for the VoxCeleb Speaker Recognition Challenge 20230
Integrating Emotion Recognition with Speech Recognition and Speaker Diarisation for ConversationsCode0
An enhanced system for the detection and active cancellation of snoring signals0
Long-term Conversation Analysis: Exploring Utility and PrivacyCode0
Multi-microphone Automatic Speech Segmentation in Meetings Based on Circular Harmonics Features0
Parallel Neurosymbolic Integration with Concordia0
SVVAD: Personal Voice Activity Detection for Speaker Verification0
Building Accurate Low Latency ASR for Streaming Voice Search0
Semantic VAD: Low-Latency Voice Activity Detection for Speech Interaction0
Joint Activity-Delay Detection and Channel Estimation for Asynchronous Massive Random Access0
FunASR: A Fundamental End-to-End Speech Recognition ToolkitCode0
Deep Learning for Asynchronous Massive Access with Data Frame Length Diversity0
Joint Activity Detection and Channel Estimation for Clustered Massive Machine Type Communications0
Cooperative Multi-Cell Massive Access with Temporally Correlated Activity0
Array Configuration-Agnostic Personal Voice Activity Detection Based on Spatial Coherence0
Grant-free Massive Random Access with Retransmission: Receiver Optimization and Performance Analysis0
Evaluation of Noise Reduction Methods for Sentence Recognition by Sinhala Speaking ListenersCode0
Better Together: Dialogue Separation and Voice Activity Detection for Audio Personalization in TV0
End-to-End Integration of Speech Separation and Voice Activity Detection for Low-Latency Diarization of Telephone Conversations0
A processing framework to access large quantities of whispered speech found in ASMR0
Multi-Task Sub-Band Network For Deep Residual Echo Suppression0
Show:102550
← PrevPage 3 of 8Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1CNN-BiLSTM_bestROC-AUC95.14Unverified
2CNN-BiLSTM_smallROC-AUC95.13Unverified
3SG-VAD (ours)ROC-AUC94.3Unverified
4ADA-VADROC-AUC79.1Unverified