SOTAVerified

Activity Detection

Detecting activities in extended videos.

Papers

Showing 51100 of 380 papers

TitleStatusHype
Evaluation of real-time transcriptions using end-to-end ASR models0
Blind User Activity Detection for Grant-Free Random Access in Cell-Free mMIMO Networks0
Long-Term Conversation Analysis: Privacy-Utility Trade-off under Noise and Reverberation0
TokenVerse: Towards Unifying Speech and NLP Tasks via Transducer-based ASRCode0
Automatic Speech Recognition for Hindi0
Blending LLMs into Cascaded Speech Translation: KIT's Offline Speech Translation System for IWSLT 20240
AnimalFormer: Multimodal Vision Framework for Behavior-based Precision Livestock Farming0
Comparative Analysis of Personalized Voice Activity Detection Systems: Assessing Real-World Effectiveness0
Vessel Re-identification and Activity Detection in Thermal Domain for Maritime Surveillance0
Deep Learning-Based Approach for User Activity Detection with Grant-Free Random Access in Cell-Free Massive MIMO0
InaGVAD : a Challenging French TV and Radio Corpus Annotated for Speech Activity Detection and Speaker Gender SegmentationCode1
Precise Analysis of Covariance Identifiability for Activity Detection in Grant-Free Random Access0
A Real-Time Voice Activity Detection Based On Lightweight Neural0
Speaker Embeddings With Weakly Supervised Voice Activity Detection For Efficient Speaker Diarization0
Whispy: Adapting STT Whisper Models to Real-Time Environments0
Activity Detection for Massive Random Access using Covariance-based Matching Pursuit0
FAD-SAR: A Novel Fishing Activity Detection System via Synthetic Aperture Radar Images Based on Deep Learning Method0
A Customer Level Fraudulent Activity Detection Benchmark for Enhancing Machine Learning Model Research and Evaluation0
Leveraging 3D LiDAR Sensors to Enable Enhanced Urban Safety and Public Health: Pedestrian Monitoring and Abnormal Activity Detection0
Deep Learning-Assisted Parallel Interference Cancellation for Grant-Free NOMA in Machine-Type Communication0
Improving Speaker Assignment in Speaker-Attributed ASR for Real Meeting Applications0
sVAD: A Robust, Low-Power, and Light-Weight Voice Activity Detection with Spiking Neural Networks0
Fast Low-parameter Video Activity Localization in Collaborative Learning Environments0
Joint Activity-Delay Detection and Channel Estimation for Asynchronous Massive Random Access: A Free Probability Theory Approach0
Channel-Combination Algorithms for Robust Distant Voice Activity and Overlapped Speech Detection0
Device Activity Detection and Channel Estimation for Millimeter-Wave Massive MIMO0
A Computer Vision Based Approach for Stalking Detection Using a CNN-LSTM-MLP Hybrid Fusion Model0
Joint User Detection and Localization in Near-Field Using Reconfigurable Intelligent Surfaces0
Online speaker diarization of meetings guided by speech separationCode1
Activity Detection for Massive Connectivity in Cell-free Networks with Unknown Large-scale Fading, Channel Statistics, Noise Variance, and Activity Probability: A Bayesian ApproachCode0
Self-supervised New Activity Detection in Sensor-based Smart Environments0
Multi-Input Multi-Output Target-Speaker Voice Activity Detection For Unified, Flexible, and Robust Audio-Visual Speaker Diarization0
Single-Microphone Speaker Separation and Voice Activity Detection in Noisy and Reverberant Environments0
Self-supervised Pretraining for Robust Personalized Voice Activity Detection in Adverse Conditions0
Advanced Image Segmentation Techniques for Neural Activity Detection via C-fos Immediate Early Gene Expression0
Spatiotemporal Event Graphs for Dynamic Scene Understanding0
Towards More Practical Group Activity Detection: A New Benchmark and Model0
SPIRE-SIES: A Spontaneous Indian English Speech Corpus0
Combatting Human Trafficking in the Cyberspace: A Natural Language Processing-Based Methodology to Analyze the Language in Online Advertisements0
A Hybrid Graph Network for Complex Activity Detection in Video0
Prompt-driven Target Speech Diarization0
Device Detection and Channel Estimation in MTC with Correlated Activity Pattern0
Enhancing Illicit Activity Detection using XAI: A Multimodal Graph-LLM Framework0
Property-Aware Multi-Speaker Data Simulation: A Probabilistic Modelling Technique for Synthetic Data Generation0
Hierarchical MTC User Activity Detection and Channel Estimation with Unknown Spatial Covariance0
End-to-end Online Speaker Diarization with Target Speaker Tracking0
VSANet: Real-time Speech Enhancement Based on Voice Activity Detection and Causal Spatial Attention0
PP-MeT: a Real-world Personalized Prompt based Meeting Transcription System0
The Impact of Silence on Speech Anti-Spoofing0
Effective Abnormal Activity Detection on Multivariate Time Series Healthcare Data0
Show:102550
← PrevPage 2 of 8Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1CNN-BiLSTM_bestROC-AUC95.14Unverified
2CNN-BiLSTM_smallROC-AUC95.13Unverified
3SG-VAD (ours)ROC-AUC94.3Unverified
4ADA-VADROC-AUC79.1Unverified