SOTAVerified

Activity Detection

Detecting activities in extended videos.

Papers

Showing 151-200 of 380 papers

Title | Status | Hype
Iterative Reweighted Algorithms for Joint User Identification and Channel Estimation in Spatially Correlated Massive MTC | | 0
Joint Activity-Delay Detection and Channel Estimation for Asynchronous Massive Random Access | | 0
Joint Activity-Delay Detection and Channel Estimation for Asynchronous Massive Random Access: A Free Probability Theory Approach | | 0
Joint Activity Detection and Channel Estimation for Clustered Massive Machine Type Communications | | 0
Joint Activity Detection and Channel Estimation for Massive Connectivity: Where Message Passing Meets Score-Based Generative Priors | | 0
Joint Activity Detection and Data Decoding in Massive Random Access via a Turbo Receiver | | 0
Joint Activity Detection, Channel Estimation, and Data Decoding for Grant-free Massive Random Access | | 0
Joint Channel Estimation and Device Activity Detection in Heterogeneous Networks | | 0
Joint Estimation of Clustered User Activity and Correlated Channels with Unknown Covariance in mMTC | | 0
Jointly Detecting and Separating Singing Voice: A Multi-Task Approach | | 0
Jointly Sparse Signal Recovery and Support Recovery via Deep Learning with Applications in MIMO-based Grant-Free Random Access | | 0
Joint Speech Activity and Overlap Detection with Multi-Exit Architecture | | 0
Joint Training of Speaker Embedding Extractor, Speech and Overlap Detection for Diarization | | 0
Joint User Activity and Data Detection in Grant-Free NOMA using Generative Neural Networks | | 0
Joint User Detection and Localization in Near-Field Using Reconfigurable Intelligent Surfaces | | 0
JRDB-Act: A Large-scale Dataset for Spatio-temporal Action, Social Group and Activity Detection | | 0
Kernel-based Sensor Fusion with Application to Audio-Visual Voice Activity Detection | | 0
KIDS: kinematics-based (in)activity detection and segmentation in a sleep case study | | 0
Learnable Acoustic Frontends in Bird Activity Detection | | 0
Learnable Frontends that do not Learn: Quantifying Sensitivity to Filterbank Initialisation | | 0
Learning Activity Progression in LSTMs for Activity Detection and Early Detection | | 0
A Bin Encoding Training of a Spiking Neural Network-based Voice Activity Detection | | 0
Learning Proximal Operator Methods for Massive Connectivity in IoT Networks | | 0
Learning Spatio-Temporal Structure from RGB-D Videos for Human Activity Detection and Anticipation | | 0
Learning Visual Voice Activity Detection with an Automatically Annotated Dataset | | 0
Leaving Some Stones Unturned: Dynamic Feature Prioritization for Activity Detection in Streaming Video | | 0
Leveraging 3D LiDAR Sensors to Enable Enhanced Urban Safety and Public Health: Pedestrian Monitoring and Abnormal Activity Detection | | 0
Lightweight Learning for Grant-Free Activity Detection in Cell-Free Massive MIMO Networks | | 0
LLM-Enhanced Dialogue Management for Full-Duplex Spoken Dialogue Systems | | 0
Incorporating VAD into ASR System by Multi-task Learning | | 0
Long-Term Conversation Analysis: Privacy-Utility Trade-off under Noise and Reverberation | | 0
Low Resource Species Agnostic Bird Activity Detection | | 0
MarbleNet: Deep 1D Time-Channel Separable Convolutional Neural Network for Voice Activity Detection | | 0
Massive Machine Type Communication Pilot-Hopping Sequence Detection Architectures Based on Non-Negative Least Squares for Grant-Free Random Access | | 0
M-BEST-RQ: A Multi-Channel Speech Foundation Model for Smart Glasses | | 0
Merry Go Round: Rotate a Frame and Fool a DNN | | 0
MEVA: A Large-Scale Multiview, Multimodal Video Dataset for Activity Detection | | 0
MicroNAS: An Automated Framework for Developing a Fall Detection System | | 0
Microphone Array Geometry Independent Multi-Talker Distant ASR: NTT System for the DASR Task of the CHiME-8 Challenge | | 0
Mixture of Experts-augmented Deep Unfolding for Activity Detection in IRS-aided Systems | | 0
MLNET: An Adaptive Multiple Receptive-field Attention Neural Network for Voice Activity Detection | | 0
Multi-channel end-to-end neural network for speech enhancement, source localization, and voice activity detection | | 0
Multi-Input Multi-Output Target-Speaker Voice Activity Detection For Unified, Flexible, and Robust Audio-Visual Speaker Diarization | | 0
Multi-microphone Automatic Speech Segmentation in Meetings Based on Circular Harmonics Features | | 0
Multimodal Learning For Classroom Activity Detection | | 0
Multi-Task Network for Noise-Robust Keyword Spotting and Speaker Verification using CTC-based Soft VAD and Global Query Attention | | 0
Multi-task Self-Supervised Learning for Human Activity Detection | | 0
Multi-Task Sub-Band Network For Deep Residual Echo Suppression | | 0
Multi-timescale Event Detection in Nonintrusive Load Monitoring based on MDL Principle | | 0
Multi-timescale Trajectory Prediction for Abnormal Human Activity Detection | | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | CNN-BiLSTM_best | ROC-AUC | 95.14 | | Unverified
2 | CNN-BiLSTM_small | ROC-AUC | 95.13 | | Unverified
3 | SG-VAD (ours) | ROC-AUC | 94.3 | | Unverified
4 | ADA-VAD | ROC-AUC | 79.1 | | Unverified