SOTAVerified

Activity Detection

Detecting activities in extended videos.

Papers

Showing 201250 of 380 papers

TitleStatusHype
Random Access with Massive MIMO-OTFS in LEO Satellite Communications0
VADOI:Voice-Activity-Detection Overlapping Inference For End-to-end Long-form Speech Recognition0
Active Privacy-Utility Trade-off Against Inference in Time-Series Data Sharing0
The USTC-Ximalaya system for the ICASSP 2022 multi-channel multi-party meeting transcription (M2MeT) challenge0
Cross-Channel Attention-Based Target Speaker Voice Activity Detection: Experimental Results for M2MeT Challenge0
The CUHK-TENCENT speaker diarization system for the ICASSP 2022 multi-channel multi-party meeting transcription challenge0
Argus++: Robust Real-time Activity Detection for Unconstrained Video Streams with Overlapping Cube Proposals0
Egocentric Deep Multi-Channel Audio-Visual Active Speaker Localization0
Merry Go Round: Rotate a Frame and Fool a DNN0
Binary Image Skeletonization Using 2-Stage U-Net0
Low Resource Species Agnostic Bird Activity Detection0
User Activity Detection and Channel Estimation of Spatially Correlated Channels via AMP in Massive MTC0
Learning Proximal Operator Methods for Massive Connectivity in IoT Networks0
Reformulating Zero-shot Action Recognition for Multi-label Actions0
Speaker Embedding-aware Neural Diarization for Flexible Number of Speakers with Textual InformationCode0
Weakly-guided Self-supervised Pretraining for Temporal Activity DetectionCode0
User Activity Detection for Irregular Repetition Slotted Aloha based MMTC0
Access Delay Constrained Activity Detection in Massive Random Access0
Transcribe-to-Diarize: Neural Speaker Diarization for Unlimited Number of Speakers using End-to-End Speaker-Attributed ASR0
PEAF: Learnable Power Efficient Analog Acoustic Features for Audio Recognition0
Deep Learning-based Action Detection in Untrimmed Videos: A Survey0
The VVAD-LRS3 Dataset for Visual Voice Activity Detection0
The DKU-DukeECE-Lenovo System for the Diarization Task of the 2021 VoxCeleb Speaker Recognition Challenge0
Sparse Signal Processing for Massive Connectivity via Mixed-Integer Programming0
Target-speaker Voice Activity Detection with Improved I-Vector Estimation for Unknown Number of Speaker0
Fine-Grained Classroom Activity Detection from Audio with Neural NetworksCode0
VAD-free Streaming Hybrid CTC/Attention ASR for Unsegmented Recording0
Joint Activity Detection, Channel Estimation, and Data Decoding for Grant-free Massive Random Access0
Voice Activity Detection for Transient Noisy Environment Based on Diffusion Nets0
Dealing with training and test segmentation mismatch: FBK@IWSLT20210
EML Online Speech Activity Detection for the Fearless Steps Challenge Phase-III0
Algorithm Unrolling for Massive Access via Deep Neural Network with Theoretical Guarantee0
JRDB-Act: A Large-scale Dataset for Spatio-temporal Action, Social Group and Activity Detection0
Joint Channel Estimation and Device Activity Detection in Heterogeneous Networks0
Accelerating Coordinate Descent via Active Set Selection for Device Activity Detection for Multi-Cell Massive Random Access0
Joint Activity Detection and Data Decoding in Massive Random Access via a Turbo Receiver0
Beyond Voice Activity Detection: Hybrid Audio Segmentation for Direct Speech Translation0
Spatial Correlation Aware Compressed Sensing for User Activity Detection and Channel Estimation in Massive MTC0
Spatiotemporal Deformable Scene Graphs for Complex Activity Detection0
Improvement of Noise-Robust Single-Channel Voice Activity Detection with Spatial Pre-processing0
Early Detection of In-Memory Malicious Activity based on Run-time Environmental Features0
Sparse Activity Discovery in Energy Constrained Multi-Cluster IoT Networks Using Group Testing0
USTC-NELSLIP System Description for DIHARD-III Challenge0
Iterative Reweighted Algorithms for Joint User Identification and Channel Estimation in Spatially Correlated Massive MTC0
An Ultra-low Power RNN Classifier for Always-On Voice Wake-Up Detection Robust to Real-World Scenarios0
Incorporating VAD into ASR System by Multi-task Learning0
Coarse-Fine Networks for Temporal Activity Detection in VideosCode0
End-to-End Dereverberation, Beamforming, and Speech Recognition with Improved Numerical Stability and Advanced Frontend0
Supporting More Active Users for Massive Access via Data-assisted Activity Detection0
On training targets for noise-robust voice activity detection0
Show:102550
← PrevPage 5 of 8Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1CNN-BiLSTM_bestROC-AUC95.14Unverified
2CNN-BiLSTM_smallROC-AUC95.13Unverified
3SG-VAD (ours)ROC-AUC94.3Unverified
4ADA-VADROC-AUC79.1Unverified