SOTAVerified

Activity Detection

Detecting activities in extended videos.

Papers

Showing 201225 of 380 papers

TitleStatusHype
Random Access with Massive MIMO-OTFS in LEO Satellite Communications0
VADOI:Voice-Activity-Detection Overlapping Inference For End-to-end Long-form Speech Recognition0
Active Privacy-Utility Trade-off Against Inference in Time-Series Data Sharing0
The USTC-Ximalaya system for the ICASSP 2022 multi-channel multi-party meeting transcription (M2MeT) challenge0
Cross-Channel Attention-Based Target Speaker Voice Activity Detection: Experimental Results for M2MeT Challenge0
The CUHK-TENCENT speaker diarization system for the ICASSP 2022 multi-channel multi-party meeting transcription challenge0
Argus++: Robust Real-time Activity Detection for Unconstrained Video Streams with Overlapping Cube Proposals0
Egocentric Deep Multi-Channel Audio-Visual Active Speaker Localization0
Merry Go Round: Rotate a Frame and Fool a DNN0
Binary Image Skeletonization Using 2-Stage U-Net0
Low Resource Species Agnostic Bird Activity Detection0
User Activity Detection and Channel Estimation of Spatially Correlated Channels via AMP in Massive MTC0
Learning Proximal Operator Methods for Massive Connectivity in IoT Networks0
Reformulating Zero-shot Action Recognition for Multi-label Actions0
Speaker Embedding-aware Neural Diarization for Flexible Number of Speakers with Textual InformationCode0
Weakly-guided Self-supervised Pretraining for Temporal Activity DetectionCode0
User Activity Detection for Irregular Repetition Slotted Aloha based MMTC0
Access Delay Constrained Activity Detection in Massive Random Access0
Transcribe-to-Diarize: Neural Speaker Diarization for Unlimited Number of Speakers using End-to-End Speaker-Attributed ASR0
PEAF: Learnable Power Efficient Analog Acoustic Features for Audio Recognition0
Deep Learning-based Action Detection in Untrimmed Videos: A Survey0
The VVAD-LRS3 Dataset for Visual Voice Activity Detection0
The DKU-DukeECE-Lenovo System for the Diarization Task of the 2021 VoxCeleb Speaker Recognition Challenge0
Sparse Signal Processing for Massive Connectivity via Mixed-Integer Programming0
Target-speaker Voice Activity Detection with Improved I-Vector Estimation for Unknown Number of Speaker0
Show:102550
← PrevPage 9 of 16Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1CNN-BiLSTM_bestROC-AUC95.14Unverified
2CNN-BiLSTM_smallROC-AUC95.13Unverified
3SG-VAD (ours)ROC-AUC94.3Unverified
4ADA-VADROC-AUC79.1Unverified