SOTAVerified

Activity Detection

Detecting activities in extended videos.

Papers

Showing 301350 of 380 papers

TitleStatusHype
The VVAD-LRS3 Dataset for Visual Voice Activity Detection0
"This is Houston. Say again, please". The Behavox system for the Apollo-11 Fearless Steps Challenge (phase II)0
An Ensemble SVM-based Approach for Voice Activity Detection0
Towards end-2-end learning for predicting behavior codes from spoken utterances in psychotherapy conversations0
A new network-based algorithm for human activity recognition in video0
Towards More Practical Group Activity Detection: A New Benchmark and Model0
Towards Robust Overlapping Speech Detection: A Speaker-Aware Progressive Approach Using WavLM0
AnimalFormer: Multimodal Vision Framework for Behavior-based Precision Livestock Farming0
Trajectory-User Linking Is Easier Than You Think0
Transcribe-to-Diarize: Neural Speaker Diarization for Unlimited Number of Speakers using End-to-End Speaker-Attributed ASR0
Transferable Adversarial Attacks against ASR0
TRECVID 2019: An Evaluation Campaign to Benchmark Video Activity Detection, Video Captioning and Matching, and Video Search & Retrieval0
Tri-axial Self-Attention for Concurrent Activity Recognition0
TSUP Speaker Diarization System for Conversational Short-phrase Speaker Diarization Challenge0
Two-stream Multi-dimensional Convolutional Network for Real-time Violence Detection0
Two-Stream Region Convolutional 3D Network for Temporal Activity Detection0
Ultra-sensitive Flexible Sponge-Sensor Array for Muscle Activities Detection and Human Limb Motion Recognition0
Union of Low-Rank Subspaces Detector0
Universal Speaker Embedding Free Target Speaker Extraction and Personal Voice Activity Detection0
Anomalous Event Recognition in Videos Based on Joint Learningof Motion and Appearance with Multiple Ranking Measures0
Anomalous Sound Detection Based on Machine Activity Detection0
Unveiling ECC Vulnerabilities: LSTM Networks for Operation Recognition in Side-Channel Attacks0
Unveiling the Power of Complex-Valued Transformers in Wireless Communications0
User Activity Detection and Channel Estimation of Spatially Correlated Channels via AMP in Massive MTC0
User Activity Detection for Irregular Repetition Slotted Aloha based MMTC0
User Activity Detection with Delay-Calibration for Asynchronous Massive Random Access0
USTC-NELSLIP System Description for DIHARD-III Challenge0
VAD-free Streaming Hybrid CTC/Attention ASR for Unsegmented Recording0
VADOI:Voice-Activity-Detection Overlapping Inference For End-to-end Long-form Speech Recognition0
Vessel Re-identification and Activity Detection in Thermal Domain for Maritime Surveillance0
vireoJD-MM at Activity Detection in Extended Videos0
Voice Activity Detection for Transient Noisy Environment Based on Diffusion Nets0
Voice Activity Detection using Temporal Characteristics of Autocorrelation Lag and Maximum Spectral Amplitude in Sub-bands0
VOXLINGUA107: A DATASET FOR SPOKEN LANGUAGE RECOGNITION0
VSANet: Real-time Speech Enhancement Based on Voice Activity Detection and Causal Spatial Attention0
A Nonparametric Model for Multimodal Collaborative Activities Summarization0
Whispy: Adapting STT Whisper Models to Real-Time Environments0
DASZL: Dynamic Action Signatures for Zero-shot Learning0
Learning Latent Super-Events to Detect Multiple Activities in VideosCode0
A Convolutional Neural Network Smartphone App for Real-Time Voice Activity DetectionCode0
Activity Detection for Massive Connectivity in Cell-free Networks with Unknown Large-scale Fading, Channel Statistics, Noise Variance, and Activity Probability: A Bayesian ApproachCode0
ActivityNet: A Large-Scale Video Benchmark for Human Activity UnderstandingCode0
Adversarial Multi-Task Deep Learning for Noise-Robust Voice Activity Detection with Low Algorithmic DelayCode0
A Framework for Adapting Human-Robot Interaction to Diverse User GroupsCode0
A Pursuit of Temporal Accuracy in General Activity DetectionCode0
Argus: Efficient Activity Detection System for Extended Video AnalysisCode0
Automatic detection and prediction of nAMD activity change in retinal OCT using Siamese networks and Wasserstein Distance for ordinalityCode0
Automating Feedback Analysis in Surgical Training: Detection, Categorization, and AssessmentCode0
Coarse-Fine Networks for Temporal Activity Detection in VideosCode0
Dual Attention in Time and Frequency Domain for Voice Activity DetectionCode0
Show:102550
← PrevPage 7 of 8Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1CNN-BiLSTM_bestROC-AUC95.14Unverified
2CNN-BiLSTM_smallROC-AUC95.13Unverified
3SG-VAD (ours)ROC-AUC94.3Unverified
4ADA-VADROC-AUC79.1Unverified