SOTAVerified

Activity Detection

Detecting activities in extended videos.

Papers

Showing 201250 of 380 papers

TitleStatusHype
Speaker Embedding-aware Neural Diarization for Flexible Number of Speakers with Textual InformationCode0
Weakly-guided Self-supervised Pretraining for Temporal Activity DetectionCode0
User Activity Detection for Irregular Repetition Slotted Aloha based MMTC0
Access Delay Constrained Activity Detection in Massive Random Access0
AVASpeech-SMAD: A Strongly Labelled Speech and Music Activity Detection Dataset with Label Co-OccurrenceCode1
BERTraffic: BERT-based Joint Speaker Role and Speaker Change Detection for Air Traffic Control CommunicationsCode1
Transcribe-to-Diarize: Neural Speaker Diarization for Unlimited Number of Speakers using End-to-End Speaker-Attributed ASR0
PEAF: Learnable Power Efficient Analog Acoustic Features for Audio Recognition0
Deep Learning-based Action Detection in Untrimmed Videos: A Survey0
The VVAD-LRS3 Dataset for Visual Voice Activity Detection0
The DKU-DukeECE-Lenovo System for the Diarization Task of the 2021 VoxCeleb Speaker Recognition Challenge0
Sparse Signal Processing for Massive Connectivity via Mixed-Integer Programming0
Classification of Abnormal Hand Movement for Aiding in Autism Detection: Machine Learning StudyCode1
Target-speaker Voice Activity Detection with Improved I-Vector Estimation for Unknown Number of Speaker0
Fine-Grained Classroom Activity Detection from Audio with Neural NetworksCode0
VAD-free Streaming Hybrid CTC/Attention ASR for Unsegmented Recording0
Joint Activity Detection, Channel Estimation, and Data Decoding for Grant-free Massive Random Access0
Voice Activity Detection for Transient Noisy Environment Based on Diffusion Nets0
Dealing with training and test segmentation mismatch: FBK@IWSLT20210
EML Online Speech Activity Detection for the Fearless Steps Challenge Phase-III0
Algorithm Unrolling for Massive Access via Deep Neural Network with Theoretical Guarantee0
JRDB-Act: A Large-scale Dataset for Spatio-temporal Action, Social Group and Activity Detection0
WASE: Learning When to Attend for Speaker Extraction in Cocktail Party EnvironmentsCode1
Joint Channel Estimation and Device Activity Detection in Heterogeneous Networks0
Accelerating Coordinate Descent via Active Set Selection for Device Activity Detection for Multi-Cell Massive Random Access0
Joint Activity Detection and Data Decoding in Massive Random Access via a Turbo Receiver0
Beyond Voice Activity Detection: Hybrid Audio Segmentation for Direct Speech Translation0
Spatial Correlation Aware Compressed Sensing for User Activity Detection and Channel Estimation in Massive MTC0
Spatiotemporal Deformable Scene Graphs for Complex Activity Detection0
Improvement of Noise-Robust Single-Channel Voice Activity Detection with Spatial Pre-processing0
End-to-end speaker segmentation for overlap-aware resegmentationCode1
Sparse Activity Discovery in Energy Constrained Multi-Cluster IoT Networks Using Group Testing0
Early Detection of In-Memory Malicious Activity based on Run-time Environmental Features0
USTC-NELSLIP System Description for DIHARD-III Challenge0
Iterative Reweighted Algorithms for Joint User Identification and Channel Estimation in Spatially Correlated Massive MTC0
Learning spectro-temporal representations of complex sounds with parameterized neural networksCode1
An Ultra-low Power RNN Classifier for Always-On Voice Wake-Up Detection Robust to Real-World Scenarios0
A Hybrid CNN-BiLSTM Voice Activity DetectorCode1
Incorporating VAD into ASR System by Multi-task Learning0
Coarse-Fine Networks for Temporal Activity Detection in VideosCode0
End-to-End Dereverberation, Beamforming, and Speech Recognition with Improved Numerical Stability and Advanced Frontend0
ROAD: The ROad event Awareness Dataset for Autonomous DrivingCode1
Supporting More Active Users for Massive Access via Data-assisted Activity Detection0
On training targets for noise-robust voice activity detection0
An Efficient Active Set Algorithm for Covariance Based Joint Data and Activity Detection for Massive Random Access with Massive MIMO0
Anomalous Event Recognition in Videos Based on Joint Learningof Motion and Appearance with Multiple Ranking Measures0
Quantum Learning Based Nonrandom Superimposed Coding for Secure Wireless Access in 5G URLLC0
Activity Recognition with Moving Cameras and Few Training Examples: Applications for Detection of Autism-Related Headbanging0
Joint User Activity and Data Detection in Grant-Free NOMA using Generative Neural Networks0
AV Taris: Online Audio-Visual Speech RecognitionCode1
Show:102550
← PrevPage 5 of 8Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1CNN-BiLSTM_bestROC-AUC95.14Unverified
2CNN-BiLSTM_smallROC-AUC95.13Unverified
3SG-VAD (ours)ROC-AUC94.3Unverified
4ADA-VADROC-AUC79.1Unverified