SOTAVerified

Activity Detection

Detecting activities in extended videos.

Papers

Showing 151–200 of 380 papers

| Title | Status | Hype |
| --- | --- | --- |
| Improving Transformer-based End-to-End Speaker Diarization by Assigning Auxiliary Losses to Attention Heads | | 0 |
| Learnable Frontends that do not Learn: Quantifying Sensitivity to Filterbank Initialisation | | 0 |
| The Newsbridge -Telecom SudParis VoxCeleb Speaker Recognition Challenge 2022 System Description | | 0 |
| KIDS: kinematics-based (in)activity detection and segmentation in a sleep case study | | 0 |
| Activity Detection for Grant-Free NOMA in Massive IoT Networks | | 0 |
| Trajectory-User Linking Is Easier Than You Think | | 0 |
| Tackling the Cocktail Fork Problem for Separation and Transcription of Real-World Soundtracks | | 0 |
| BC-VAD: A Robust Bone Conduction Voice Activity Detection | | 0 |
| Proximal Gradient-Based Unfolding for Massive Random Access in IoT Networks | | 0 |
| Joint Estimation of Clustered User Activity and Correlated Channels with Unknown Covariance in mMTC | | 0 |
| Multi-timescale Event Detection in Nonintrusive Load Monitoring based on MDL Principle | | 0 |
| On using the UA-Speech and TORGO databases to validate automatic dysarthric speech classification approaches | | 0 |
| Token Turing Machines | | 0 |
| Two-stream Multi-dimensional Convolutional Network for Real-time Violence Detection | | 0 |
| OFDM-Based Massive Connectivity for LEO Satellite Internet of Things | | 0 |
| Random Utterance Concatenation Based Data Augmentation for Improving Short-video Speech Recognition | | 0 |
| Target-Speaker Voice Activity Detection via Sequence-to-Sequence Prediction | | 0 |
| TSUP Speaker Diarization System for Conversational Short-phrase Speaker Diarization Challenge | | 0 |
| Intel Labs at Ego4D Challenge 2022: A Better Baseline for Audio-Visual Diarization | | 0 |
| The DKU-DukeECE Diarization System for the VoxCeleb Speaker Recognition Challenge 2022 | | 0 |
| Learnable Acoustic Frontends in Bird Activity Detection | | 0 |
| Signed Latent Factors for Spamming Activity Detection | | 0 |
| Joint Speech Activity and Overlap Detection with Multi-Exit Architecture | | 0 |
| The Kriston AI System for the VoxCeleb Speaker Recognition Challenge 2022 | | 0 |
| Cross-domain Voice Activity Detection with Self-Supervised Representations | | 0 |
| GIST-AiTeR System for the Diarization Task of the 2022 VoxCeleb Speaker Recognition Challenge | | 0 |
| Hardware Accelerator and Neural Network Co-Optimization for Ultra-Low-Power Audio Processing Devices | | 0 |
| Target Speaker Voice Activity Detection with Transformers and Its Integration with End-to-End Neural Diarization | | 0 |
| Online Target Speaker Voice Activity Detection for Speaker Diarization | | 0 |
| Fine-grained Activities of People Worldwide | | 0 |
| Tandem Multitask Training of Speaker Diarisation and Speech Recognition for Meeting Transcription | | 0 |
| Adversarial Multi-Task Deep Learning for Noise-Robust Voice Activity Detection with Low Algorithmic Delay | Code | 0 |
| An AIoT-enabled Autonomous Dementia Monitoring System | | 0 |
| Multi-channel end-to-end neural network for speech enhancement, source localization, and voice activity detection | | 0 |
| Simultaneous Speech Extraction for Multiple Target Speakers under the Meeting Scenarios | | 0 |
| RIS Assisted Device Activity Detection with Statistical Channel State Information | | 0 |
| Data-aided Active User Detection with a User Activity Extraction Network for Grant-free SCMA Systems | | 0 |
| A Boosting Algorithm for Positive-Unlabeled Learning | | 0 |
| Double-Sided Information Aided Temporal-Correlated Massive Access | | 0 |
| An Empirical Study on Activity Recognition in Long Surgical Videos | | 0 |
| Ultra-sensitive Flexible Sponge-Sensor Array for Muscle Activities Detection and Human Limb Motion Recognition | | 0 |
| ADA-VAD: Unpaired Adversarial Domain Adaptation for Noise-Robust Voice Activity Detection | | 0 |
| Anomalous Sound Detection Based on Machine Activity Detection | | 0 |
| Automated speech tools for helping communities process restricted-access corpora for language revival efforts | | 0 |
| Personal VAD 2.0: Optimizing Personal Voice Activity Detection for On-Device Speech Recognition | | 0 |
| Gan-Based Joint Activity Detection and Channel Estimation For Grant-free Random Access | Code | 0 |
| Deep Learning for Encrypted Traffic Classification and Unknown Data Detection | | 0 |
| Speaker Embedding-aware Neural Diarization: an Efficient Framework for Overlapping Speech Diarization in Meeting Scenarios | | 0 |
| Human Attention Detection Using AM-FM Representations | | 0 |
| PAMI-AD: An Activity Detector Exploiting Part-attention and Motion Information in Surveillance Videos | | 0 |

Benchmark Results

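| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | CNN-BiLSTM_best | ROC-AUC | 95.14 | | Unverified |
| 2 | CNN-BiLSTM_small | ROC-AUC | 95.13 | | Unverified |
| 3 | SG-VAD (ours) | ROC-AUC | 94.3 | | Unverified |
| 4 | ADA-VAD | ROC-AUC | 79.1 | | Unverified |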