SOTAVerified

Activity Detection

Detecting activities in extended videos.

Papers

Showing 151200 of 380 papers

TitleStatusHype
Joint Speech Activity and Overlap Detection with Multi-Exit Architecture0
The Kriston AI System for the VoxCeleb Speaker Recognition Challenge 20220
Cross-domain Voice Activity Detection with Self-Supervised Representations0
GIST-AiTeR System for the Diarization Task of the 2022 VoxCeleb Speaker Recognition Challenge0
Hardware Accelerator and Neural Network Co-Optimization for Ultra-Low-Power Audio Processing Devices0
Target Speaker Voice Activity Detection with Transformers and Its Integration with End-to-End Neural Diarization0
Online Target Speaker Voice Activity Detection for Speaker Diarization0
MM-ALT: A Multimodal Automatic Lyric Transcription SystemCode1
A semi-supervised methodology for fishing activity detection using the geometry behind the trajectory of multiple vesselsCode1
Fine-grained Activities of People Worldwide0
Tandem Multitask Training of Speaker Diarisation and Speech Recognition for Meeting Transcription0
Adversarial Multi-Task Deep Learning for Noise-Robust Voice Activity Detection with Low Algorithmic DelayCode0
An AIoT-enabled Autonomous Dementia Monitoring System0
Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency FilteringCode1
Multi-channel end-to-end neural network for speech enhancement, source localization, and voice activity detection0
Simultaneous Speech Extraction for Multiple Target Speakers under the Meeting Scenarios0
RIS Assisted Device Activity Detection with Statistical Channel State Information0
Data-aided Active User Detection with a User Activity Extraction Network for Grant-free SCMA Systems0
A Boosting Algorithm for Positive-Unlabeled Learning0
Double-Sided Information Aided Temporal-Correlated Massive Access0
An Empirical Study on Activity Recognition in Long Surgical Videos0
Ultra-sensitive Flexible Sponge-Sensor Array for Muscle Activities Detection and Human Limb Motion Recognition0
ADA-VAD: Unpaired Adversarial Domain Adaptation for Noise-Robust Voice Activity Detection0
Anomalous Sound Detection Based on Machine Activity Detection0
Automated speech tools for helping communities process restricted-access corpora for language revival efforts0
Personal VAD 2.0: Optimizing Personal Voice Activity Detection for On-Device Speech Recognition0
Low-Latency Speech Separation Guided Diarization for Telephone ConversationsCode1
Gan-Based Joint Activity Detection and Channel Estimation For Grant-free Random AccessCode0
Deep Learning for Encrypted Traffic Classification and Unknown Data Detection0
Speaker Embedding-aware Neural Diarization: an Efficient Framework for Overlapping Speech Diarization in Meeting Scenarios0
Human Attention Detection Using AM-FM Representations0
PAMI-AD: An Activity Detector Exploiting Part-attention and Motion Information in Surveillance Videos0
Random Access with Massive MIMO-OTFS in LEO Satellite Communications0
VADOI:Voice-Activity-Detection Overlapping Inference For End-to-end Long-form Speech Recognition0
Active Privacy-Utility Trade-off Against Inference in Time-Series Data Sharing0
The USTC-Ximalaya system for the ICASSP 2022 multi-channel multi-party meeting transcription (M2MeT) challenge0
Cross-Channel Attention-Based Target Speaker Voice Activity Detection: Experimental Results for M2MeT Challenge0
The CUHK-TENCENT speaker diarization system for the ICASSP 2022 multi-channel multi-party meeting transcription challenge0
HGCN: Harmonic gated compensation network for speech enhancementCode1
NAS-VAD: Neural Architecture Search for Voice Activity DetectionCode1
Argus++: Robust Real-time Activity Detection for Unconstrained Video Streams with Overlapping Cube Proposals0
Egocentric Deep Multi-Channel Audio-Visual Active Speaker Localization0
Exploiting Temporal Side Information in Massive IoT ConnectivityCode1
Merry Go Round: Rotate a Frame and Fool a DNN0
Binary Image Skeletonization Using 2-Stage U-Net0
Low Resource Species Agnostic Bird Activity Detection0
X-Vector based voice activity detection for multi-genre broadcast speech-to-textCode1
User Activity Detection and Channel Estimation of Spatially Correlated Channels via AMP in Massive MTC0
Learning Proximal Operator Methods for Massive Connectivity in IoT Networks0
Reformulating Zero-shot Action Recognition for Multi-label Actions0
Show:102550
← PrevPage 4 of 8Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1CNN-BiLSTM_bestROC-AUC95.14Unverified
2CNN-BiLSTM_smallROC-AUC95.13Unverified
3SG-VAD (ours)ROC-AUC94.3Unverified
4ADA-VADROC-AUC79.1Unverified