SOTAVerified

Activity Detection

Detecting activities in extended videos.

Papers

Showing 2650 of 380 papers

TitleStatusHype
NAS-VAD: Neural Architecture Search for Voice Activity DetectionCode1
A Hybrid CNN-BiLSTM Voice Activity DetectorCode1
VoxLingua107: a Dataset for Spoken Language RecognitionCode1
VANPY: Voice Analysis FrameworkCode1
Speaker Diarization with Overlapping Community Detection Using Graph Attention Networks and Label Propagation AlgorithmCode1
ivrit.ai: A Comprehensive Dataset of Hebrew Speech for AI Research and DevelopmentCode1
A semi-supervised methodology for fishing activity detection using the geometry behind the trajectory of multiple vesselsCode1
Low-Latency Speech Separation Guided Diarization for Telephone ConversationsCode1
An End-to-End Architecture for Keyword Spotting and Voice Activity DetectionCode1
Personal VAD: Speaker-Conditioned Voice Activity DetectionCode0
Activity Detection for Massive Connectivity in Cell-free Networks with Unknown Large-scale Fading, Channel Statistics, Noise Variance, and Activity Probability: A Bayesian ApproachCode0
Pre-Equalization Aided Grant-Free Massive Access in Massive MIMO SystemCode0
Optimizing Large Language Models for ESG Activity Detection in Financial TextsCode0
Personalized Activity Recognition with Deep Triplet EmbeddingsCode0
Protest Activity Detection and Perceived Violence Estimation from Social Media ImagesCode0
A Framework for Adapting Human-Robot Interaction to Diverse User GroupsCode0
Adversarial Multi-Task Deep Learning for Noise-Robust Voice Activity Detection with Low Algorithmic DelayCode0
Long-term Conversation Analysis: Exploring Utility and PrivacyCode0
Argus: Efficient Activity Detection System for Extended Video AnalysisCode0
Integrating Emotion Recognition with Speech Recognition and Speaker Diarisation for ConversationsCode0
A Pursuit of Temporal Accuracy in General Activity DetectionCode0
Learning Latent Super-Events to Detect Multiple Activities in VideosCode0
Fine-Grained Classroom Activity Detection from Audio with Neural NetworksCode0
A Convolutional Neural Network Smartphone App for Real-Time Voice Activity DetectionCode0
FunASR: A Fundamental End-to-End Speech Recognition ToolkitCode0
Show:102550
← PrevPage 2 of 16Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1CNN-BiLSTM_bestROC-AUC95.14Unverified
2CNN-BiLSTM_smallROC-AUC95.13Unverified
3SG-VAD (ours)ROC-AUC94.3Unverified
4ADA-VADROC-AUC79.1Unverified