SOTAVerified

Activity Detection

Detecting activities in extended videos.

Papers

Showing 150 of 380 papers

TitleStatusHype
Moshi: a speech-text foundation model for real-time dialogueCode9
pyannote.audio: neural building blocks for speaker diarizationCode3
audino: A Modern Annotation Tool for Audio and SpeechCode2
Exploiting Temporal Side Information in Massive IoT ConnectivityCode1
NAS-VAD: Neural Architecture Search for Voice Activity DetectionCode1
Harvesting Ambient RF for Presence Detection Through Deep LearningCode1
WASE: Learning When to Attend for Speaker Extraction in Cocktail Party EnvironmentsCode1
Low-Latency Speech Separation Guided Diarization for Telephone ConversationsCode1
Multi-Speaker and Wide-Band Simulated Conversations as Training Data for End-to-End Neural DiarizationCode1
VoxLingua107: a Dataset for Spoken Language RecognitionCode1
Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency FilteringCode1
ROAD: The ROad event Awareness Dataset for Autonomous DrivingCode1
A Hybrid CNN-BiLSTM Voice Activity DetectorCode1
InaGVAD : a Challenging French TV and Radio Corpus Annotated for Speech Activity Detection and Speaker Gender SegmentationCode1
Brouhaha: multi-task training for voice activity detection, speech-to-noise ratio, and C50 room acoustics estimationCode1
AV Taris: Online Audio-Visual Speech RecognitionCode1
HGCN: Harmonic gated compensation network for speech enhancementCode1
Learning spectro-temporal representations of complex sounds with parameterized neural networksCode1
Classification of Abnormal Hand Movement for Aiding in Autism Detection: Machine Learning StudyCode1
Online speaker diarization of meetings guided by speech separationCode1
Speaker Diarization with Overlapping Community Detection Using Graph Attention Networks and Label Propagation AlgorithmCode1
VANPY: Voice Analysis FrameworkCode1
X-Vector based voice activity detection for multi-genre broadcast speech-to-textCode1
End-to-end speaker segmentation for overlap-aware resegmentationCode1
TS-SEP: Joint Diarization and Separation Conditioned on Estimated Speaker EmbeddingsCode1
SG-VAD: Stochastic Gates Based Speech Activity DetectionCode1
MM-ALT: A Multimodal Automatic Lyric Transcription SystemCode1
Multitask Detection of Speaker Changes, Overlapping Speech and Voice Activity Using wav2vec 2.0Code1
WiFi CSI Based Temporal Activity Detection via Dual Pyramid NetworkCode1
ivrit.ai: A Comprehensive Dataset of Hebrew Speech for AI Research and DevelopmentCode1
A semi-supervised methodology for fishing activity detection using the geometry behind the trajectory of multiple vesselsCode1
AVASpeech-SMAD: A Strongly Labelled Speech and Music Activity Detection Dataset with Label Co-OccurrenceCode1
BERTraffic: BERT-based Joint Speaker Role and Speaker Change Detection for Air Traffic Control CommunicationsCode1
An End-to-End Architecture for Keyword Spotting and Voice Activity DetectionCode1
An Acoustic Emission Activity Detection Method based on Short-Term Waveform Features: Application to Metallic Components under Uniaxial Tensile Test0
Algorithm Unrolling for Massive Access via Deep Neural Network with Theoretical Guarantee0
Access Delay Constrained Activity Detection in Massive Random Access0
A Hybrid Graph Network for Complex Activity Detection in Video0
Activity Detection for Grant-Free NOMA in Massive IoT Networks0
AAD: Adaptive Anomaly Detection through traffic surveillance videos0
Activity Detection And Modeling Using Smart Meter Data: Concept And Case Studies0
A Flexible Framework for Grant-Free Random Access in Cell-Free Massive MIMO Systems0
Attention Is Not Always the Answer: Optimizing Voice Activity Detection with Simple Feature Fusion0
Active Privacy-Utility Trade-off Against Inference in Time-Series Data Sharing0
Argus++: Robust Real-time Activity Detection for Unconstrained Video Streams with Overlapping Cube Proposals0
Advanced Image Segmentation Techniques for Neural Activity Detection via C-fos Immediate Early Gene Expression0
Accelerating Coordinate Descent via Active Set Selection for Device Activity Detection for Multi-Cell Massive Random Access0
A Time-Frequency based Suspicious Activity Detection for Anti-Money Laundering0
Array Configuration-Agnostic Personal Voice Activity Detection Based on Spatial Coherence0
A Unified Deep Learning Framework for Short-Duration Speaker Verification in Adverse Environments0
Show:102550
← PrevPage 1 of 8Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1CNN-BiLSTM_bestROC-AUC95.14Unverified
2CNN-BiLSTM_smallROC-AUC95.13Unverified
3SG-VAD (ours)ROC-AUC94.3Unverified
4ADA-VADROC-AUC79.1Unverified