SOTAVerified

Activity Detection

Detecting activities in extended videos.

Papers

Showing 51–75 of 380 papers

Title | Status | Hype
Evaluation of real-time transcriptions using end-to-end ASR models | - | 0
Blind User Activity Detection for Grant-Free Random Access in Cell-Free mMIMO Networks | - | 0
Long-Term Conversation Analysis: Privacy-Utility Trade-off under Noise and Reverberation | - | 0
TokenVerse: Towards Unifying Speech and NLP Tasks via Transducer-based ASR | Code | 0
Automatic Speech Recognition for Hindi | - | 0
Blending LLMs into Cascaded Speech Translation: KIT's Offline Speech Translation System for IWSLT 2024 | - | 0
AnimalFormer: Multimodal Vision Framework for Behavior-based Precision Livestock Farming | - | 0
Comparative Analysis of Personalized Voice Activity Detection Systems: Assessing Real-World Effectiveness | - | 0
Vessel Re-identification and Activity Detection in Thermal Domain for Maritime Surveillance | - | 0
Deep Learning-Based Approach for User Activity Detection with Grant-Free Random Access in Cell-Free Massive MIMO | - | 0
InaGVAD : a Challenging French TV and Radio Corpus Annotated for Speech Activity Detection and Speaker Gender Segmentation | Code | 1
Precise Analysis of Covariance Identifiability for Activity Detection in Grant-Free Random Access | - | 0
A Real-Time Voice Activity Detection Based On Lightweight Neural | - | 0
Speaker Embeddings With Weakly Supervised Voice Activity Detection For Efficient Speaker Diarization | - | 0
Whispy: Adapting STT Whisper Models to Real-Time Environments | - | 0
Activity Detection for Massive Random Access using Covariance-based Matching Pursuit | - | 0
FAD-SAR: A Novel Fishing Activity Detection System via Synthetic Aperture Radar Images Based on Deep Learning Method | - | 0
A Customer Level Fraudulent Activity Detection Benchmark for Enhancing Machine Learning Model Research and Evaluation | - | 0
Leveraging 3D LiDAR Sensors to Enable Enhanced Urban Safety and Public Health: Pedestrian Monitoring and Abnormal Activity Detection | - | 0
Deep Learning-Assisted Parallel Interference Cancellation for Grant-Free NOMA in Machine-Type Communication | - | 0
Improving Speaker Assignment in Speaker-Attributed ASR for Real Meeting Applications | - | 0
sVAD: A Robust, Low-Power, and Light-Weight Voice Activity Detection with Spiking Neural Networks | - | 0
Fast Low-parameter Video Activity Localization in Collaborative Learning Environments | - | 0
Joint Activity-Delay Detection and Channel Estimation for Asynchronous Massive Random Access: A Free Probability Theory Approach | - | 0
Channel-Combination Algorithms for Robust Distant Voice Activity and Overlapped Speech Detection | - | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | CNN-BiLSTM_best | ROC-AUC | 95.14 | - | Unverified
2 | CNN-BiLSTM_small | ROC-AUC | 95.13 | - | Unverified
3 | SG-VAD (ours) | ROC-AUC | 94.3 | - | Unverified
4 | ADA-VAD | ROC-AUC | 79.1 | - | Unverified