SOTAVerified

Activity Detection

Detecting activities in extended videos.

Papers

Showing 351380 of 380 papers

TitleStatusHype
Evaluation of Noise Reduction Methods for Sentence Recognition by Sinhala Speaking ListenersCode0
Fine-grained Activity Recognition in Baseball VideosCode0
Fine-Grained Classroom Activity Detection from Audio with Neural NetworksCode0
FunASR: A Fundamental End-to-End Speech Recognition ToolkitCode0
Gan-Based Joint Activity Detection and Channel Estimation For Grant-free Random AccessCode0
Integrating Emotion Recognition with Speech Recognition and Speaker Diarisation for ConversationsCode0
Multi-Stage Speaker Diarization for Noisy ClassroomsCode0
Long-term Conversation Analysis: Exploring Utility and PrivacyCode0
Optimizing Large Language Models for ESG Activity Detection in Financial TextsCode0
Personalized Activity Recognition with Deep Triplet EmbeddingsCode0
Personal VAD: Speaker-Conditioned Voice Activity DetectionCode0
Pre-Equalization Aided Grant-Free Massive Access in Massive MIMO SystemCode0
Protest Activity Detection and Perceived Violence Estimation from Social Media ImagesCode0
R-C3D: Region Convolutional 3D Network for Temporal Activity DetectionCode0
RespVAD: Voice Activity Detection via Video-Extracted Respiration PatternsCode0
rVAD: An Unsupervised Segment-Based Robust Voice Activity Detection MethodCode0
S3D: Single Shot multi-Span Detector via Fully 3D Convolutional NetworksCode0
Weakly-guided Self-supervised Pretraining for Temporal Activity DetectionCode0
Speaker Embedding-aware Neural Diarization for Flexible Number of Speakers with Textual InformationCode0
Speaker Embedding-aware Neural Diarization: an Efficient Framework for Overlapping Speech Diarization in Meeting ScenariosCode0
Structure-Aware Convolutional Neural NetworksCode0
Temporal Activity Detection in Untrimmed Videos with Recurrent Neural NetworksCode0
Temporal Gaussian Mixture Layer for VideosCode0
The Second DIHARD Diarization Challenge: Dataset, task, and baselinesCode0
Token Turing MachinesCode0
TokenVerse: Towards Unifying Speech and NLP Tasks via Transducer-based ASRCode0
Toyota Smarthome Untrimmed: Real-World Untrimmed Videos for Activity DetectionCode0
Unstructured Human Activity Detection from RGBD ImagesCode0
Untrimmed Video Classification for Activity Detection: submission to ActivityNet ChallengeCode0
When do they StOP?: A First Step Towards Automatically Identifying Team Communication in the Operating RoomCode0
Show:102550
← PrevPage 8 of 8Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1CNN-BiLSTM_bestROC-AUC95.14Unverified
2CNN-BiLSTM_smallROC-AUC95.13Unverified
3SG-VAD (ours)ROC-AUC94.3Unverified
4ADA-VADROC-AUC79.1Unverified