Moshi: a speech-text foundation model for real-time dialogue | Sep 17, 2024 | Action Detection, Activity Detection | Code Available 95
pyannote.audio: neural building blocks for speaker diarization | Nov 4, 2019 | Action Detection, Activity Detection | Code Available 35
OpenTAD: A Unified Framework and Comprehensive Study of Temporal Action Detection | Feb 27, 2025 | Action Detection, Benchmarking | Code Available 35
Efficient Video Action Detection with Token Dropout and Context Refinement | Apr 17, 2023 | Action Detection, Decoder | Code Available 35
Harnessing Temporal Causality for Advanced Temporal Action Detection | Jul 25, 2024 | Action Detection, Action Recognition | Code Available 35
UniMD: Towards Unifying Moment Retrieval and Temporal Action Detection | Apr 7, 2024 | Action Detection, Moment Queries | Code Available 25
Temporal Action Detection with Structured Segment Networks | Apr 20, 2017 | Action Detection, Action Recognition | Code Available 25
YOWOv2: A Stronger yet Efficient Multi-level Detection Framework for Real-time Spatio-temporal Action Detection | Feb 14, 2023 | Action Detection | Code Available 25
Temporal Action Localization with Enhanced Instant Discriminability | Sep 11, 2023 | Action Detection, Action Localization | Code Available 25
Act3D: 3D Feature Field Transformers for Multi-Task Robotic Manipulation | Jun 30, 2023 | Action Detection, Pose Prediction | Code Available 25
TriDet: Temporal Action Detection with Relative Boundary Modeling | Mar 13, 2023 | Action Detection, Temporal Action Localization | Code Available 25
audino: A Modern Annotation Tool for Audio and Speech | Jun 9, 2020 | Action Detection, Activity Detection | Code Available 25
YOWOv3: An Efficient and Generalized Framework for Human Action Detection and Recognition | Aug 5, 2024 | Action Detection | Code Available 25
Structured Attention Composition for Temporal Action Localization | May 20, 2022 | Action Detection, Action Localization | Code Available 25
Colar: Effective and Efficient Online Action Detection by Consulting Exemplars | Mar 2, 2022 | Action Detection, Online Action Detection | Code Available 25
End-to-End Temporal Action Detection with 1B Parameters Across 1000 Frames | Nov 28, 2023 | Action Detection, Temporal Action Localization | Code Available 25
TIM: A Time Interval Machine for Audio-Visual Action Recognition | Apr 8, 2024 | Action Detection, Action Recognition | Code Available 25
End-to-End Semi-Supervised Learning for Video Action Detection | Mar 8, 2022 | Action Detection, Classification Consistency | Code Available 15
E^2TAD: An Energy-Efficient Tracking-based Action Detector | Apr 9, 2022 | Action Detection, Action Localization | Code Available 15
End-to-end speaker segmentation for overlap-aware resegmentation | Apr 8, 2021 | Action Detection, Activity Detection | Code Available 15
DyFADet: Dynamic Feature Aggregation for Temporal Action Detection | Jul 3, 2024 | Action Detection, Dynamic neural networks | Code Available 15
BasicTAD: an Astounding RGB-Only Baseline for Temporal Action Detection | May 5, 2022 | Action Detection, object-detection | Code Available 15
Adapting Short-Term Transformers for Action Detection in Untrimmed Videos | Dec 4, 2023 | Action Detection, Video Recognition | Code Available 15
DiGIT: Multi-Dilated Gated Encoder and Central-Adjacent Region Integrated Decoder for Temporal Action Detection Transformer | May 9, 2025 | Action Detection, Decoder | Code Available 15
AVASpeech-SMAD: A Strongly Labelled Speech and Music Activity Detection Dataset with Label Co-Occurrence | Nov 2, 2021 | Action Detection, Activity Detection | Code Available 15
AVA: A Video Dataset of Spatio-temporally Localized Atomic Visual Actions | May 23, 2017 | Action Detection | Code Available 15
A Benchmark for Structured Procedural Knowledge Extraction from Cooking Videos | May 2, 2020 | Action Detection, Form | Code Available 15
Actions as Moving Points | Jan 14, 2020 | Action Detection, Action Recognition | Code Available 15
E2E-LOAD: End-to-End Long-form Online Action Detection | Jun 13, 2023 | Action Detection, Form | Code Available 15
AV Taris: Online Audio-Visual Speech Recognition | Dec 14, 2020 | Action Detection, Activity Detection | Code Available 15
End-to-end Temporal Action Detection with Transformer | Jun 18, 2021 | Action Detection, Temporal Action Localization | Code Available 15
Continuous control with deep reinforcement learning | Sep 9, 2015 | Action Detection, continuous-control | Code Available 15
Context-Aware RCNN: A Baseline for Action Detection in Videos | Jul 20, 2020 | Action Detection, Action Recognition | Code Available 15
Coupling Intent and Action for Pedestrian Crossing Behavior Prediction | May 10, 2021 | Action Detection, Autonomous Vehicles | Code Available 15
An End-to-End Architecture for Keyword Spotting and Voice Activity Detection | Nov 28, 2016 | Action Detection, Activity Detection | Code Available 15
An Empirical Study of End-to-End Temporal Action Detection | Apr 6, 2022 | Action Classification, Action Detection | Code Available 15
COMEDIAN: Self-Supervised Learning and Knowledge Distillation for Action Spotting using Transformers | Sep 3, 2023 | Action Detection, Action Spotting | Code Available 15
DCAN: Improving Temporal Action Detection via Dual Context Aggregation | Dec 7, 2021 | Action Detection, Temporal Action Localization | Code Available 15
BSN: Boundary Sensitive Network for Temporal Action Proposal Generation | Jun 8, 2018 | Action Detection, Temporal Action Localization | Code Available 15
Classification of Abnormal Hand Movement for Aiding in Autism Detection: Machine Learning Study | Aug 18, 2021 | Action Detection, Activity Detection | Code Available 15
A Multigrid Method for Efficiently Training Video Models | Dec 2, 2019 | Action Detection, Action Recognition | Code Available 15
AOE-Net: Entities Interactions Modeling with Adaptive Attention Mechanism for Temporal Action Proposals Generation | Oct 5, 2022 | Action Detection, Temporal Action Proposal Generation | Code Available 15
A semi-supervised methodology for fishing activity detection using the geometry behind the trajectory of multiple vessels | Jul 12, 2022 | Action Detection, Activity Detection | Code Available 15
Actor-Context-Actor Relation Network for Spatio-Temporal Action Localization | Jun 14, 2020 | Action Detection, Action Localization | Code Available 15
Context-Enhanced Memory-Refined Transformer for Online Action Detection | Mar 24, 2025 | Action Detection, Decoder | Code Available 15
Continual Transformers: Redundancy-Free Attention for Online Inference | Jan 17, 2022 | Action Detection, Audio Classification | Code Available 15
CBR-Net: Cascade Boundary Refinement Network for Action Detection: Submission to ActivityNet Challenge 2020 (Task 1) | Jun 13, 2020 | Action Detection, Action Localization | Code Available 15
Action detection using a neural network elucidates the genetics of mouse grooming behavior | Mar 17, 2021 | Action Detection, Diversity | Code Available 15
AViD Dataset: Anonymized Videos from Diverse Countries | Jul 10, 2020 | Action Classification, Action Detection | Code Available 15
A Hybrid CNN-BiLSTM Voice Activity Detector | Mar 5, 2021 | Action Detection, Activity Detection | Code Available 15