Moshi: a speech-text foundation model for real-time dialogue Sep 17, 2024 Action Detection Activity Detection
Code Code Available 9OpenTAD: A Unified Framework and Comprehensive Study of Temporal Action Detection Feb 27, 2025 Action Detection Benchmarking
Code Code Available 3Harnessing Temporal Causality for Advanced Temporal Action Detection Jul 25, 2024 Action Detection Action Recognition
Code Code Available 3Efficient Video Action Detection with Token Dropout and Context Refinement Apr 17, 2023 Action Detection Decoder
Code Code Available 3pyannote.audio: neural building blocks for speaker diarization Nov 4, 2019 Action Detection Activity Detection
Code Code Available 3YOWOv3: An Efficient and Generalized Framework for Human Action Detection and Recognition Aug 5, 2024 Action Detection
Code Code Available 2TIM: A Time Interval Machine for Audio-Visual Action Recognition Apr 8, 2024 Action Detection Action Recognition
Code Code Available 2UniMD: Towards Unifying Moment Retrieval and Temporal Action Detection Apr 7, 2024 Action Detection Moment Queries
Code Code Available 2End-to-End Temporal Action Detection with 1B Parameters Across 1000 Frames Nov 28, 2023 Action Detection Temporal Action Localization
Code Code Available 2Temporal Action Localization with Enhanced Instant Discriminability Sep 11, 2023 Action Detection Action Localization
Code Code Available 2Act3D: 3D Feature Field Transformers for Multi-Task Robotic Manipulation Jun 30, 2023 Action Detection Pose Prediction
Code Code Available 2TriDet: Temporal Action Detection with Relative Boundary Modeling Mar 13, 2023 Action Detection Temporal Action Localization
Code Code Available 2YOWOv2: A Stronger yet Efficient Multi-level Detection Framework for Real-time Spatio-temporal Action Detection Feb 14, 2023 Action Detection
Code Code Available 2Structured Attention Composition for Temporal Action Localization May 20, 2022 Action Detection Action Localization
Code Code Available 2Colar: Effective and Efficient Online Action Detection by Consulting Exemplars Mar 2, 2022 Action Detection Online Action Detection
Code Code Available 2audino: A Modern Annotation Tool for Audio and Speech Jun 9, 2020 Action Detection Activity Detection
Code Code Available 2Temporal Action Detection with Structured Segment Networks Apr 20, 2017 Action Detection Action Recognition
Code Code Available 2Speaker Diarization with Overlapping Community Detection Using Graph Attention Networks and Label Propagation Algorithm Jun 3, 2025 Action Detection Activity Detection
Code Code Available 1DiGIT: Multi-Dilated Gated Encoder and Central-Adjacent Region Integrated Decoder for Temporal Action Detection Transformer May 9, 2025 Action Detection Decoder
Code Code Available 1Context-Enhanced Memory-Refined Transformer for Online Action Detection Mar 24, 2025 Action Detection Decoder
Code Code Available 1VANPY: Voice Analysis Framework Feb 17, 2025 Action Detection Activity Detection
Code Code Available 1Preventing Rogue Agents Improves Multi-Agent Collaboration Feb 9, 2025 Action Detection
Code Code Available 1Training-Free Zero-Shot Temporal Action Detection with Vision-Language Models Jan 23, 2025 Action Detection Pseudo Label
Code Code Available 1MS-Temba : Multi-Scale Temporal Mamba for Efficient Temporal Action Detection Jan 10, 2025 Action Detection GPU
Code Code Available 1WiFi CSI Based Temporal Activity Detection via Dual Pyramid Network Dec 19, 2024 Action Detection Action Recognition
Code Code Available 1USDRL: Unified Skeleton-Based Dense Representation Learning with Multi-Grained Feature Decorrelation Dec 12, 2024 Action Detection Action Recognition
Code Code Available 1Exploiting VLM Localizability and Semantics for Open Vocabulary Action Detection Nov 17, 2024 Action Detection Open Vocabulary Action Detection
Code Code Available 1Towards Student Actions in Classroom Scenes: New Dataset and Baseline Sep 2, 2024 Action Detection Benchmarking
Code Code Available 1ActionSwitch: Class-agnostic Detection of Simultaneous Actions in Streaming Videos Jul 17, 2024 Action Detection Action Localization
Code Code Available 1MMAD: Multi-label Micro-Action Detection in Videos Jul 7, 2024 Action Analysis Action Detection
Code Code Available 1DyFADet: Dynamic Feature Aggregation for Temporal Action Detection Jul 3, 2024 Action Detection Dynamic neural networks
Code Code Available 1InaGVAD : a Challenging French TV and Radio Corpus Annotated for Speech Activity Detection and Speaker Gender Segmentation Jun 6, 2024 Action Detection Activity Detection
Code Code Available 1No Time to Waste: Squeeze Time into Channel for Mobile Video Understanding May 14, 2024 Action Detection GPU
Code Code Available 1TE-TAD: Towards Full End-to-End Temporal Action Detection via Time-Aligned Coordinate Expression Apr 3, 2024 Action Detection object-detection
Code Code Available 1Benchmarking the Robustness of Temporal Action Detection Models Against Temporal Corruptions Mar 29, 2024 Action Detection Benchmarking
Code Code Available 1Online speaker diarization of meetings guided by speech separation Jan 30, 2024 Action Detection Activity Detection
Code Code Available 1Glance and Focus: Memory Prompting for Multi-Event Video Question Answering Jan 3, 2024 Action Detection Human-Object Interaction Detection
Code Code Available 1Generative Model-based Feature Knowledge Distillation for Action Recognition Dec 14, 2023 Action Detection Action Recognition
Code Code Available 1Adapting Short-Term Transformers for Action Detection in Untrimmed Videos Dec 4, 2023 Action Detection Video Recognition
Code Code Available 1ChimpACT: A Longitudinal Dataset for Understanding Chimpanzee Behaviors Oct 25, 2023 Action Detection Pose Estimation
Code Code Available 1COMEDIAN: Self-Supervised Learning and Knowledge Distillation for Action Spotting using Transformers Sep 3, 2023 Action Detection Action Spotting
Code Code Available 1Memory-and-Anticipation Transformer for Online Action Understanding Aug 15, 2023 Action Detection Action Understanding
Code Code Available 1ivrit.ai: A Comprehensive Dataset of Hebrew Speech for AI Research and Development Jul 17, 2023 Action Detection Activity Detection
Code Code Available 1Multi-Granularity Hand Action Detection Jun 19, 2023 Action Detection Action Localization
Code Code Available 1E2E-LOAD: End-to-End Long-form Online Action Detection Jun 13, 2023 Action Detection Form
Code Code Available 1WEAR: An Outdoor Sports Dataset for Wearable and Egocentric Activity Recognition Apr 11, 2023 Action Detection Action Localization
Code Code Available 1Interaction-Aware Prompting for Zero-Shot Spatio-Temporal Action Detection Apr 10, 2023 Action Detection Language Modeling
Code Code Available 1DiffTAD: Temporal Action Detection with Proposal Denoising Diffusion Mar 27, 2023 Action Detection Decoder
Code Code Available 1TS-SEP: Joint Diarization and Separation Conditioned on Estimated Speaker Embeddings Mar 7, 2023 Action Detection Activity Detection
Code Code Available 1MiniROAD: Minimal RNN Framework for Online Action Detection Jan 1, 2023 Action Detection Online Action Detection
Code Code Available 1