InternVideo2: Scaling Foundation Models for Multimodal Video Understanding Mar 22, 2024 Action Classification Action Recognition
Code Code Available 7TimesNet: Temporal 2D-Variation Modeling for General Time Series Analysis Oct 5, 2022 Action Recognition Anomaly Detection
Code Code Available 6DPFlow: Adaptive Optical Flow Estimation with a Dual-Pyramid Framework Mar 19, 2025 8k Action Recognition
Code Code Available 4SAT: Dynamic Spatial Aptitude Training for Multimodal Language Models Dec 10, 2024 Action Recognition Spatial Reasoning
Code Code Available 4Language Model Beats Diffusion -- Tokenizer is Key to Visual Generation Oct 9, 2023 Action Recognition Image Generation
Code Code Available 4InternVideo: General Video Foundation Models via Generative and Discriminative Learning Dec 6, 2022 Action Classification Action Recognition
Code Code Available 4Advances in Multimodal Adaptation and Generalization: From Traditional Approaches to Foundation Models Jan 30, 2025 Action Recognition Domain Adaptation
Code Code Available 3Harnessing Temporal Causality for Advanced Temporal Action Detection Jul 25, 2024 Action Detection Action Recognition
Code Code Available 3Benchmarks and Challenges in Pose Estimation for Egocentric Hand Interactions with Objects Mar 25, 2024 Action Recognition Motion Generation
Code Code Available 3Humans in 4D: Reconstructing and Tracking Humans with Transformers May 31, 2023 3D Human Pose Estimation Action Recognition
Code Code Available 3MotionBERT: A Unified Perspective on Learning Human Motion Representations Oct 12, 2022 3D Human Pose Estimation 3D Pose Estimation
Code Code Available 3Expanding Language-Image Pretrained Models for General Video Recognition Aug 4, 2022 Action Classification Action Recognition
Code Code Available 3A Survey on Video Action Recognition in Sports: Datasets, Methods and Applications Jun 2, 2022 Action Recognition Sports Analytics
Code Code Available 3VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training Mar 23, 2022 4k Action Classification
Code Code Available 3EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks May 28, 2019 Action Recognition Domain Generalization
Code Code Available 3Surg-3M: A Dataset and Foundation Model for Perception in Surgical Settings Mar 25, 2025 4k Action Recognition
Code Code Available 2LLaVAction: evaluating and training multi-modal large language models for action recognition Mar 24, 2025 Action Recognition Action Understanding
Code Code Available 2Revealing Key Details to See Differences: A Novel Prototypical Perspective for Skeleton-based Action Recognition Nov 28, 2024 Action Recognition Skeleton Based Action Recognition
Code Code Available 2AWT: Transferring Vision-Language Models via Augmentation, Weighting, and Transportation Jul 5, 2024 Action Recognition Few-Shot Image Classification
Code Code Available 2EgoVideo: Exploring Egocentric Foundation Model and Downstream Adaptation Jun 26, 2024 Action Anticipation Action Recognition
Code Code Available 2Rethinking Efficient and Effective Point-based Networks for Event Camera Classification and Regression: EventMamba May 9, 2024 Action Recognition Mamba
Code Code Available 2Leveraging Temporal Contextualization for Video Action Recognition Apr 15, 2024 Action Recognition Temporal Action Localization
Code Code Available 2TIM: A Time Interval Machine for Audio-Visual Action Recognition Apr 8, 2024 Action Detection Action Recognition
Code Code Available 2OmniVid: A Generative Framework for Universal Video Understanding Mar 26, 2024 Action Recognition Decoder
Code Code Available 2Understanding Long Videos with Multimodal Language Models Mar 25, 2024 Action Recognition Fine-grained Action Recognition
Code Code Available 2DeGCN: Deformable Graph Convolutional Networks for Skeleton-Based Action Recognition Mar 25, 2024 Action Recognition Skeleton Based Action Recognition
Code Code Available 2vid-TLDR: Training Free Token merging for Light-weight Video Transformer Mar 20, 2024 Action Recognition Computational Efficiency
Code Code Available 2Hierarchical NeuroSymbolic Approach for Comprehensive and Explainable Action Quality Assessment Mar 20, 2024 Action Quality Assessment Action Quality Assessment Report Generation
Code Code Available 2SkateFormer: Skeletal-Temporal Transformer for Human Action Recognition Mar 14, 2024 Action Recognition Human Interaction Recognition
Code Code Available 2Dynamic 3D Point Cloud Sequences as 2D Videos Mar 2, 2024 Action Recognition Self-Supervised Learning
Code Code Available 2Mamba-ND: Selective State Space Modeling for Multi-Dimensional Data Feb 8, 2024 Action Recognition Mamba
Code Code Available 2FROSTER: Frozen CLIP Is A Strong Teacher for Open-Vocabulary Action Recognition Feb 5, 2024 Action Recognition Open Vocabulary Action Recognition
Code Code Available 2BlockGCN: Redefine Topology Awareness for Skeleton-Based Action Recognition Jan 1, 2024 Action Recognition Skeleton Based Action Recognition
Code Code Available 2Hulk: A Universal Knowledge Translator for Human-Centric Tasks Dec 4, 2023 3D Human Pose Estimation Action Recognition
Code Code Available 2Is Weakly-supervised Action Segmentation Ready For Human-Robot Interaction? No, Let's Improve It With Action-union Learning Oct 22, 2023 Action Recognition Action Segmentation
Code Code Available 2Frozen Transformers in Language Models Are Effective Visual Encoder Layers Oct 19, 2023 Action Recognition Image-text Retrieval
Code Code Available 2Valley: Video Assistant with Large Language model Enhanced abilitY Jun 12, 2023 Action Recognition Instruction Following
Code Code Available 2On the Benefits of 3D Pose and Tracking for Human Action Recognition Apr 3, 2023 Action Recognition Temporal Action Localization
Code Code Available 2VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking Mar 29, 2023 Action Classification Action Recognition
Code Code Available 2AIM: Adapting Image Models for Efficient Video Action Recognition Feb 6, 2023 Action Classification Action Recognition
Code Code Available 2Bidirectional Cross-Modal Knowledge Exploration for Video Recognition with Pre-trained Vision-Language Models Dec 31, 2022 Action Classification Action Recognition
Code Code Available 2Learning Video Representations from Large Language Models Dec 8, 2022 Action Classification Action Recognition
Code Code Available 2Deep Architectures for Content Moderation and Movie Content Rating Dec 8, 2022 Action Recognition Genre classification
Code Code Available 2UniFormerV2: Spatiotemporal Learning by Arming Image ViTs with Video UniFormer Sep 22, 2022 Action Classification Action Recognition
Code Code Available 2Revisiting Classifier: Transferring Vision-Language Models for Video Recognition Jul 4, 2022 Action Classification Action Recognition
Code Code Available 2Revealing Single Frame Bias for Video-and-Language Learning Jun 7, 2022 Action Recognition Fine-grained Action Recognition
Code Code Available 2Egocentric Video-Language Pretraining Jun 3, 2022 Action Recognition Contrastive Learning
Code Code Available 2AdaptFormer: Adapting Vision Transformers for Scalable Visual Recognition May 26, 2022 Action Recognition Video Recognition
Code Code Available 2ActionFormer: Localizing Moments of Actions with Transformers Feb 16, 2022 Action Localization Action Recognition
Code Code Available 2HAKE: A Knowledge Engine Foundation for Human Activity Understanding Feb 14, 2022 Action Recognition Human-Object Interaction Detection
Code Code Available 2