Video-XL: Extra-Long Vision Language Model for Hour-Scale Video Understanding Sep 22, 2024 Anomaly Detection GPU
Code Code Available 45 An Egocentric Vision-Language Model based Portable Real-time Smart Assistant Mar 6, 2025 Language Modeling Language Modelling
Code Code Available 25 VTG-LLM: Integrating Timestamp Knowledge into Video LLMs for Enhanced Video Temporal Grounding May 22, 2024 Dense Video Captioning Highlight Detection
Code Code Available 25 Egocentric Video-Language Pretraining Jun 3, 2022 Action Recognition Contrastive Learning
Code Code Available 25 VideoSAGE: Video Summarization with Graph Representation Learning Apr 14, 2024 Graph Representation Learning Node Classification
Code Code Available 25 UniVTG: Towards Unified Video-Language Temporal Grounding Jul 31, 2023 Highlight Detection Moment Retrieval
Code Code Available 25 ANIM-400K: A Large-Scale Dataset for Automated End-To-End Dubbing of Video Jan 10, 2024 Video Summarization
Code Code Available 25 Combining Global and Local Attention with Positional Encoding for Video Summarization Dec 1, 2021 Supervised Video Summarization Video Summarization
Code Code Available 15 Learning Discriminative Prototypes with Dynamic Time Warping Mar 17, 2021 Action Segmentation Dynamic Time Warping
Code Code Available 15 Video Repurposing from User Generated Content: A Large-scale Dataset and Benchmark Dec 12, 2024 Highlight Detection Video Summarization
Code Code Available 15 Multi-modal Summarization for Video-containing Documents Sep 17, 2020 Question Answering Video Summarization
Code Code Available 15 Adopting Self-Supervised Learning into Unsupervised Video Summarization through Restorative Score. Sep 11, 2023 Self-Supervised Learning Unsupervised Video Summarization
Code Code Available 15 Joint Moment Retrieval and Highlight Detection Via Natural Language Queries May 8, 2023 Decoder Highlight Detection
Code Code Available 15 Hierarchical Video-Moment Retrieval and Step-Captioning Mar 29, 2023 Information Retrieval Moment Retrieval
Code Code Available 15 Progressive Video Summarization via Multimodal Self-supervised Learning Jan 7, 2022 Self-Supervised Learning Supervised Video Summarization
Code Code Available 15 TRECVID 2020: A comprehensive campaign for evaluating video retrieval tasks across multiple application domains Apr 27, 2021 Ad-hoc video search Instance Search
Code Code Available 15 Unsupervised Video Summarization via Multi-source Features May 26, 2021 Unsupervised Video Summarization Video Summarization
Code Code Available 15 DSNet: A Flexible Detect-to-Summarize Network for Video Summarization Dec 1, 2020 regression Supervised Video Summarization
Code Code Available 15 Supervised Video Summarization via Multiple Feature Sets with Parallel Attention Apr 23, 2021 Automated Feature Engineering image-classification
Code Code Available 15 Adopting Self-Supervised Learning into Unsupervised Video Summarization through Restorative Score Sep 11, 2023 Self-Supervised Learning Unsupervised Video Summarization
Code Code Available 15 Video Joint Modelling Based on Hierarchical Transformer for Co-summarization Dec 27, 2021 Retrieval Supervised Video Summarization
Code Code Available 15 VideoSum: A Python Library for Surgical Video Summarization Feb 15, 2023 Video Summarization
Code Code Available 15 MHSCNet: A Multimodal Hierarchical Shot-aware Convolutional Network for Video Summarization Apr 18, 2022 Video Summarization
Code Code Available 15 LTC-SUM: Lightweight Client-driven Personalized Video Summarization Framework Using 2D CNN Jan 22, 2022 Video Summarization
Code Code Available 15 Shot2Story20K: A New Benchmark for Comprehensive Understanding of Multi-shot Videos Dec 16, 2023 Video Captioning video narration captioning
Code Code Available 15 Movie Summarization via Sparse Graph Construction Dec 14, 2020 graph construction Turning Point Identification
Code Code Available 15 Multimodal Summarization of User-Generated Videos Jun 5, 2021 Video Summarization
Code Code Available 15 Query-controllable Video Summarization Apr 7, 2020 Video Summarization
Code Code Available 15 MMSum: A Dataset for Multimodal Summarization and Thumbnail Generation of Videos Jun 7, 2023 Text Summarization Video Summarization
Code Code Available 15 Align and Attend: Multimodal Summarization with Dual Contrastive Losses Mar 13, 2023 Extractive Text Summarization Supervised Video Summarization
Code Code Available 15 Shotluck Holmes: A Family of Efficient Small-Scale Large Language Vision Models For Video Captioning and Summarization May 31, 2024 Sentence Video Captioning
Code Code Available 15 EgoVLPv2: Egocentric Video-Language Pre-training with Fusion in the Backbone Jul 11, 2023 Action Recognition Moment Queries
Code Code Available 15 IntentVizor: Towards Generic Query Guided Interactive Video Summarization Sep 30, 2021 Video Summarization Video Understanding
Code Code Available 15 Contrastive Losses Are Natural Criteria for Unsupervised Video Summarization Nov 18, 2022 Diversity image-classification
Code Code Available 15 AC-SUM-GAN: Connecting Actor-Critic and Generative Adversarial Networks for Unsupervised Video Summarization Nov 16, 2020 Generative Adversarial Network Unsupervised Video Summarization
Code Code Available 15 Discriminative Latent Semantic Graph for Video Captioning Aug 8, 2021 Decoder Object
Code Code Available 15 Do Language Models Understand Time? Dec 18, 2024 Action Recognition Anomaly Detection
Code Code Available 15 A Comprehensive Review of the Video-to-Text Problem Mar 27, 2021 Question Answering Retrieval
Code Code Available 15 Convolutional Hierarchical Attention Network for Query-Focused Video Summarization Jan 31, 2020 Query focused video summarization Video Summarization
Code Code Available 15 Self-Attention Recurrent Summarization Network with Reinforcement Learning for Video Summarization Task Jun 9, 2021 reinforcement-learning Reinforcement Learning
Code Code Available 15 Summarizing Videos using Concentrated Attention and Considering the Uniqueness and Diversity of the Video Frames Jun 29, 2022 Benchmarking Diversity
Code Code Available 15 Ultrasound Video Summarization using Deep Reinforcement Learning May 19, 2020 Deep Reinforcement Learning Diagnostic
Code Code Available 15 VideoXum: Cross-modal Visual and Textural Summarization of Videos Mar 21, 2023 Text Summarization Video Summarization
Code Code Available 15 Multi-Stream Dynamic Video Summarization Dec 1, 2018 Video Summarization
Code Code Available 05 Query-adaptive Video Summarization via Quality-aware Relevance Estimation May 1, 2017 Diversity Video Summarization
Code Code Available 05 Adaptive frame selection in two dimensional convolutional neural network action recognition Dec 28, 2022 Action Recognition Video Summarization
Code Code Available 05 A Stepwise, Label-based Approach for Improving the Adversarial Training in Unsupervised Video Summarization Oct 21, 2019 Benchmarking Unsupervised Video Summarization
Code Code Available 05 APES: Audiovisual Person Search in Untrimmed Video Jun 3, 2021 Person Retrieval Person Search
Code Code Available 05 Iterative Projection and Matching: Finding Structure-preserving Representatives and Its Application to Computer Vision Nov 29, 2018 Action Recognition Active Learning
Code Code Available 05 R^2-Tuning: Efficient Image-to-Video Transfer Learning for Video Temporal Grounding Apr 2, 2024 Highlight Detection Moment Retrieval
Code Code Available 05