TRIM: A Self-Supervised Video Summarization Framework Maximizing Temporal Relative Information and Representativeness Jun 25, 2025 Self-Supervised Learning Supervised Video Summarization
— Unverified 0 MF2Summ: Multimodal Fusion for Video Summarization with Temporal Alignment Jun 12, 2025 Video Summarization
— Unverified 0 Prompts to Summaries: Zero-Shot Language-Guided Video Summarization Jun 12, 2025 GPU Query focused video summarization
— Unverified 0 Enhancing Video Memorability Prediction with Text-Motion Cross-modal Contrastive Loss and Its Application in Video Summarization Jun 10, 2025 Prediction Video Summarization
— Unverified 0 TriPSS: A Tri-Modal Keyframe Extraction Framework Using Perceptual, Structural, and Semantic Representations Jun 3, 2025 Retrieval Video Summarization
— Unverified 0 Unsupervised Transcript-assisted Video Summarization and Highlight Detection May 29, 2025 Highlight Detection Reinforcement Learning (RL)
— Unverified 0 REGen: Multimodal Retrieval-Embedded Generation for Long-to-Short Video Editing May 24, 2025 Language Modeling Language Modelling
— Unverified 0 SD-VSum: A Method and Dataset for Script-Driven Video Summarization May 6, 2025 Video Summarization
Code Code Available 0 Video Summarization with Large Language Models Apr 15, 2025 Large Language Model Video Summarization
— Unverified 0 Automatic Detection of Intro and Credits in Video using CLIP and Multihead Attention Apr 13, 2025 CPU Highlight Detection
— Unverified 0 FaVChat: Unlocking Fine-Grained Facial Video Understanding with Multimodal Large Language Models Mar 12, 2025 Mixture-of-Experts Question Answering
— Unverified 0 A Novel Trustworthy Video Summarization Algorithm Through a Mixture of LoRA Experts Mar 8, 2025 Mixture-of-Experts Video Summarization
— Unverified 0 An Egocentric Vision-Language Model based Portable Real-time Smart Assistant Mar 6, 2025 Language Modeling Language Modelling
Code Code Available 2 Parameter-free Video Segmentation for Vision and Language Understanding Mar 3, 2025 Question Answering Video Question Answering
— Unverified 0 CFSum: A Transformer-Based Multi-Modal Video Summarization Framework With Coarse-Fine Fusion Mar 1, 2025 Video Summarization
— Unverified 0 Integrate the temporal scheme for unsupervised video summarization via attention mechanism Feb 26, 2025 Unsupervised Video Summarization Video Summarization
Code Code Available 0 Reinforcement Learning for Ultrasound Image Analysis A Comprehensive Review of Advances and Applications Feb 20, 2025 Decision Making Deep Reinforcement Learning
— Unverified 0 What Is That Talk About? A Video-to-Text Summarization Dataset for Scientific Presentations Feb 12, 2025 Text Summarization Video Summarization
Code Code Available 0 FullTransNet: Full Transformer with Local-Global Attention for Video Summarization Jan 1, 2025 Decoder Supervised Video Summarization
— Unverified 0 Query-centric Audio-Visual Cognition Network for Moment Retrieval, Segmentation and Step-Captioning Dec 18, 2024 Moment Retrieval Multi-Task Learning
— Unverified 0 Do Language Models Understand Time? Dec 18, 2024 Action Recognition Anomaly Detection
Code Code Available 1 Video Repurposing from User Generated Content: A Large-scale Dataset and Benchmark Dec 12, 2024 Highlight Detection Video Summarization
Code Code Available 1 Agent-based Video Trimming Dec 12, 2024 Highlight Detection Moment Retrieval
— Unverified 0 Video Summarization using Denoising Diffusion Probabilistic Model Dec 11, 2024 Denoising model
— Unverified 0 Personalized Video Summarization by Multimodal Video Understanding Nov 5, 2024 Unsupervised Video Summarization Video Summarization
— Unverified 0 Your Interest, Your Summaries: Query-Focused Long Video Summarization Oct 17, 2024 Query focused video summarization Video Summarization
Code Code Available 0 Exploring Efficient Foundational Multi-modal Models for Video Summarization Oct 9, 2024 Language Modeling Language Modelling
— Unverified 0 Realizing Video Summarization from the Path of Language-based Semantic Understanding Oct 6, 2024 Mixture-of-Experts Video Generation
— Unverified 0 Video Summarization Techniques: A Comprehensive Review Oct 6, 2024 Abstractive Text Summarization Extractive Summarization
— Unverified 0 Does SpatioTemporal information benefit Two video summarization benchmarks? Oct 4, 2024 Activity Recognition Video Summarization
Code Code Available 0 EDSNet: Efficient-DSNet for Video Summarization Sep 23, 2024 Video Summarization
— Unverified 0 Video-XL: Extra-Long Vision Language Model for Hour-Scale Video Understanding Sep 22, 2024 Anomaly Detection GPU
Code Code Available 4 Personalized Video Summarization using Text-Based Queries and Conditional Modeling Aug 27, 2024 Video Summarization Word Embeddings
— Unverified 0 EgoSonics: Generating Synchronized Audio for Silent Egocentric Videos Jul 30, 2024 Audio Synthesis Video Summarization
— Unverified 0 Multimodal Language Models for Domain-Specific Procedural Video Summarization Jul 7, 2024 Video Summarization
— Unverified 0 Unsupervised Video Summarization via Reinforcement Learning and a Trained Evaluator Jul 5, 2024 reinforcement-learning Reinforcement Learning
— Unverified 0 UBiSS: A Unified Framework for Bimodal Semantic Summarization of Videos Jun 24, 2024 Triplet Video Summarization
Code Code Available 0 A Human-Annotated Video Dataset for Training and Evaluation of 360-Degree Video Summarization Methods Jun 5, 2024 Video Summarization
Code Code Available 0 Shotluck Holmes: A Family of Efficient Small-Scale Large Language Vision Models For Video Captioning and Summarization May 31, 2024 Sentence Video Captioning
Code Code Available 1 VTG-LLM: Integrating Timestamp Knowledge into Video LLMs for Enhanced Video Temporal Grounding May 22, 2024 Dense Video Captioning Highlight Detection
Code Code Available 2 CSTA: CNN-based Spatiotemporal Attention for Video Summarization May 20, 2024 Supervised Video Summarization Video Summarization
Code Code Available 0 "Previously on ..." From Recaps to Story Summarization May 19, 2024 Video Summarization
— Unverified 0 An Integrated Framework for Multi-Granular Explanation of Video Summarization May 16, 2024 Benchmarking Panoptic Segmentation
Code Code Available 0 Language-Guided Self-Supervised Video Summarization Using Text Semantic Matching Considering the Diversity of the Video May 14, 2024 Diversity Supervised Video Summarization
— Unverified 0 Pegasus-v1 Technical Report Apr 23, 2024 Language Modeling Language Modelling
— Unverified 0 V2Xum-LLM: Cross-Modal Video Summarization with Temporal Prompt Instruction Tuning Apr 18, 2024 Text Summarization Video Summarization
— Unverified 0 VideoSAGE: Video Summarization with Graph Representation Learning Apr 14, 2024 Graph Representation Learning Node Classification
Code Code Available 2 Enhancing Video Summarization with Context Awareness Apr 6, 2024 Benchmarking Informativeness
Code Code Available 0 Cluster-based Video Summarization with Temporal Context Awareness Apr 6, 2024 Clustering Unsupervised Video Summarization
Code Code Available 0 Scaling Up Video Summarization Pretraining with Large Language Models Apr 4, 2024 Video Alignment Video Summarization
— Unverified 0